Results for "fbank"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

18 skills found

yeyupiaoling / VoiceprintRecognition PaddlePaddle

307

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

universal

arcfaceecapa-tdnnpaddlepaddle+2

Updated 15d ago

narcotic-sh / Senko

243

Very fast, accurate speaker diarization

universal

audio-aidiarizationfbank+5

Updated 1d ago

csukuangfj / Kaldifeat

214

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

universal

cppfbankfeatures-extraction+7

Updated 18d ago

csukuangfj / Kaldi Native Fbank

144

Kaldi-compatible online fbank extractor without external dependencies

universal

cppfbankkaldi-compatible+2

Updated 13d ago

mcimpoi / Deep Fbanks

Deep Filter Banks for Texture Recognition, Description and Segmentation (CVPR15)

universal

Updated 8mo ago

echocatzh / Torch Mfcc

A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.

universal

filter-bankmel-spectrogramshort-time-fourier-transform+1

Updated 6mo ago

ZitengWang / Python Kaldi Features

python codes to extract MFCC and FBANK speech features for Kaldi

universal

kaldimfcc

Updated 5mo ago

a-n-rose / Build CNN Or LSTM Or CNNLSTM With Speech Features

A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LSTM models with those features.

universal

Updated 3mo ago

Magic-Bubble / SpeechProcessForMachineLearning

用于机器学习的语音特征提取，包含FBank和MFCC等，原理讲解和step by step的实现

universal

Updated 7mo ago

DataXujing / ASR Paper

:fire: ASR教程: https://dataxujing.github.io/ASR-paper/

universal

asrcitrinetconformer+17

Updated 7mo ago

hangtingchen / MFCC

C code to extract mfcc or fbank features from wav files

universal

Updated 2mo ago

YUCHEN005 / RATS Channel A Speech Data

This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log-Mel Fbank features and several raw wavform listening samples.

universal

Updated 5mo ago

adam2go / Mfcc

Calculate MFCC/Fbank feature for wav files

universal

Updated 1mo ago

manyeyes / KaldiNativeFbankSharp

c# wrapper for kaldi-native-fbank，used to extract audio features in speech recognition (ASR) task

universal

Updated 5mo ago

QDPeng / Kaldi NDK Feature

No description available

universal

androidfbankfeature+5

Updated 1y ago

pengzhendong / Online Fbank

No description available

universal

Updated 6mo ago

manyeyes / SpeechFeatures

A C# library for extract audio features in speech recognition (ASR) task, support kaldi fbank

universal

Updated 5mo ago

ZhongshuHou / Personalized Speech Enhancement Demo

This is a demo page of current ongoing personalized speech enhancement (pSE) project. The speaker embedding is generated through the fbank and mfcc features of enrollment speech.

zed

Updated 1y ago