Results for "speech-toolkit"

153 skills found · Page 1 of 6

coqui-ai / TTS

45.0k

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

universal

deep-learning, glow-tts, hifigan, +16

Updated 3h ago

modelscope / FunASR

15.5k

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

universal

audio-visual-speech-recognition, conformer, dfsmn, +12

Updated 1h ago

PaddlePaddle / PaddleSpeech

12.6k

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

universal

asr, code-switch, conformer, +17

Updated 10h ago

speechbrain / Speechbrain

11.4k

A PyTorch-based Speech Toolkit

universal

asr, audio, audio-processing, +17

Updated 10h ago

espnet / Espnet

9.8k

End-to-End Speech Processing Toolkit

universal

chainer, deep-learning, end-to-end, +13

Updated 2h ago

open-mmlab / Amphion

9.7k

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

universal

audio-generation, audio-synthesis, audioldm, +14

Updated 22h ago

flashlight / Wav2letter

6.4k

Facebook AI Research's Automatic Speech Recognition Toolkit

universal

cpp, deep-learning, end-to-end, +2

Updated 3d ago

wenet-e2e / Wenet

5.1k

Production First and Production Ready End-to-End Speech Recognition Toolkit

universal

asr, automatic-speech-recognition, conformer, +6

Updated 1d ago

modelscope / ClearerVoice Studio

4.0k

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

universal

audio, bandwidth-extension, deep-learning, +8

Updated 5h ago

coqui-ai / STT

2.6k

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

universal

asr, automatic-speech-recognition, deep-learning, +7

Updated 5d ago

s3prl / S3prl

2.5k

Self-Supervised Speech Pre-training and Representation Learning Toolkit

universal

apc, cpc, data2vec, +17

Updated 3d ago

mravanelli / Pytorch Kaldi

2.4k

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

universal

asr, deep-learning, deep-neural-networks, +14

Updated 5d ago

NVIDIA / OpenSeq2Seq

1.6k

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

universal

deep-learning, float16, language-model, +11

Updated 3d ago

alumae / Kaldi Gstreamer Server

1.1k

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

universal

speech-recognition

Updated 4d ago

freewym / Espresso

940

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

universal

asr, end-to-end, fairseq, +4

Updated 1mo ago

ina-foss / InaSpeechSegmenter

879

CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

universal

audio-analysis, female, gender, +17

Updated 6h ago