Sherpa

Speech-to-text server framework with next-gen Kaldi


sherpa

sherpa is an open-source speech-to-text inference framework built on PyTorch. It focuses exclusively on end-to-end (E2E) models, namely transducer- and CTC-based models, and provides both C++ and Python APIs.

This project focuses on deployment, i.e., using pre-trained models to transcribe speech. If you are interested in how to train or fine-tune your own models, please refer to icefall.

We also have similar projects that don't depend on PyTorch: sherpa-onnx and sherpa-ncnn. Both also support iOS, Android, and embedded systems.

Installation and Usage

Please refer to the documentation at https://k2-fsa.github.io/sherpa/

Try it in your browser

Try sherpa from within your browser without installing anything: https://huggingface.co/spaces/k2-fsa/automatic-speech-recognition
