DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Install / Use
/learn @mozilla/DeepSpeechREADME
Status
This project is now discontinued.
Project DeepSpeech
.. image:: https://readthedocs.org/projects/deepspeech/badge/?version=latest :target: https://deepspeech.readthedocs.io/?badge=latest :alt: Documentation
.. image:: https://github.com/mozilla/DeepSpeech/actions/workflows/macOS-amd64.yml/badge.svg :target: https://github.com/mozilla/DeepSpeech/actions/workflows/macOS-amd64.yml :alt: macOS builds
.. image:: https://github.com/mozilla/DeepSpeech/actions/workflows/lint.yml/badge.svg :target: https://github.com/mozilla/DeepSpeech/actions/workflows/lint.yml :alt: Linters
.. image:: https://github.com/mozilla/DeepSpeech/actions/workflows/docker.yml/badge.svg :target: https://github.com/mozilla/DeepSpeech/actions/workflows/docker.yml :alt: Docker Images
DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper <https://arxiv.org/abs/1412.5567>. Project DeepSpeech uses Google's TensorFlow <https://www.tensorflow.org/> to make the implementation easier.
Documentation for installation, usage, and training models are available on deepspeech.readthedocs.io <https://deepspeech.readthedocs.io/?badge=latest>_.
For the latest release, including pre-trained models and checkpoints, see the latest release on GitHub <https://github.com/mozilla/DeepSpeech/releases/latest>_.
For contribution guidelines, see CONTRIBUTING.rst <CONTRIBUTING.rst>_.
For contact and support information, see SUPPORT.rst <SUPPORT.rst>_.
Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
API
A learning and reflection platform designed to cultivate clarity, resilience, and antifragile thinking in an uncertain world.
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
sec-edgar-agentkit
10AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.
