ML
Koel Labs works on open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learners. This repo contains the ML training, evaluation, and data processing code.
Koel Labs - Machine Learning
Contains the EDA, training, evaluation, and data processing code for Koel Labs. Evaluation results will be made available via Hugging Face Leaderboards. Cleaned datasets and model weights will also be made available via Hugging Face. We will be releasing a paper as well so stay tuned!
Read about all our repositories here.
Setup
Check out the guides directory for standalone guides on finetuning, evaluation, dataset processing, and other topics. These can be run independently of the setup for the rest of the codebase, e.g., in a Colab notebook.
See the DEVELOPMENT.md for alternative setup instructions and details.
- git clone https://github.com/KoelLabs/ML.git
- Install Python 3.10.16
- Duplicate the .env.example file and rename it to .env. Fill in the necessary environment variables.
- Run the commands in ./scripts/install.sh, e.g., with . ./scripts/install.sh
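As a quick check on the .env step above, the following is a minimal sketch (not part of this repository) that lists variables named in .env.example that are still missing or empty in .env. The helper names parse_env and missing_vars are hypothetical, and the sketch assumes both files use plain KEY=VALUE lines with optional # comments.

```python
from pathlib import Path


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and a single layer of quotes.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_vars(example_text: str, env_text: str) -> list[str]:
    """Return keys listed in the example that are absent or empty in the env text."""
    env = parse_env(env_text)
    return [key for key in parse_env(example_text) if not env.get(key)]


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    example = Path(".env.example")
    env = Path(".env")
    if example.exists() and env.exists():
        for key in missing_vars(example.read_text(), env.read_text()):
            print(f"{key} is not set in .env")
    else:
        print("Run this from the repository root after creating .env")
```

Running it from the repository root after copying .env.example prints one line per variable that still needs a value. Lines that do not follow the plain KEY=VALUE shape (for example, ones starting with export) are not handled by this sketch.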
Contributing
Check out the CONTRIBUTING.md for specific guidelines on contributing to this repository.
License
The code in this repository is licensed under the GNU Affero General Public License.
With the exception of a few models and Hugging Face Spaces released during the builders program under the Mozilla Public License, all Hugging Face models and code will be released under the GNU Affero General Public License.
We retain all rights to the Koel Labs brand, logos, blog posts and website content.
