Yap
๐๏ธ Vibe typing with your voice. Local real-time speech-to-text that auto-types into any app. Powered by MLX on Apple Silicon. No cloud, no cap.
Install / Use
/learn @TorchFun-AI/YapREADME
๐ฌ Demo
๐ฅ Demo video coming soon โ stay tuned!
<!-- Optional: embed a video showing Yap + Claude Code / Cursor workflow --> <!-- https://github.com/user-attachments/assets/agentic-workflow.mp4 -->๐ก Inspired by the agentic coding movement โ like OpenClaw's founder voice-chatting with 10+ agents to build software. Yap is the missing input layer that makes talking to your dev tools feel native.
๐ค Why Yap?
The agentic coding era is here. You're talking to Claude Code, Cursor, Copilot โ but you're still typing every prompt with your fingers.
Your voice is 3x faster than your keyboard. Yap bridges the gap.
- ๐ฃ๏ธ Voice-first workflow โ Talk to your agents, your terminal, your browser. Yap types it out.
- ๐ 100% local โ On-device VAD + ASR via MLX. No cloud. No data leaves your machine.
- ๐ Multilingual โ Speak Chinese, English, Japanese, Korean, and more. Real-time translation built in.
- โจ Smart correction โ LLM-powered spoken โ written style conversion. Your voice, but polished.
โก How It Works
Yap lives as a floating ball on your screen. Toggle input mode, and it listens:
๐๏ธ Voice โโโ ๐ VAD โโโ ๐ง ASR โโโ ๐ฌ LLM โโโ โจ๏ธ Input
Silero MLX Correct Types into
detects on-device & Translate active app
speech transcribe (optional)
Models auto-download from HuggingFace on first launch. Zero config to get started.
โจ Features
| | Feature | Description | |---|---------|-------------| | ๐๏ธ | Multilingual Voice Input | Chinese, English, Japanese, and more โ switch on the fly | | ๐ | Real-time Translation | Speak in one language, type in another | | โ๏ธ | Formal Correction | Spoken โ written style, powered by any LLM | | ๐ฅ๏ธ | Universal Input | Works with any app โ Claude Code, Cursor, VS Code, Terminal, browser, Slack... | | ๐ซง | Floating Ball UI | Always-on-top, draggable, with live waveform visualization | | ๐ | Fully Local | On-device ASR, no cloud dependency, your data stays yours | | ๐ | i18n Menu | ไธญๆ / English interface |
๐ Quick Start
Prerequisites
- macOS with Apple Silicon (M1/M2/M3/M4)
- Node.js 18+
- Python 3.10 โ 3.12
Rust and uv will be installed automatically by the setup script if missing.
Development
git clone https://github.com/TorchFun-AI/Yap.git && cd Yap
# One-click setup (install all dependencies + dev environment)
./setup.sh
# Terminal 1 โ Python AI backend
cd src-backend && uv run python main.py
# Terminal 2 โ Tauri + Vue dev server
make dev
Production Build
# Build .app bundle (compiles backend + Tauri app)
./build.sh
Output in src-tauri/target/release/bundle/.
๐๏ธ Architecture
โโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโ
โ Vue 3 UI โโโโโโบโ Tauri Core โโโโโโบโ Python AI โ
โ (Webview) โ IPC โ (Rust) โ WS โ (FastAPI) โ
โโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโ โโโโโโโโโโโโโโโโโโโ
โ โ
โผ โผ
โโโโโโโโโโโโโ โโโโโโโโโโโโโ
โ Keyboard โ โ VAD + ASR โ
โ Simulationโ โ + LLM โ
โโโโโโโโโโโโโ โโโโโโโโโโโโโ
| Layer | Stack | |-------|-------| | Frontend | Vue 3 + TypeScript + Ant Design Vue + Pinia | | Core | Tauri 2 (Rust) | | Backend | Python + FastAPI + Silero VAD + MLX Audio |
๐ง LLM Configuration
Yap uses any OpenAI-compatible API for text correction and translation. Configure in Settings:
- API Key
- Base URL (e.g.
https://api.openai.com/v1, or a local Ollama endpoint) - Model name
This is optional โ without it, Yap still does voice-to-text perfectly fine.
๐ License
CC BY-NC 4.0 โ Free to use, modify, and share. Not for commercial use.
Related Skills
node-connect
342.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
85.3kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
342.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
342.5kQQBot ๅฏๅชไฝๆถๅ่ฝๅใไฝฟ็จ <qqmedia> ๆ ็ญพ๏ผ็ณป็ปๆ นๆฎๆไปถๆฉๅฑๅ่ชๅจ่ฏๅซ็ฑปๅ๏ผๅพ็/่ฏญ้ณ/่ง้ข/ๆไปถ๏ผใ
