Dsnote
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Install / Use
/learn @mkiol/DsnoteREADME
Speech Note
Linux desktop and Sailfish OS app for note taking, reading and translating with offline Speech to Text, Text to Speech and Machine Translation
<a href='https://flathub.org/apps/net.mkiol.SpeechNote'><img width='240' alt='Download on Flathub' src='https://dl.flathub.org/assets/badges/flathub-badge-en.png'/></a>
Contents of this README
- Description
- Languages and Models
- How to install
- Flatpak packages
- Beta version
- Extra features
- Building from sources
- How to enable a custom model
- Contributing to Speech Note
- How to support
- Reviews and demos
- License
Description
Speech Note let you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet.
Speech Note uses many different processing engines to do its job. Currently these are used:
- Speech to Text (STT)
- Text to Speech (TTS)
- Machine Translation (MT)
Languages and Models
Speech Note installation package does not include checkpoint files for supported models, but instead they can be easily downloaded using the graphical model browser built into the application.
Following languages and models are supported and enable for download:
| Lang ID | Name | DeepSpeech (STT) | Whisper (STT) | Vosk (STT) | April-ASR (STT) | Piper (TTS) | RHVoice (TTS) | espeak (TTS) | MBROLA (TTS) | Coqui (TTS) | Mimic3 (TTS) | WhisperSpeech (TTS) | Kokoro (TTS) | F5-TTS | Parler-TTS | S.A.M. (TTS) | Bergamot (MT) | | ----------- | ------------- | -------------------- | ----------------- | -------------- | ------------------- | --------------- | ----------------- | ---------------- | ---------------- | --------------- | ---------------- | ----------------------- | ---------------- | ---------- | -------------- | ---------------- | ----------------- | | af | Afrikaans | | ● | | | | | ● | | | ● | | | | | | | | am | Amharic | ● (e) | ● | | | | | ● | | ● | | | | | | | | | ar | Arabic | | ● | ● | | ● | | ● | ● | ● | | | | | | | ● | | az | Azerbaijani | | ● | | | | | | | | | | | | | | ● | | be | Belarusian | | ● | | | | | | | | | | | | | | ● | | bg | Bulgarian | | ● | | | | | ● | | ● | | | | | | | | | bn | Bengali | | ● | | | | | ● | | ● | ● | | | | | | | | bs | Bosnian | | ● | | | | | ● | | | | | | | | | ● | | ca | Catalan | ● | ● | ● | | ● | | ● | | ● | | | | | | | ● | | cs | Czech | ● | ● | ● | | ● | ● | ● | ● | ● | | | | | | | ● | | cy | Welsh | | | | | ● | | | | | | | | | | | | | da | Danish | | ● | | | ● | | ● | | ● | | | | | | | ● | | de | German | ● | ● | ● | | ● | | ● | | ● | ● | ● | | | ●(e) | | ● | | el | Greek | ● (e) | ● | | | ● | | ● | | ● | ● | | | | | | ● | | en | English | ● | ● | ● | ● | ● | ● | ● | | ● | ● | ● | ● | ● | ● | ● | ● | | eo | Esperanto | | | ● | | | ● | ● | | | | | | | | | | | es | Spanish | ● | ● | ● | | ● | ● | ● | | ● | ● | ● | ● | | ●(e) | | ● | | et | Estonian | ● (e) | ● | | | | | ● | ● | ● | | | | | | | ● | | eu | Basque | ● (e) | ● | | | | | ● | | ● | | | | | | | | | fa | Persian | ● | ● | ● | | ● | | ● | ● | ● | ● |
Related Skills
node-connect
336.9kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.0kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
336.9kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.0kCommit, push, and open a PR
