334 skills found · Page 1 of 12
Kyubyong / G2pg2p: English Grapheme To Phoneme Conversion
axelspringer / DeepPhonemizerGrapheme to phoneme conversion with deep learning.
open-speech / Speech Alignerspeech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
vlomme / Multi Tacotron Voice CloningPhoneme multilingual(Russian-English) voice cloning based on
GitYCC / G2pWChinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
kakaobrain / G2pmA Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
VinAIResearch / XPhoneBERTXPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
rhasspy / GruutA tokenizer, text cleaner, and phonemizer for many human languages.
dmort27 / PanphonPython package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.
morevnaproject-org / Papagayo NgPapagayo is a lip-syncing program designed to help you line up phonemes (mouth shapes) with the actual recorded sound of actors speaking. Papagayo makes it easy to lip sync animated characters by making the process very simple - just type in the words being spoken (or copy/paste them from the animation's script), then drag the words on top of the sound's waveform until they line up with the proper sounds.
itinerarium / Phoneme SynthesisA browser-based tool to convert International Phonetic Alpha (IPA) phonetic notation to speech using the meSpeak.js package
yl4579 / PL BERTPhoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
ASR-project / Multilingual PRPhoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.
Kyubyong / G2pCg2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
tiberiu44 / TTS CubeEnd-2-end speech synthesis with recurrent neural networks
ZDisket / TensorVoxDesktop application for neural speech synthesis written in C++
NRC-ILT / G2pGrapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
xinjli / Transphonephoneme tokenizer and grapheme-to-phoneme model for 8k languages
microsoft / PhoneticMatchingA phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to characters.
persephone-tools / PersephoneA tool for automatic phoneme transcription