14 skills found
Tatoeba / Tatoeba2Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
RichardLitt / Low Resource LanguagesResources for conservation, development, and documentation of low resource (human) languages.
livingtongues / Living DictionariesSpeeding the availability of language resources for endangered languages. Tools such as this have the power to shift how we think about endangered languages. Rather than perceiving them as being antiquated, difficult to learn and on the brink of vanishing, we see them as modern, easily accessible for learning online in text and audio formats.
CoEDL / Vad Sli AsrA pipeline to isolate and transcribe one language in mixed-language speech
RichardLitt / ThesisMy thesis on "Open Source Code and Low Resource Languages" for an MSc in Language Science and Technology at Saarland University
ankitjh4 / Endangered Indian LanguagesInteractive dashboard for 141+ endangered and vulnerable languages of India — with 4700+ discovered digital resources
Halvani / AlphabeticA Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals as well as Latin script codes
ReML-AI / English Pivoted Cot[AAAI-26] Reasoning Transfer for an Extremely Low-Resource and Endangered Language: Bridging Languages Through Sample-Efficient Language Understanding
openimpactai / VoiceAccessVoiceAccess is an open-source project dedicated to bringing automatic speech recognition (ASR) to low-resource and endangered languages. By leveraging transfer learning, data augmentation, and community-driven data collection, we aim to democratize speech technology for linguistic communities.
TuluAI / TranslatorA translator for the Tulu language and other endangered as well languages without translator support.
iafarhan / Machine Translation For Endangered Language RevitalizationCherokee is a highly endangered Native American language spoken by the Cherokee people. We provide a way to mitigate this issue by providing a Neural Machine Translation System.
mokha / VerddVeʹrdd is an open-source dictionary editing framework with the focus on low-resourced and endangered languages. The framework is mainly built to facilitate collecting, importing, editing and exporting dictionaries while allowing the involvement of the native speakers to contribute easily to the preservation of the language and construction of the dictionary.
elderonline / ELDEREndangered Language Data Electronic Repository: A web-based ontologically-compliant collaborative linguistic data cataloguing tool.
karimongitb / KumzariFirst comprehensive dataset and translation model for the Kumzari language