46 skills found · Page 1 of 2
embeddings-benchmark / MtebMTEB: Massive Text Embedding Benchmark
neubig / Lowresource Nlp Bootcamp 2020The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020
cisnlp / GlotLID💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
adbar / SimplemmaSimple multilingual lemmatizer for Python, especially useful for speed and efficiency
csebuetnlp / BanglanmtThis repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
ljvmiranda921 / CalamanCyNLP pipelines for Tagalog using spaCy
afrisenti-semeval / Afrisent Semeval 2023AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/
KennethEnevoldsen / Scandinavian Embedding BenchmarkA Scandinavian Benchmark for sentence embeddings
231sm / Reasoning In EECode and datasets for the ACL 2021 paper "OntoED: Low-resource Event Detection with Ontology Embedding"
zjunlp / RAP[SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction
zerohd4869 / SACLThe repository for ACL 2023 paper "Supervised Adversarial Contrastive Learning for Emotion Recognition in Conversations", and SemEval 2023 paper "UCAS-IIE-NLP at SemEval-2023 Task 12: Enhancing Generalization of Multilingual BERT for Low-resource Sentiment Analysis"
luciusssss / Mc2 Corpus[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
NLP-Tutorials / AACL IJCNLP2022 KGC TutorialMaterials for AACL-IJCNLP-2022 tutorial: Efficient and Robust Knowledge Graph Construction
luciusssss / ZhuangBench[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly
ElotlMX / Py ElotlPython package for Natural Language Processing (NLP), focused on low-resource languages spoken in Mexico.
kidist-amde / Amharic Ir BenchmarksOfficial codebase for the ACL 2025 Findings paper: Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval.
nicolay-r / Awesome Sentiment Attitude ExtractionA curated list of awesome sentiment analysis studies, in which attitude corresponds to the text position conveyed by Subject towards other Object mentioned in text such as: entities, events, etc.
StefanHeng / ProgGenCode for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
wannaphong / Awesome Lao NLPAwesome Lao Natural Language Processing
csebuetnlp / BanglaparaphraseThis repository contains the code, data, and associated models of the paper titled "BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset", accepted in Proceedings of the Asia-Pacific Chapter of the Association for Computational Linguistics: AACL 2022.