10 skills found
rsennrich / Subword NmtUnsupervised Word Segmentation for Neural Machine Translation and Text Generation
aalto-speech / MorfessorMorfessor is a tool for unsupervised and semi-supervised morphological segmentation
finalfusion / FinalfrontierContext-sensitive word embeddings with subwords. In Rust.
mast-group / OpenVocabCodeNLMContains the code for our ICSE 2020 paper: Big Code != Big Vocabulary: Open-Vocabulary Language Models for Source Code and for its earlier pre-print: Maybe Deep Neural Networks are the Best Choice for Modeling Source Code (https://arxiv.org/abs/1903.05734). This is the first open vocabulary language model for code that uses the byte pair encoding algorithm (BPE) to learn a segmentation of code tokens into subword units.
claravania / Subword Lstm LmLSTM Language Model with Subword Units Input Representations
aalto-speech / Subword KaldiProperly handle position-dependent phones in a subword lexicon FST
aalto-speech / FlatcatMorfessor FlatCat
Waino / Morfessor EmpruneMorfessor EM+Prune
finalfusion / Finalfusion PythonFinalfusion embeddings in Python
gozdesahin / Subword Semantic Role LabelingSemantic role labeling with subwords (character, character-ngram and morphology)