167 skills found · Page 1 of 6
Embedding / Chinese Word Vectors100+ Chinese Word Vectors 上百种预训练中文词向量
lancopku / Pkuseg Pythonpkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
baidu / Lac百度NLP:分词,词性标注,命名实体识别,词重要性
ownthink / JiaguJiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类
wolfgarbe / SymSpellSymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
hankcs / Pyhanlp中文分词
didi / ChineseNLPDatasets, SOTA results of every fields of Chinese NLP
ckiplab / CkiptaggerCKIP Neural Chinese Word Segmentation, POS Tagging, and NER
fukuball / Jieba Php"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
lionsoul2014 / JcsegJcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and search modules for lucene,solr,elasticsearch,opensearch
messense / Jieba RsThe Jieba Chinese Word Segmentation Implemented in Rust
mammothb / SymspellpyPython port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
lionsoul2014 / FrisoHigh performance Chinese tokenizer with both GBK and UTF-8 charset support based on MMSEG algorithm developed by ANSI C. Completely based on modular implementation and can be easily embedded in other programs, like: MySQL, PostgreSQL, PHP, etc.
Moonshile / ChineseWordSegmentationChinese word segmentation algorithm without corpus(无需语料库的中文分词)
hankcs / Multi Criteria CwsSimple Solution for Multi-Criteria Chinese Word Segmentation
monpa-team / MonpaMONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
guokr / GksegYet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm
Kyubyong / G2pCg2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
Cyberbolt / CemotionA Chinese NLP library based on BERT for sentiment analysis and general-purpose Chinese word segmentation. | 基于 BERT 的中文 NLP 库,用于中文情感倾向分析、通用领域中文分词。
hemingkx / WordSegA PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .