WordSimilarityAnalogyData
用于词向量的相似性任务、类比任务的数据。包含了中文新闻语料分类的实验数据。(Data for similarity tasks and analogy tasks of word vectors. The experimental data of Chinese news corpus classification are included.)
Install / Use
/learn @CallMeJiaGu/WordSimilarityAnalogyDataREADME
WordSimilarityAnalogyData 用于验证词向量效果好坏的数据集。
词的相似性任务-Word Similarity
常用的英文数据集:WordSim-353 、MEN、SCWS
WordSim-353: http://alfonseca.org/eng/research/wordsim353.html、 http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/
MEN: https://staff.fnwi.uva.nl/e.bruni/MEN
SCWS:http://ai.stanford.edu/~ehhuang/
常用的中文数据集:wordsim-240、wordsim-297
在该仓库能找到(wordsim-240、wordsim-297)
词的类比任务-Word Analogy
常用的中文数据集:Chen 2015年构造的评测文件
在本仓库能找到。(Chen 2015年构造的评测文件)
Related Skills
node-connect
349.7kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.7kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
349.7kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
349.7kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
Security Score
Audited on Dec 9, 2025
