Kgdata
Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)

KGData is a library for processing dumps of Wikipedia, Wikidata, and DBpedia. What it can do:
- Clean up the dumps to ensure the data is consistent (resolve redirects, remove dangling references)
- Create embedded key-value databases to access entities from the dumps.
- Extract Wikidata ontology.
- Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
- Create Pyserini indices to search Wikidata’s entities.
- and more
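To illustrate what "embedded key-value databases to access entities" means in general (this is a hypothetical sketch using Python's stdlib `sqlite3`, not kgdata's actual API): each entity record is serialized and keyed by its Wikidata QID, so lookups need only local files, no database server.

```python
import json
import sqlite3

# Illustrative stand-in for an embedded key-value store; kgdata's own
# database layer is not shown here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE entities (id TEXT PRIMARY KEY, value TEXT)")

# Store a toy Wikidata-style entity record keyed by its QID.
entity = {"id": "Q30", "label": "United States", "aliases": ["USA", "US"]}
conn.execute(
    "INSERT INTO entities VALUES (?, ?)",
    (entity["id"], json.dumps(entity)),
)

# Look up an entity by ID, the access pattern such databases provide.
row = conn.execute(
    "SELECT value FROM entities WHERE id = ?", ("Q30",)
).fetchone()
record = json.loads(row[0])
print(record["label"])  # → United States
```

In practice the store would be built once from a cleaned dump and then opened read-only by downstream tasks.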
For full documentation, please see the website.
Installation
From PyPI (using pre-built binaries):
pip install kgdata[spark]  # omit [spark] to install Spark yourself if your cluster runs a different version