SkillAgentSearch skills...

Kgdata

Library to process dumps of knowledge graphs (Wikipedia, DBpedia, Wikidata)

Install / Use

/learn @binh-vu/Kgdata
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

kgdata PyPI Documentation

KGData is a library to process dumps of Wikipedia, Wikidata. What it can do:

  • Clean up the dumps to ensure the data is consistent (resolve redirect, remove dangling references)
  • Create embedded key-value databases to access entities from the dumps.
  • Extract Wikidata ontology.
  • Extract Wikipedia tables and convert the hyperlinks to Wikidata entities.
  • Create Pyserini indices to search Wikidata’s entities.
  • and more

For a full documentation, please see the website.

Installation

From PyPI (using pre-built binaries):

pip install kgdata[spark]   # omit spark to manually specify its version if your cluster has different version

Related Skills

View on GitHub
GitHub Stars9
CategoryDevelopment
Updated2mo ago
Forks1

Languages

Jupyter Notebook

Security Score

90/100

Audited on Feb 5, 2026

No findings