GPTKB
Code for KB construction from LLMs - ACL 2025 paper: "Enabling LLM Knowledge Analysis via Extensive Materialization""
Install / Use
/learn @Knowledge-aware-AI/GPTKBREADME

Welcome to GPTKB
Overview
GPTKB is a large general-domain knowledge base (KB) entirely from a large language model (LLM). It demonstrates the feasibility of large-scale KB construction from LLMs, while highlighting specific challenges arising around entity recognition, entity and property canonicalization, and taxonomy construction.
Based on GPT-4.1, GPTKB contains 100 million triples for more than 6.1 million entities, at a cost 10x less than previous KBC projects. GPTKB is a landmark for two fields:
- For NLP, for the first time, it provides constructive insights into the knowledge (or beliefs) of LLMs.
- For the Semantic Web, it shows novel ways forward for the long-standing challenge of general-domain KB construction.
This repository contains the code for KB construction from LLMs to materialize entity-centric parametric knowledge of LLMs.
Check out our project page for more details and demonstrations: https://gptkb.org/.
Getting Started
Install
Please use Python 3.11+. Install the required packages by running:
pip install -r requirements.txt
Don't forget to have your OpenAI API key in the environment variable
OPENAI_API_KEY.
Run
- See the knowledge elicitation folder for instructions on how to run the knowledge elicitation phase.
- See the knowledge consolidation folder for instructions on how to run the knowledge consolidation phase.
Citation
If you use this work please cite our paper:
@InProceedings{GPTKB,
title={Enabling LLM Knowledge Analysis via Extensive Materialization},
author={Hu, Yujia and Nguyen, Tuan-Phong and Ghosh, Shrestha and Razniewski, Simon},
year={2025},
booktitle={ACL},
}
Related Skills
node-connect
346.8kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
107.6kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
346.8kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
346.8kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
