Knowledge2Data
[TASLP 2025] Spatial Knowledge Graph-Guided Synthesis for Multimodal LLMs
Table of Contents
- <a href="#news">What's New</a> •
- <a href="#overview">Overview</a> •
- <a href="#quickstart">Quickstart</a> •
- <a href="#citation">Citation</a>
🔔News
- 2025-11-01, our paper was accepted for publication as a regular paper in IEEE TASLP (Transactions on Audio, Speech and Language Processing).
- 2025-02-28, we released the paper.
🌟Overview
<div align="center"> <img src="figs/figure2.png" width="90%"> </div>

⏩Quickstart
Data
Get training data and test data from HuggingFace: https://huggingface.co/datasets/zjunlp/Knowledge2Data
Installation
git clone https://github.com/zjunlp/Knowledge2Data
cd Knowledge2Data
conda create -n skg python=3.9
conda activate skg
pip install -r requirements.txt
Download the models
Download the following models from HuggingFace:

| 🎯 Model Name | 🤗 HuggingFace |
|-------------------------------|----------------------------------------------|
| Diffusers-generation-text-box | gligen/diffusers-generation-text-box |
| Sam-vit-base | facebook/sam-vit-base |
| Stable-diffusion-xl-refiner | stabilityai/stable-diffusion-xl-refiner-1.0 |
Export the environment variables.
cd src
export OPENAI_API_KEY="YOUR_API_KEY"
export SKG_HF_MODELS="LOCAL_HUGGINGFACE_MODELS_DIR"
Generate Spatial KG and multimodal synthetic data.
Execute the script to generate the Spatial KG.
sh run_skg.sh
You can also define your own objects and their spatial relationships to form a Spatial KG. Save them as a JSON file in the same format as "src/data/skg_demo.json".
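As a rough illustration of building such a file programmatically, the sketch below writes a small spatial KG to JSON. The field names (`objects`, `relations`, `subject`, `relation`, `object`) are assumptions for illustration only; check "src/data/skg_demo.json" for the actual schema before feeding a file to the pipeline.

```python
import json

# Hypothetical spatial KG: a few objects plus pairwise spatial relations.
# NOTE: this schema is assumed for illustration -- mirror the structure of
# src/data/skg_demo.json in the repository for real runs.
skg = {
    "objects": ["cat", "table", "lamp"],
    "relations": [
        {"subject": "cat", "relation": "on top of", "object": "table"},
        {"subject": "lamp", "relation": "next to", "object": "table"},
    ],
}

# Save in the same JSON layout, ready to pass as a custom input file.
with open("my_skg.json", "w") as f:
    json.dump(skg, f, indent=2)
```

The resulting "my_skg.json" would then stand in for the demo file once its keys are aligned with the repository's expected format.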
Execute the script to generate multimodal synthetic data.
sh run_data.sh
For custom data, only the input-file parameter "--input_file" needs to be modified.
By default, generated data is saved in "src/data" and images in "src/img_generations". To generate more data, adjust the parameters "--num_scenes" (generate_scenes.py) and "--repeats" (generate_images.py).
🌻Acknowledgement
This project builds on open-source projects including LLM-groundedDiffusion. Thanks for their great contributions!
🚩Citation
Please cite the following paper if you use this project in your work.
@misc{xue2025spatialknowledgegraphguidedmultimodal,
title={Spatial Knowledge Graph-Guided Multimodal Synthesis},
author={Yida Xue and Zhen Bi and Jinnan Yang and Jungang Lou and Huajun Chen and Ningyu Zhang},
year={2025},
eprint={2505.22633},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.22633},
}