GraphDoc
Graph-based Document Structure Analysis
🏡 Project Homepage
This is the official repository for our ICLR 2025 paper Graph-based Document Structure Analysis. For more results and details, please visit our project homepage.
🔎 Introduction
We construct GraphDoc, a relation graph-based dataset for document structure analysis. It enables training models on tasks such as reading order prediction, hierarchical structure analysis, and complex inter-element relation inference. We further propose a document relation graph generator (DRGG) to address these tasks.
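To make the idea of a document relation graph concrete, here is a minimal Python sketch. It is not the GraphDoc data format or the DRGG implementation, neither of which has been released yet. The class names (`Element`, `RelationGraph`), the relation labels (`"reading_order"`, `"parent_child"`), the category names, and the bounding boxes are all hypothetical, chosen only to illustrate how layout elements can be nodes and typed relations can be edges.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    """A layout element on a page (hypothetical schema, not the GraphDoc format)."""
    id: int
    category: str   # e.g. "title", "paragraph"; label names are assumptions
    bbox: tuple     # (x0, y0, x1, y1) in pixels


@dataclass
class RelationGraph:
    """Elements as nodes, typed relations as directed edges."""
    elements: dict = field(default_factory=dict)
    edges: list = field(default_factory=list)   # (src_id, dst_id, relation)

    def add_element(self, element):
        self.elements[element.id] = element

    def add_relation(self, src, dst, relation):
        if src not in self.elements or dst not in self.elements:
            raise KeyError("both endpoints must be added before linking them")
        self.edges.append((src, dst, relation))

    def reading_order(self):
        """Follow 'reading_order' edges from the element with no predecessor."""
        successor = {s: d for s, d, r in self.edges if r == "reading_order"}
        has_predecessor = set(successor.values())
        starts = [i for i in successor if i not in has_predecessor]
        if not starts:
            return []
        order, seen, current = [], set(), starts[0]
        while current is not None and current not in seen:  # guard against cycles
            order.append(current)
            seen.add(current)
            current = successor.get(current)
        return order

    def children(self, parent):
        """Direct children of an element in the hierarchy."""
        return [d for s, d, r in self.edges if s == parent and r == "parent_child"]


# A toy page: a title, a paragraph, and a figure with its caption.
graph = RelationGraph()
graph.add_element(Element(0, "title", (50, 40, 550, 80)))
graph.add_element(Element(1, "paragraph", (50, 100, 550, 300)))
graph.add_element(Element(2, "figure", (50, 320, 550, 600)))
graph.add_element(Element(3, "caption", (50, 610, 550, 640)))

for src, dst in [(0, 1), (1, 2), (2, 3)]:
    graph.add_relation(src, dst, "reading_order")
for src, dst in [(0, 1), (0, 2), (2, 3)]:
    graph.add_relation(src, dst, "parent_child")

print(graph.reading_order())
print(graph.children(0))
```

Keeping every relation type in a single edge list means reading order and hierarchy are just different views over the same graph, so adding a further relation type in this sketch requires only a new label.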
<p align="center"> <img src="assets/GraphDoc.png" width="480" /> </p>
📝 Catalog
- [ ] Graph-based Document Structure Dataset
- [ ] DRGG Model Checkpoints
- [ ] DRGG Model Training Code
- [ ] DRGG Model Evaluation Code
📦 Code and Implementations
Code and implementation details will be released soon!
🌳 Citation
If you find this code useful for your research, please consider citing:
@inproceedings{chen2025graphdoc,
title={Graph-based Document Structure Analysis},
author={Yufan Chen and Ruiping Liu and Junwei Zheng and Di Wen and Kunyu Peng and Jiaming Zhang and Rainer Stiefelhagen},
booktitle={ICLR},
year={2025}
}
