SkillAgentSearch skills...

SimChart9K

The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.

Install / Use

/learn @InternScience/SimChart9K
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

arXiv GitHub issues PRs Welcome

SimChart9K

<div align=center> <img src="https://github.com/Uni-Modal/SimChart9K/blob/main/images/SC.png" height="100"> </div> <div align="center"> <h1>SimChart9K: An LLMs-based Simulatied Visual Chart Understanding Benchmark<br></h1> </div>

We perform data augmentation for chart perception and reasoning by leveraging an LLMs-based self-inspection data production scheme, producing the SimChart9K dataset, where the simulated dataset consists of 9,536 chart images and associated data annotations in CSV format. Besides, we observe that StructChart continuously improves the chart perception performance as more simulated charts are used for pre-training.

SimChart9K Dataset Download from google drive

Downloading the official SimChart9K dataset from google drive

SimChart9K Dataset Download from Opendatalab

a. Register an account from OpenXLab website as follows.

https://openxlab.org.cn/home

b. Install the dependent libraries as follows:

  • Install the openxlab dependent libraries.
      pip install openxlab
    
  • Obtain the Access Key and Secret Key on the OpenXLab website by clicking the button of Account Security
  • Login the OpenXLab using the Access Key and Secret Key
      openxlab login
    

c. Download the SimChart9K dataset by performing the following command:

openxlab dataset get --dataset-repo  Lonepic/SimChart9K

t-SNE comparisons with Real Chart Datasets

<p align="center"> <img src="images/t-SNE_a.png" width="62%"> <div>Feature Distribution using t-SNE of Real Datasets.</div> </p> <p align="center"> <img src="images/t-SNE_b.png" width="62%"> <div>Feature Distribution using t-SNE of both Real Datasets and SimChart9K.</div> </p>

Visualization Exapmles

<p align="center"> <img src="images/multi_task_1.png" width="85%"> <div>Visualization results using the proposed StructChart on different chart-related reasoning tasks including Question Answering (QA), Summarization, and Redrawing.</div> </p> <p align="center"> <img src="images/multi_task_2.png" width="85%"> <div>Visualization results using the proposed StructChart on different chart-related reasoning tasks including Question Answering (QA), Summarization, and Redrawing.</div> </p>

Citation

Please consider citing our work if this dataset is helpful for your research:

@article{xia2023structchart,
  title={StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding},
  author={Xia, Renqiu and Zhang, Bo and Peng, Haoyang and Ye, Hancheng and Yan, Xiangchao and Ye, Peng and Shi, Botian and Yan, Junchi and Qiao, Yu},
  journal={arXiv preprint arXiv:2309.11268},
  year={2023}
}
View on GitHub
GitHub Stars26
CategoryDevelopment
Updated3mo ago
Forks1

Security Score

72/100

Audited on Dec 6, 2025

No findings