SolidGeo
SolidGeo: Measuring Multimodal Spatial Math Reasoning in Solid Geometry
[🌐 Homepage] [💻 Github] [🤗 Huggingface Dataset]
[📊 Leaderboard ] [🔍 Visualization] [📖 Paper]
Dataset Description
SolidGeo is the first large-scale benchmark specifically designed to evaluate the performance of multimodal large language models (MLLMs) on mathematical reasoning tasks in solid geometry. It consists of 3,113 real-world K–12 and competition-level problems, each paired with visual context and annotated with one of 3 difficulty levels and 8 fine-grained solid geometry categories. The benchmark covers a wide range of 3D reasoning subjects, such as projection, unfolding, spatial measurement, and spatial vectors, offering a rigorous testbed for assessing solid geometry reasoning. Through extensive experiments, we observe that MLLMs encounter substantial challenges in solid geometry math tasks, with a considerable performance gap relative to human capabilities on SolidGeo. We hope SolidGeo serves as a catalyst for advancing MLLMs toward deeper geometric reasoning and spatial intelligence.

Performance of six MLLMs on the SolidGeo benchmark across 8 solid geometry subjects (left), and the trade-off between accuracy and average generated token length across 25 MLLMs (right).
Dataset Examples
Some examples from our dataset.

Click to expand/collapse more examples
<details> <summary>Measurement of Solid Geometric Forms</summary><p align="center"> <img src="assets/Measurement of Solid Geometric Forms.jpg" width="50%"> <br> </p></details>
<details> <summary>Solid Shape Identification</summary><p align="center"> <img src="assets/Solid Shape Identification.jpg" width="50%"> <br> </p></details>
<details> <summary>Spatial Metric Relations</summary><p align="center"> <img src="assets/Spatial Metric Relations.jpg" width="50%"> <br> </p></details>
<details> <summary>Multi-view Projection</summary><p align="center"> <img src="assets/Multi-view Projection.jpg" width="50%"> <br> </p></details>
<details> <summary>Planar Unfolding and Configuration</summary><p align="center"> <img src="assets/Planar Unfolding and Configuration.jpg" width="50%"> <br> </p></details>
<details> <summary>Composite Solid Structures</summary><p align="center"> <img src="assets/Composite Solid Structures.jpg" width="50%"> <br> </p></details>
<details> <summary>3D Coordinate and Vector Reasoning</summary><p align="center"> <img src="assets/3D Coordinate and Vector Reasoning.jpg" width="50%"> <br> </p></details>
<details> <summary>Solid Geometry Modeling</summary><p align="center"> <img src="assets/Solid Geometry Modeling.jpg" width="50%"> <br> </p></details>

Data Format
{
"question": [string] The question text.
"choices": [list of strings] Choice options, provided for some questions.
"image": [list of strings] A list of image file paths used in the question text.
"answer": [string] The final answer to the question, provided in LaTeX format for consistent mathematical formatting.
"solution": [list of strings] An optional list of solution steps.
"answer_type": [string] The format of the answer: "single_step", "multi_step", or "choice".
"problem_type": [list of strings] A classification of the type of math problem: "Measurement of Solid Geometric Forms", "Solid Shape Identification", "Spatial Metric Relations", "Multi-view Projection", "Planar Unfolding and Configuration", "Composite Solid Structures", "3D Coordinate and Vector Reasoning", "Solid Geometry Modeling".
"complexity_level": [string] An indicator of the difficulty level: "Level 1", "Level 2", or "Level 3".
"source": [string] The origin of the question.
"qa_id": [integer] A unique identifier for the question-answer pair.
}
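The schema above can be checked mechanically before running an evaluation. The following is a minimal sketch, not part of the SolidGeo codebase: the field names and allowed values come from the format description, while the helper `validate_record`, the sample record, and its image path are hypothetical. It also assumes that `choices` and `solution` may be missing or `None` when a question has no options or worked solution.

```python
# Allowed values copied from the data format description.
ANSWER_TYPES = {"single_step", "multi_step", "choice"}
COMPLEXITY_LEVELS = {"Level 1", "Level 2", "Level 3"}
PROBLEM_TYPES = {
    "Measurement of Solid Geometric Forms",
    "Solid Shape Identification",
    "Spatial Metric Relations",
    "Multi-view Projection",
    "Planar Unfolding and Configuration",
    "Composite Solid Structures",
    "3D Coordinate and Vector Reasoning",
    "Solid Geometry Modeling",
}

# Fields treated as required here, with their expected Python types.
REQUIRED_FIELDS = {
    "question": str,
    "image": list,
    "answer": str,
    "answer_type": str,
    "problem_type": list,
    "complexity_level": str,
    "source": str,
    "qa_id": int,
}


def validate_record(record):
    """Hypothetical helper: return a list of problems (empty if none found)."""
    errors = []
    for key, expected in REQUIRED_FIELDS.items():
        if not isinstance(record.get(key), expected):
            errors.append(f"{key}: expected {expected.__name__}")
    # Assumption: optional fields may be absent or None.
    for key in ("choices", "solution"):
        value = record.get(key)
        if value is not None and not isinstance(value, list):
            errors.append(f"{key}: expected list or None")
    if record.get("answer_type") not in ANSWER_TYPES:
        errors.append("answer_type: unknown value")
    if record.get("complexity_level") not in COMPLEXITY_LEVELS:
        errors.append("complexity_level: unknown value")
    problem_type = record.get("problem_type")
    if isinstance(problem_type, list):
        unknown = [p for p in problem_type if p not in PROBLEM_TYPES]
        if unknown:
            errors.append(f"problem_type: unknown categories {unknown}")
    return errors


# Made-up record for illustration only; it is not taken from the dataset.
sample = {
    "question": "A sphere has radius 3. What is its surface area?",
    "choices": [],
    "image": ["images/example.png"],
    "answer": "36\\pi",
    "solution": [],
    "answer_type": "single_step",
    "problem_type": ["Measurement of Solid Geometric Forms"],
    "complexity_level": "Level 1",
    "source": "hypothetical",
    "qa_id": 1,
}

print(validate_record(sample))  # prints []
```

Returning a list of messages rather than raising on the first problem makes it easy to report every malformed field in a record at once.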
Citation
If you find this work useful, please cite our paper and star the repository.
@misc{wang2025solidgeomeasuringmultimodalspatial,
title={SOLIDGEO: Measuring Multimodal Spatial Math Reasoning in Solid Geometry},
author={Peijie Wang and Chao Yang and Zhong-Zhi Li and Fei Yin and Dekang Ran and Mi Tian and Zhilong Ji and Jinfeng Bai and Cheng-Lin Liu},
year={2025},
eprint={2505.21177},
archivePrefix={arXiv},
primaryClass={cs.CG},
url={https://arxiv.org/abs/2505.21177},
}
