Craft
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
This repository contains the code used to generate the CRAFT dataset, as described in the paper:
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions <br>Tayfun Ates*, M. Samil Atesoglu*, Cagatay Yigit*, Ilker Kesen, Mert Kobas, Erkut Erdem, Aykut Erdem, Tilbe Goksun, Deniz Yuret <br>NeurIPS 2021 Submission, Datasets and Benchmarks Track
OpenReview Preprint, Project Website
The dataset is available on Zenodo.
Below is a sample from the dataset: a video of the simulation and some of the questions asked about it.

Descriptive <br>
Q: "What is the shape of the first object that collided with the tiny green circle?" A: "Triangle" <br>
Q: "After hitting the floor, does the small green triangle collide with other objects?" A: "False" <br>
Counterfactual <br>
Q: "If the tiny green circle is removed, will the small green triangle fall to the ground?" A: "False" <br>
Q: "If any of the other objects are removed, will the tiny green circle end up in the basket?" A: "True" <br>
Cause <br>
Q: "There is a tiny green circle, does it stimulate the tiny green triangle to fall to the floor?" A: "True" <br>
Q: "Does the tiny green triangle lead to the small red circle ending up in the bucket?" A: "True" <br>
Enable <br>
Q: "There is a small red circle, does it enable the tiny green circle to hit the ground?" A: "True" <br>
Q: "What is the number of objects that the small red circle allows to hit the floor?" A: "1" <br>
Prevent <br>
Q: "There is a tiny green triangle, does it hinder the tiny green circle from going into the container?" A: "True" <br>
Q: "How many objects are prevented by the tiny green triangle from falling into the basket?" A: "1"
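To make the structure of the sample above concrete, here is a minimal Python sketch that holds the ten question-answer pairs in memory and groups them by category. The `QAPair` class, its field names, and the helper functions are hypothetical and exist only for illustration; they are not the dataset's actual file format or any API from this repository. Consult the Zenodo release for the real schema.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class QAPair:
    """Hypothetical container for one question; not the dataset's real schema."""
    category: str  # "Descriptive", "Counterfactual", "Cause", "Enable" or "Prevent"
    question: str
    answer: str    # answers in the sample are strings: "True", "False", "1", "Triangle"


# The ten pairs from the sample above, copied verbatim.
SAMPLE = [
    QAPair("Descriptive", "What is the shape of the first object that collided with the tiny green circle?", "Triangle"),
    QAPair("Descriptive", "After hitting the floor, does the small green triangle collide with other objects?", "False"),
    QAPair("Counterfactual", "If the tiny green circle is removed, will the small green triangle fall to the ground?", "False"),
    QAPair("Counterfactual", "If any of the other objects are removed, will the tiny green circle end up in the basket?", "True"),
    QAPair("Cause", "There is a tiny green circle, does it stimulate the tiny green triangle to fall to the floor?", "True"),
    QAPair("Cause", "Does the tiny green triangle lead to the small red circle ending up in the bucket?", "True"),
    QAPair("Enable", "There is a small red circle, does it enable the tiny green circle to hit the ground?", "True"),
    QAPair("Enable", "What is the number of objects that the small red circle allows to hit the floor?", "1"),
    QAPair("Prevent", "There is a tiny green triangle, does it hinder the tiny green circle from going into the container?", "True"),
    QAPair("Prevent", "How many objects are prevented by the tiny green triangle from falling into the basket?", "1"),
]


def count_by_category(pairs):
    """Return a Counter mapping each category name to its number of questions."""
    return Counter(pair.category for pair in pairs)


def is_boolean_answer(pair):
    """True when the answer is one of the two boolean strings used in the sample."""
    return pair.answer in ("True", "False")


if __name__ == "__main__":
    for category, n in count_by_category(SAMPLE).items():
        print(f"{category}: {n} question(s)")
    n_bool = sum(is_boolean_answer(p) for p in SAMPLE)
    print(f"{n_bool} of {len(SAMPLE)} sample answers are boolean")
```

In this sample each of the five categories contributes two questions, and the answers mix boolean strings with counts and attribute values, so an evaluation script would likely need to treat the answer as a categorical label rather than a fixed type.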
About the Codebase
Simulator
The data_generation folder contains the simulator that we used to run and render 2D physics simulations. Its output is the simulation data used to generate questions. See that folder for detailed explanations.
Question Generation
The question_generation folder contains the scripts used to generate the dataset by running simulations and generating questions. See that folder for detailed explanations.
Models
The models folder contains the training and testing scripts for the methods evaluated in our paper. See that folder for detailed explanations.
