Manibox
ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Install / Use
/learn @thkkk/ManiboxREADME
ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
- 📝Paper
- 🌍Project Page: Spatial Generalization and its Scaling Laws~
- 🛢️Dataset
- 👏Contributors: Hengkai Tan, Xuezhou Xu, Chengyang Ying, Xinyi Mao
The code of the simulator part (including teacher policy and data generation) could be seen in sim branch.
Installation Instructions
conda deactivate
conda create -n manibox python=3.9
conda activate manibox
pip install "setuptools<60" numpy==1.22.4
pip install -e .
# for student inference code, you should install these in `isaac lab conda env`:
pip install einops
pip install torch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1
Dataset
The full space version dataset is in 🛢️Download link. It has a total of 38,150 trajectories, which you should rename to integration.pkl in any directory.
Student Training Instructions
The train.py will will read integration.pkl in the --dataset directory as a dataset. integration.pkl is a dict containing three keys, image_data, qpos_data, action_data, with shape (num_episodes, episode_len , dim).
# BBOX RNN
python ManiBox/train.py --policy_class RNN --batch_size 128 --dataset ../ --num_episodes 38000 --loss_function l1 --rnn_layers 3 --rnn_hidden_dim 512 --actor_hidden_dim 512 --num_epochs 50 --lr 2e-3 --gradient_accumulation_steps 1 > train.log 2>&1
Deployment Instructions
python ManiBox/inference_real_world.py --ckpt_dir /PATH/TO/ManiBox/ckpt/2024-xx-xx_xx-xx-xxRNN --policy_class RNN --ckpt_name policy_best.ckpt
Other Instructions
Teacher Policy Training Instructions
# PPO Training
python source/standalone/workflows/rsl_rl/train.py --task Isaac-Lift-Cube-MobileAloha-v0 --num_envs 4096 --headless
Data Collection Instructions
# Collect data in simulator
python source/standalone/workflows/rsl_rl/play_collect_data.py --task Isaac-Lift-Cube-MobileAloha-Play-v0 \
--num_envs 40 --load_run 2024-09-12_17-42-24 --headless --enable_cameras
Simulator Inference Instruction
# Student policy inference in simulator
python source/standalone/workflows/rsl_rl/student_inference_orbit_multi_envs.py --task Isaac-Lift-Cube-MobileAloha-Play-v0 \
--ckpt_dir "\PATH\TO\Your\CKPT" --policy_class "RNN" --ckpt_name policy_best.ckpt --nheads 48 --num_train_step 38000 --seed 0
Acknowledgement
BibTeX
If you find our work useful for your project, please consider citing the following paper.
@article{tan2024manibox,
title={ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation},
author={Tan, Hengkai and Xu, Xuezhou and Ying, Chengyang and Mao, Xinyi and Liu, Songming and Zhang, Xingxing and Su, Hang and Zhu, Jun},
journal={arXiv preprint arXiv:2411.01850},
year={2024}
}
Related Skills
node-connect
344.1kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
96.8kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
344.1kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
344.1kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
