RoboManipBaselines

A software framework integrating various imitation learning methods and benchmark environments for robotic manipulation

Generate Convert Improve

Install / Use

/learn @isri-aist/RoboManipBaselines

About this skill

Quality Score

0/100

README

🤖 RoboManipBaselines

<a href="https://isri-aist.github.io/RoboManipBaselines-ProjectPage/">Project Page</a> | <a href="https://arxiv.org/abs/2509.17057">arXiv</a>

A software framework integrating various imitation learning methods and benchmark environments for robotic manipulation.
Provides easy-to-use baselines for policy training, evaluation, and deployment.

https://github.com/user-attachments/assets/1ad3dcbe-bb4c-4711-96c3-5e196e9d067b

🚀 Quick Start

Start collecting data in the MuJoCo simulation, train your model, and rollout the ACT policy in just a few steps!
📄 See the Quick Start Guide.

⚙️ Installation

Follow our step-by-step Installation Guide to get set up smoothly.

🧠 Policies

We provide several powerful policy architectures for manipulation tasks:

🔹 MLP: Simple feedforward policy
🔹 SARNN: Recurrent policy for sequential data
🔹 ACT: Transformer-based action chunking policy
🔹 MT-ACT: Multi-task Transformer-based imitation policy
🔹 Diffusion Policy: Diffusion-based imitation policy
🔹 3D Diffusion Policy: Diffusion-based policy with 3D point cloud input
🔹 Flow Policy: Flow-matching-based policy with 3D point cloud input
🔹 ManiFlow Policy: Flow-matching and consistency-based policy with 2D/3D vision
🔹 pi0: Flow-matching-based policy with a vision-language model backbone (PaliGemma)
🔹 GR00T: Diffusion Transformer-based policy with a vision-language model backbone (Eagle-2)

📦 Data

📂 Dataset List: Pre-collected expert demonstration datasets
🧠 Learned Parameters: Trained model checkpoints and configs
📄 Data Format: Description of the custom RMB data format used in RoboManipBaselines
🪄 Point Cloud Preprocessing: Data preprocessing for 3D point cloud policies

🎮 Teleoperation

Use your own teleop interface to collect expert data.
See Teleop Tools for more info.

🎮 Multiple SpaceMouse: Setup multiple SpaceMouse for high-degree-of-freedom robots

🌍 Environments

Explore diverse manipulation environments:

📚 Environment Catalog: Overview of all task environments
🔧 Env Setup: Installation guides per environment
✏️ How to Add a New Environment: Guide for adding a custom environment
🔅️ MuJoCo Tactile Sensor: Guide for using tactile sensors in MuJoCo environments

🧰 Miscellaneous

Check out Misc Scripts for standalone tools and utilities.

📊 Evaluation Results

See Benchmarked Performance across environments and policies.

🤝 Contributing

We welcome contributions!
Check out the Contribution Guide to get started.

📄 License

This repository is licensed under the BSD 2-Clause License, unless otherwise stated.
Please check individual files or directories (especially third_party and assets) for specific license terms.

📖 Citation

If you use RoboManipBaselines in your work, please cite our paper:

@article{RoboManipBaselines_Murooka_2025,
  title={RoboManipBaselines: A Unified Framework for Imitation Learning in Robotic Manipulation across Real and Simulation Environments},
  author={Murooka, Masaki and Motoda, Tomohiro and Nakajo, Ryoichi and Oh, Hanbit and Makihara, Koshi and Shirai, Keisuke and Ogata, Tetsuya and Domae, Yukiyasu},
  journal={arXiv preprint arXiv:2509.17057},
  year={2025}
}

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

isf-agent

a repo for an agent that helps researchers apply for isf funding

last30days-skill

17.6k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary