SkillAgentSearch skills...

Seer

[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation

Install / Use

/learn @InternRobotics/Seer
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

<div align="center">

Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation

</div> <h3 align="center"> <a href="https://arxiv.org/pdf/2412.15109">Arxiv</a> | <a href="https://nimolty.github.io/Seer/">Webpage</a> </h3>

https://github.com/user-attachments/assets/49036e84-c397-4589-9024-efb05b14efa0

<br><br>

:books: Table of Contents:

  1. Highlights
  2. Getting Started
  3. Checkpoints
  4. TODO List
  5. License
  6. Citation.
  7. Acknowledgment

:fire: Highlights <a name="high"></a>

<img width="1000" alt="seer" src="assets/seer_method.jpg">
  • :trophy: SOTA simulation performance Seer achieves state-of-the-art performance on simulation benchmarks CALVIN ABC-D and LIBERO-LONG.
  • :muscle: Impressive Real-World performance Seer demonstrates strong effectiveness and generalization across diverse real-world downstream tasks.

:door: Getting Started <a name="start"></a>

We provide step-by-step guidance for running Seer in simulations and real-world experiments. Follow the specific instructions for a seamless setup.

Simulation <a name="simulation"></a>

CALVIN ABC-D <a name="calvin abc-d"></a>

LIBERO LONG <a name="libero long"></a>

Real-World<a name="real-world"></a>

Real-World (Quick Training w & w/o pre-training)<a name="real-world-qs"></a>

For users aiming to train Seer from scratch or fine-tune it, we provide comprehensive instructions for environment setup, downstream task data preparation, training, and deployment.

Real-World (Pre-training)<a name="real-world-fv"></a>

This section details the pre-training process of Seer in real-world experiments, including environment setup, dataset preparation, and training procedures. Downstream task processing and fine-tuning are covered in Real-World (Quick Training w & w/o pre-training).

:pencil2: Checkpoints <a name="checkpoints"></a>

Relevant checkpoints are available on the website. |Model|Checkpoint| |:------:|:------:| |CALVIN ABC-D|Seer (Avg.Len. : 3.98) / Seer Large (Avg.Len. : 4.30)| |Real-World|Seer (Droid Pre-trained)|

📆 TODO <a name="todos"></a>

  • [x] Release real-world expriment code.
  • [x] Release CALVIN ABC-D experiment code (Seer).
  • [x] Release the evaluation code of Seer-Large on CALVIN ABC-D experiment.
  • [x] Release the training code of Seer-Large on CALVIN ABC-D experiment.
  • [x] Release LIBERO-LONG experiment code.
  • [ ] Release simpleseer, a quick scratch training & deploying code.

License <a name="license"></a>

All assets and code are under the Apache 2.0 license unless specified otherwise.

Citation <a name="citation"></a>

If you find the project helpful for your research, please consider citing our paper:

@article{tian2024predictive,
  title={Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation},
  author={Tian, Yang and Yang, Sizhe and Zeng, Jia and Wang, Ping and Lin, Dahua and Dong, Hao and Pang, Jiangmiao},
  journal={arXiv preprint arXiv:2412.15109},
  year={2024}
}

Acknowledgment <a name="acknowledgment"></a>

This project builds upon GR-1 and Roboflamingo. We thank these teams for their open-source contributions.

Related Skills

View on GitHub
GitHub Stars288
CategoryDevelopment
Updated1d ago
Forks11

Languages

Python

Security Score

100/100

Audited on Mar 31, 2026

No findings