DensePolicy

[ICCV 2025] :bouquet: Dense Policy: Bidirectional Autoregressive Learning of Actions :rocket:DSP

Generate Convert Improve

Install / Use

/learn @Selen-Suyue/DensePolicy

About this skill

Quality Score

0/100

README

Dense Policy: Bidirectional Autoregressive Learning of Actions

Authors: <a href="https://selen-suyue.github.io" style="color: maroon; text-decoration: none; font-style: italic;">Yue Su*</a>, <a href="https://scholar.google.com/citations?user=WurpqEMAAAAJ&hl=en" style="color: maroon; text-decoration: none; font-style: italic;">Xinyu Zhan*</a>, <a href="https://tonyfang.net/" style="color: maroon; text-decoration: none; font-style: italic;">Hongjie Fang</a>, <a href="https://hanxue.me/" style="color: maroon; text-decoration: none; font-style: italic;">Han Xue</a>, <a href="https://fang-haoshu.github.io/" style="color: maroon; text-decoration: none; font-style: italic;">Haoshu Fang</a>, <a href="https://dirtyharrylyl.github.io/" style="color: maroon; text-decoration: none; font-style: italic;">Yong-Lu Li</a>, <a href="https://www.mvig.org/" style="color: maroon; text-decoration: none; font-style: italic;">Cewu Lu</a>, <a href="https://lixiny.github.io/" style="color: maroon; text-decoration: none; font-style: italic;">Lixin Yang†</a>

teaser

🛫 Getting Started

This is the 3D version of Dense Policy, you can also refer 2D Dense Policy Code here.

You can also refer DSPv2, an effective and generalizable dense policy, can deploy for whole-body mobile manipulation.

💻 Installation

Please following the installation guide to install the dsp conda environments and the dependencies, as well as the real robot environments. Also, remember to adjust the constant parameters in dataset/constants.py and utils/constants.py according to your own environment.

📷 Calibration

Please calibrate the camera(s) with the robot before data collection and evaluation to ensure correct spatial transformations between camera(s) and the robot. Please refer to calibration guide for more details.

🌠 DSP in the Community

RoboMIND 2.0, where dsp shows strong transferability and generalization capabilities across diverse robotic platforms, making it suitable for open-world, multi-robot manipulation scenarios.
DSPv2, where dsp is used for whole-body mobile manipulation.

🛢️ Data Collection

You can view the sampled data (cut task) from this link, which contains task data (one trajectory for instance). You can ignore other files since they are for MBA We follow the data managemnet pattern as RH20T.

Task_name
`-- train/
    |-- [episode identifier 1]
    |   |-- metadata.json              # metadata
    |   |-- timestamp.txt              # calib timestamp  
    |   |-- cam_[serial_number 1]/    
    |   |   |-- color                  # RGB
    |   |   |   |-- [timestamp 1].png
    |   |   |   |-- [timestamp 2].png
    |   |   |   |-- ...
    |   |   |   `-- [timestamp T].png
    |   |   |-- depth                  # depth
    |   |   |   |-- [timestamp 1].png
    |   |   |   |-- [timestamp 2].png
    |   |   |   |-- ...
    |   |   |   `-- [timestamp T].png
    |   |   |-- tcp                    # tcp
    |   |   |   |-- [timestamp 1].npy
    |   |   |   |-- [timestamp 2].npy
    |   |   |   |-- ...
    |   |   |   `-- [timestamp T].npy
    |   |   `-- gripper_command        # gripper command
    |   |       |-- [timestamp 1].npy
    |   |       |-- [timestamp 2].npy
    |   |       |-- ...
    |   |       `-- [timestamp T].npy
    |   `-- cam_[serial_number 2]/     # similar camera structure
    `-- [episode identifier 2]         # similar episode structure

🧑🏻‍💻 Training

conda activate dsp
bash train.sh

🤖 Evaluation

Please follow the deployment guide to modify the evaluation script.

Modify the arguments in eval.sh, then

conda activate dsp
bash eval.sh

✍️ Citation

@InProceedings{DSP,
    author    = {Su, Yue and Zhan, Xinyu and Fang, Hongjie and Xue, Han and Fang, Hao-Shu and Li, Yong-Lu and Lu, Cewu and Yang, Lixin},
    title     = {Dense Policy: Bidirectional Autoregressive Learning of Actions},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {14486-14495}
}

📃 License

<a property="dct:title" rel="" rel="cc:attributionURL" href="https://selen-suyue.github.io/DspNet/">DSP</a> is licensed under <a href="https://creativecommons.org/licenses/by-nc-sa/4.0/?ref=chooser-v1" target="_blank" rel="license noopener noreferrer" style="display:inline-block;">CC BY-NC-SA 4.0<img style="height:22px!important;margin-left:3px;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/cc.svg?ref=chooser-v1" alt=""><img style="height:22px!important;margin-left:3px;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/by.svg?ref=chooser-v1" alt=""><img style="height:22px!important;margin-left:3px;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/nc.svg?ref=chooser-v1" alt=""><img style="height:22px!important;margin-left:3px;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/sa.svg?ref=chooser-v1" alt=""></a>

Related Skills

proje

Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

best-practices-researcher

The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app

research_rules

Research & Verification Rules Quote Verification Protocol Primary Task "Make sure that the quote is relevant to the chapter and so you we want to make sure that we want to have it identifie

Selen-Suyue

View profile

View on GitHub

GitHub Stars78

CategoryEducation

Updated16d ago

Forks2

Selen-Suyue/DensePolicy

Languages

Python

Security Score

80/100

Audited on Mar 18, 2026

No findings