# RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments

Haisheng Su<sup>1,2</sup>, Feixiang Song<sup>2</sup>, Cong Ma<sup>2</sup>, Wei Wu<sup>2,3</sup>, Junchi Yan<sup>1 :email:</sup>
<sup>1</sup> School of Computer Science and School of AI, SJTU
<sup>2</sup> SenseAuto Research, <sup>3</sup> Tsinghua University
<sup>:email:</sup> Corresponding author, yanjunchi@sjtu.edu.cn
<a href="https://arxiv.org/abs/2408.15503"><img src="https://img.shields.io/badge/arXiv-Paper-red"></a>
RoboSense is a large-scale multimodal dataset constructed to facilitate egocentric robot perception, especially in crowded and unstructured environments.
## Table of Contents
- [News](#news)
- [Key Features](#keyfeatures)
- [Sensor Setup and Coordinate System](#sensor-setup-and-coordinate-system)
- [Dataset Example](#dataset-example)
- [Getting Started](#gettingstarted)
- [Tools](#tools)
- [Evaluation](#evaluation)
- [License](#license)
- [Contact](#contact)
- [Citation](#citation)
## News 📰 <a name="news"></a>
- [2025/06/05]: 🤖 <span style="color: #B91C1C; font-weight: bold;">RoboSense dataset released, including training/validation splits.</span>
- [2025/03/09]: Our paper has been accepted to CVPR 2025 [Poster]!
- [2024/08/25]: We released our paper on arXiv. Code and dataset are coming soon. Please stay tuned! ☕️
## Key Features 🔑 <a name="keyfeatures"></a>
- 133K+ synchronized frames of 4C+4F+4L (4 cameras, 4 fisheye cameras, and 4 LiDARs) sensor data.
- 1.4M+ 3D bounding boxes and IDs annotated in the full 360° view.
- 7.6K temporal sequences across 6 kinds of target domains (i.e., scenic spots, parks, squares, campuses, streets and sidewalks).
- 216K+ trajectories of objects.
- 270x and 18x as many annotations of near-field obstacles as KITTI and nuScenes.
- 6 benchmarks for both perception and prediction tasks.
## Sensor Setup and Coordinate System
<div align="center"> <img src="assets/robosense.png" width="700"/> </div> <!-- <div style="max-width: 100%; overflow-x: auto; margin: 0 auto; !important;"> <table style="border-collapse: collapse; border-spacing: 0; width: 100%; table-layout: fixed;"> <tr style="border: none;"> <td align="center" style="border: none; padding: 10px;"> <img src="assets/robosense.png" alt="Sensor Setup" width="500" style="border-radius: 10px; box-shadow: 0 4px 8px rgba(0, 0, 0, 0.1);"> <p><b>Contact-rich Manipulation</b></p> </td> <td align="center" style="border: none; padding: 10px;"> <img src="assets/example.png" alt="Contact-rich Manipulation" width="5000" style="border-radius: 10px; box-shadow: 0 4px 8px rgba(0, 0, 0, 0.1);"> <p><b>Dataset Example</b></p> </td> </tr> </table> </div> -->Dataset Example
<div align="center"> <img src="assets/example.png" width="600"/> </div> <!-- ## Dataset Comparison <div align="center"><img src="assets/comparison.png" width="900"/></div> -->Getting started 🔥 <a name="gettingstarted"></a>
### Installation <a name="installation"></a>
- Download our source code:

```bash
git clone https://github.com/suhaisheng/RoboSense.git
cd RoboSense
```
### How to Get Started with Our RoboSense Data <a name="preaparedata"></a>
- Download the data from our [HuggingFace page](https://huggingface.co/datasets/suhaisheng0527/RoboSense); a Python alternative to the CLI is sketched after the command.

```bash
huggingface-cli download --resume-download --repo-type dataset suhaisheng0527/RoboSense --local-dir ./
```
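If you prefer to stay in Python, the same download can be done with the official `huggingface_hub` API (a minimal sketch; recent versions of the library resume interrupted downloads automatically):

```python
# Minimal Python equivalent of the CLI download above, using the official
# huggingface_hub package (pip install huggingface_hub).
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="suhaisheng0527/RoboSense",
    repo_type="dataset",
    local_dir="./",
)
```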
- Combine the split parts of the image and LiDAR&OCC archives respectively (a quick integrity check is sketched after the commands):
```bash
cd dataset
cat image_trainval_part_* > image_trainval.tar.gz
cat lidar_occ_trainval_part_* > lidar_occ_trainval.tar.gz
rm image_trainval_part_*
rm lidar_occ_trainval_part_*
tar -xzf image_trainval.tar.gz
tar -xzf lidar_occ_trainval.tar.gz
cd ..
```
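If extraction fails, a part file was likely missing or corrupted during download. A minimal sanity check in Python (run from `dataset/`) peeks at the first entry of each merged archive without scanning the whole file:

```python
# Verify that each merged archive is a readable gzip'd tar by reading its
# first member only -- cheap even for very large files.
import tarfile

for archive in ("image_trainval.tar.gz", "lidar_occ_trainval.tar.gz"):
    with tarfile.open(archive, mode="r:gz") as tar:
        first = tar.next()  # TarInfo of the first entry, or None if empty
        print(f"{archive}: first entry -> {first.name if first else 'EMPTY'}")
```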
## Tools <a name="tools"></a>
- Visualize a dataset sample; for a quick ad-hoc look at raw LiDAR, see the sketch after this command.

```bash
cd scripts
python visualize_dataset.py
```
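For a quick bird's-eye-view check of a single LiDAR sweep without the full visualization script, something like the following may suffice. Note that the file path and the KITTI-style float32 `(x, y, z, intensity)` `.bin` layout are assumptions for illustration; adapt both to the dataset's actual storage format:

```python
# Ad-hoc BEV scatter of one LiDAR sweep. ASSUMPTIONS: a hypothetical file path
# and KITTI-style float32 (x, y, z, intensity) .bin storage -- adapt both to
# the actual RoboSense layout.
import numpy as np
import matplotlib.pyplot as plt

points = np.fromfile("sample_sweep.bin", dtype=np.float32).reshape(-1, 4)
plt.scatter(points[:, 0], points[:, 1], s=0.2, c=points[:, 2], cmap="viridis")
plt.gca().set_aspect("equal")
plt.xlabel("x (m)")
plt.ylabel("y (m)")
plt.title("BEV of one LiDAR sweep (color = height)")
plt.show()
```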
- Generate the occupancy labels.

```bash
cd scripts
python occ_label_gen.py
```
## Evaluation <a name="evaluation"></a>
<!-- <div align="center"><img src="assets/benchmark_setup.png" width="900"/></div> -->- Evaluating detection models with our proposed metrics (CCDP: Closest- Collision Distance Proportion matching function).
```bash
cd evaluation/det_metrics
python robosense_evaluator.py
```
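Our reading of the metric's name is that boxes are matched by the distance from the ego origin to the closest point of each box (the collision-relevant quantity) rather than by center distance; the authoritative definition is in `evaluation/det_metrics/robosense_evaluator.py`. A self-contained sketch of that distance for an oriented BEV box:

```python
# Conceptual sketch only, not the repo's implementation: distance from the
# ego origin (0, 0) to the nearest point of an oriented BEV box.
import numpy as np

def closest_collision_distance(cx, cy, dx, dy, yaw):
    """Box center (cx, cy), extents (dx, dy), heading yaw in radians."""
    # Express the ego origin in the box's local (axis-aligned) frame.
    lx = np.cos(yaw) * (0.0 - cx) + np.sin(yaw) * (0.0 - cy)
    ly = -np.sin(yaw) * (0.0 - cx) + np.cos(yaw) * (0.0 - cy)
    # Clamp to the box extents; the residual is the closest distance.
    qx = max(abs(lx) - dx / 2.0, 0.0)
    qy = max(abs(ly) - dy / 2.0, 0.0)
    return float(np.hypot(qx, qy))  # 0.0 if the origin lies inside the box

# A 4 m x 2 m box centered 5 m ahead: its near face is 3 m from the ego.
print(closest_collision_distance(cx=5.0, cy=0.0, dx=4.0, dy=2.0, yaw=0.0))  # 3.0
```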
- Evaluate motion prediction results; a sketch of the standard minADE/minFDE metrics follows the commands.

```bash
cd evaluation
python evaluate_motion.py
```
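Motion forecasting is commonly scored with minADE/minFDE over K predicted modes; the benchmark's exact protocol is defined by `evaluate_motion.py`, but a minimal sketch of these standard metrics looks like this:

```python
# Standard multi-modal forecasting metrics (sketch, not the official protocol).
import numpy as np

def min_ade_fde(pred, gt):
    """pred: (K, T, 2) forecasts for K modes; gt: (T, 2) ground-truth track."""
    dists = np.linalg.norm(pred - gt[None], axis=-1)  # (K, T) pointwise errors
    min_ade = dists.mean(axis=1).min()  # best average displacement over modes
    min_fde = dists[:, -1].min()        # best final displacement over modes
    return min_ade, min_fde

pred = np.random.randn(6, 12, 2)               # 6 modes, 12 future steps (toy)
gt = np.cumsum(np.full((12, 2), 0.5), axis=0)  # straight-line toy ground truth
print(min_ade_fde(pred, gt))
```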
- Evaluate occupancy prediction results; a sketch of a typical voxel mIoU metric follows the commands.

```bash
cd evaluation
python evaluate_occ.py
```
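Occupancy prediction is typically scored with per-class IoU averaged into mIoU over the voxel grid; the official protocol (including any visibility masking) lives in `evaluate_occ.py`. A minimal sketch under those standard assumptions:

```python
# Per-class voxel IoU / mIoU (sketch; the official evaluator may e.g. mask
# unobserved voxels or use a different class set).
import numpy as np

def occupancy_miou(pred, gt, num_classes, ignore_index=255):
    """pred/gt: integer voxel grids of identical shape holding class labels."""
    valid = gt != ignore_index
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, gt == c)[valid].sum()
        union = np.logical_or(pred == c, gt == c)[valid].sum()
        if union > 0:
            ious.append(inter / union)
    return float(np.mean(ious))

pred = np.random.randint(0, 3, size=(50, 50, 4))  # toy 3-class voxel grids
gt = np.random.randint(0, 3, size=(50, 50, 4))
print(occupancy_miou(pred, gt, num_classes=3))
```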
- For multi-object tracking evaluation, please refer to the [AB3DMOT](https://github.com/xinshuoweng/AB3DMOT) repo.
## License <a name="license"></a>
All assets and code in this repository are under the CC BY-NC-SA 4.0 license unless specified otherwise.
## Contact
If you have any questions, please contact Haisheng Su via email (suhaisheng@sjtu.edu.cn).
## Citation
If you find RoboSense useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entry.
```bibtex
@inproceedings{su2025robosense,
  title={RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments},
  author={Su, Haisheng and Song, Feixiang and Ma, Cong and Wu, Wei and Yan, Junchi},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2025}
}
```
