Mars

MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving

Generate Convert Improve

Install / Use

/learn @OPEN-AIR-SUN/Mars

About this skill

Quality Score

0/100

README

<div align="center"><h2>MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving</h1></div> <p align="center">  <a href="https://open-air-sun.github.io/mars/"><img src="https://img.shields.io/badge/Project-Page-ffa"/></a>  <a href="https://arxiv.org/abs/2307.15058"> <img src='https://img.shields.io/badge/arXiv-Page-aff'> </a> <a href="https://pypi.org/project/mars-nerfstudio/"><img src="https://img.shields.io/badge/pypi package-0.3.4-brightgreen" alt="PyPI version"></a> <a href='https://open-air-sun.github.io/mars/static/data/MARS-poster.pdf'> <img src='https://img.shields.io/badge/Poster-PDF-pink' /> </a> </p> <div align="center"><h4>CICAI 2023 Best Paper Runner-up Award</h4></div> <div align="center"> <img alt="" src="https://github.com/OPEN-AIR-SUN/mars/assets/107318439/b4c747e8-8165-4576-94c3-666c976063c6"> </div> <div align="center"><h4>For business inquiries, please contact us at <a href="mailto:zhaohao@air.tsinghua.edu.cn">zhaohao@air.tsinghua.edu.cn</a>.</h4></div>

We have just finished a refactorization of our codebase. Now you can use pip install to start using mars instantly! Please contact us without hesitation if you encounter any issues using the latest version. Thanks!

Our related(dependent) project -- CarStudio is accepted by RAL and their code is available at GitHub. Congrats to them!

1. Installation: Setup the environment

Prerequisites

You must have an NVIDIA video card with CUDA installed on the system. This library has been tested with version 11.7 of CUDA. You can find more information about installing CUDA here.

Create environment

Nerfstudio requires python >= 3.7. We recommend using conda to manage dependencies. Make sure to install Conda before proceeding.

conda create --name mars -y python=3.9
conda activate mars

Installation

This section will walk you through the installation process. Our system is dependent on the <a href="https://github.com/NVlabs/tiny-cuda-nn">tiny-cuda-nn</a> project.

pip install mars-nerfstudio
cd /path/to/tiny-cuda-nn/bindings/torch
python setup.py install

2. Training from Scratch

The following will train a MARS model.

Our repository provides dataparser for KITTI and vKITTI2 datasets, for your own data, you can write your own dataparser or convert your own dataset to the format of the provided datasets.

From Datasets

Data Preparation

The data used in our experiments should contain both the pose parameters of cameras and object tracklets. The camera parameters include the intrinsics and the extrinsics. The object tracklets include the bounding box poses, types, ids, etc. For more information, you can refer to KITTI-MOT or vKITTI2 datasets below.

KITTI

The KITTI-MOT dataset should look like this:

.(KITTI_MOT_ROOT)
├── panoptic_maps                    # (Optional) panoptic segmentation from KITTI-STEP dataset.
│   ├── colors
│   │   └── sequence_id.txt
│   ├── train
│   │   └── sequence_id
│   │       └── frame_id.png
└── training
    ├── calib
    │   └── sequence_id.txt
    ├── completion_02                # (Optional) depth completion
    │   └── sequence_id
    │       └── frame_id.png
    ├── completion_03
    │   └── sequence_id
    │       └── frame_id.png
    ├── image_02
    │   └── sequence_id
    │       └── frame_id.png
    ├── image_03
    │   └── sequence_id
    │       └── frame_id.png
    ├── label_02
    │   └── sequence_id.txt
    └── oxts
        └── sequence_id.txt

We use a monocular depth estimation model to generate the depth maps for KITTI-MOT dataset. Here is the estimation result of 0006 sequence of KITTI-MOT datasets. You can download and put them in the KITTI-MOT/training directory.

We download the KITTI-STEP annotations and generate the panoptic segmentation maps for KITTI-MOT dataset. You can download the demo panoptic maps here and put them in the KITTI-MOT directory, or you can visit the official website of KITTI-STEP for more information.

To train a reconstruction model, you can use the following command:

ns-train mars-kitti-car-depth-recon --data /data/kitti-MOT/training/image_02/0006

or if you want to use the Python script (please refer to the launch.json file in the .vscode directory):

python nerfstudio/nerfstudio/scripts/train.py mars-kitti-car-depth-recon --data /data/kitti-MOT/training/image_02/0006

vKITTI2

The vKITTI2 dataset should look like this:

.(vKITTI2_ROOT)
└── sequence_id
    └── scene_name
        ├── bbox.txt
        ├── colors.txt
        ├── extrinsic.txt
        ├── info.txt
        ├── instrinsic.txt
        ├── pose.txt
        └── frames
            ├── depth
            │   ├── Camera_0
            │   │   └── frame_id.png
            │   └── Camera_1
            │   │   └── frame_id.png
            ├── instanceSegmentation
            │   ├── Camera_0
            │   │   └── frame_id.png
            │   └── Camera_1
            │   │   └── frame_id.png
            ├── classSegmentation
            │   ├── Camera_0
            │   │   └── frame_id.png
            │   └── Camera_1
            │   │   └── frame_id.png
            └── rgb
                ├── Camera_0
                │   └── frame_id.png
                └── Camera_1
                    └── frame_id.png

To train a reconstruction model, you can use the following command:

ns-train mars-vkitti-car-depth-recon --data /data/vkitti/Scene06/clone

or if you want to use the python script:

python nerfstudio/nerfstudio/scripts/train.py mars-vkitti-car-depth-recon --data /data/vkitti/Scene06/clone

Your Own Data

For your own data, you can refer to the above data structure and write your own dataparser, or you can convert your own dataset to the format of the dataset above.

From Pre-Trained Model

Our model uses nerfstudio as the training framework, we provide the reconstruction and novel view synthesis tasks checkpoints.

Our pre-trained model is uploaded to Google Drive, you can refer to the below table to download the model.

<center> <table class="tg"> <thead> <tr> <th>Dataset</th> <th>Scene</th> <th>Setting</th> <th>Start-End</th> <th>Steps</th> <th>PSNR</th> <th>SSIM</th> <th>Download</th> <th>Wandb</th> </tr> </thead> <tbody> <tr> <td rowspan="4">KITTI-MOT</td> <td>0006</td> <td>Reconstruction</td> <td>65-120</td> <td>400k</td> <td>27.96</td> <td>0.900</td> <td><a href="https://drive.google.com/drive/folders/118qj8GA1lnkx90yXREAwWtARJquEIn6d?usp=drive_link">model</a></td> <td><a href="https://api.wandb.ai/links/wuzirui-research/ff6tjef7">report</a></td> </tr> <tr> <td>0006</td> <td>Novel View Synthesis 75%</td> <td>65-120</td> <td>200k</td> <td>27.32</td> <td>0.890</td> <td><a href="https://drive.google.com/drive/folders/117MIMkaDhEPDhoyCAAr8o_xATj891STP?usp=drive_link">model</a></td> <td><a href="https://api.wandb.ai/links/wuzirui-research/ns8w2guc">report</a></td> </tr> <tr> <td>0006</td> <td>Novel View Synthesis 50%</td> <td>65-120</td> <td>200k</td> <td>26.80</td> <td>0.883</td> <td><a href="https://drive.google.com/drive/folders/12BnkfO6Jv33MUfBbW1s2BWfm0pAlWecX?usp=drive_link">model</a></td> <td><a href="https://api.wandb.ai/links/wuzirui-research/bk97y3mp">report</a></td> </tr> <tr> <td>0006</td> <td>Novel View Synthesis 25%</td> <td>65-120</td> <td>200k</td> <td>25.87</td> <td>0.866</td> <td><a href="https://drive.google.com/drive/folders/12Esij9r9f4wAf5mFvvJ1uWV3DgEZu7eg?usp=drive_link">model</a></td> <td><a href="https://api.wandb.ai/links/wuzirui-research/r1mbaeqw">report</a></td> </tr> <tr> <td rowspan="3">Vitural KITTI-2</td> <td>Scene06</td> <td>Novel View Synthesis 75%</td> <td>0-237</td> <td>600k</td> <td>32.32</td> <td>0.940</td> <td><a href="https://drive.google.com/drive/folders/10S6GcbfyIUCAgxwr6Mp7FgYcBzY-eWgB?usp=drive_link">model</a></td> <td><a href="https://api.wandb.ai/links/wuzirui-research/3747qu1z">report</a></td> </tr> <tr> <td>Scene06</td> <td>Novel View Synthesis 50%</td> <td>0-237</td> <td>600k</td> <td>32.16</td> <td>0.938</td> <td><a href="https://drive.google.com/drive/folders/1-m943ggGEgXRdK7NYGtGFEhWX4PA6DiT?usp=drive_link">model</a></td> <td><a href="https://api.wandb.ai/links/wuzirui-research/fch9iiy8">report</a></td> </tr> <tr> <td>Scene06</td> <td>Novel View Synthesis 25%</td> <td>0-237</td> <td>600k</td> <td>30.87</td> <td>0.935</td> <td><a href="https://drive.google.com/drive/folders/1-9mvzbd1j4vFJ7Zy3CBMWezOHmSpEfcx?usp=drive_link">model</a></td> <td><a href="https://api.wandb.ai/links/wuzirui-research/ne5xa2n1">report</a></td> </tr> </tbody> </table> </center>

You can use the following command to train a model from a pre-trained model:

ns-train mars-kitti-car-depth-recon --data /data/kitti-MOT/training/image_02/0006 --load-dir outputs/experiment_name/method_name/timestamp/nerfstudio

Model Configs

Our modular framework supports combining different architectures for each node by modifying model configurations. Her

Related Skills

node-connect

350.8k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

110.4k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

350.8k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

350.8k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。