StreamingFlow

StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation

Generate Convert Improve

Install / Use

/learn @synsin0/StreamingFlow

About this skill

Quality Score

0/100

README

StreamingFlow

StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation

</div>

This repo introduces StreamingFlow (CVPR2024 poster(hightlight)).

Demo videos

Occupancy forecasting on nuScenes dataset

https://github.com/synsin0/StreamingFlow/assets/37300008/ee225603-8434-4825-b912-7f3fc7095c85

Occupancy forecasting on Lyft dataset

https://github.com/synsin0/StreamingFlow/assets/37300008/54232c6c-4ae2-456a-9381-4df5c5624712

Streaming forecasting: foreseeing the future to 8s

https://github.com/synsin0/StreamingFlow/assets/37300008/d67c3eff-822d-43d8-946f-f7bde8c4a693

Streaming forecasting: predicting at given interval 0.05s/0.10s/0.25s

https://github.com/synsin0/StreamingFlow/assets/37300008/5754bfc3-6649-40fa-a3b7-5647aaba7b3d

https://github.com/synsin0/StreamingFlow/assets/37300008/8f1ddc37-e5f8-492b-b553-8fb37e7e26e8

https://github.com/synsin0/StreamingFlow/assets/37300008/c051c049-083f-453b-845e-b609c1b55ae0

Future (Ongoing) works

We implement StreamingFlow on Vidar codebase and generates streaming prediction on self-supervised 4d occupancy forecasting task with future point clouds as proxy. It is still in an early stage. We provide demo videos of current process.

Streaming forecasting with interval 0.5s:

https://github.com/synsin0/StreamingFlow/assets/37300008/a1dbe140-c33b-4800-b433-70f100e5bf6d

Streaming forecasting with interval 0.05s:

https://github.com/synsin0/StreamingFlow/assets/37300008/3509d5bd-7b4c-44f5-a9e7-2b02e4f94775

Framework

teaser

Abstract（TL DR）

StreamingFlow is a streaming occupancy forecasting framework which can input multi-modal asynchronous data streams (possibly with different given frequency) as input, and outputs future instance prediction in a continuous manner.

Installation and data setup

We follow the ST-P3 setup and bevfusion setup for environoment. For data setup, simply organize nuscenes and lyft dataset in ./data/nuscenes and ./data/lyft.

Models

| Settings | Image | LiDAR | ODE Step | IoU | VPQ | config | checkpoint | | ------------- | ------- | -------- | -------- | -------- | -------- | -------- | -------- | | past_1s, future_2s | Effi-B4-224x480-2Hz | Spconv8x-0050-5Hz | variable | 53.7 | 50.7 | config | ckpt |

Train command:

python train.py --config /path/to/config

Test command:

python evaluate.py --checkpoint /path/to/checkpoint

Experiments

We use streamingflow with variable ode step config and checkpoint to conduct the following experiments.

Predicting the unseen future exps

| Settings | 1s | 2s | 3s | 4s | 5s | 6s | 8s | | ------------- | ------- | -------- | -------- | -------- | -------- | -------- | -------- | | Variable | 56.5/54.4 | 53.7/50.7 | 50.4/47.2 | 47.2/44.1 | 44.1/41.1 | 40.7/38.0 | 34.4/32.6 |

Test command:

 python evaluate.py --checkpoint /path/to/checkpoint --future-frames N

here, N is for N * 0.5s future seconds.

Predicting at any future interval

| Settings | 0.05s | 0.1s | 0.25s | 0.5s | 0.6s |
| ------------- | ------- | -------- | -------- | -------- | -------- | | Variable | 48.2/45.2 | 49.5/46.4 | 51.5/48.5 | 53.6/49.6 | 53.4/49.8 |

Test command:

export PYTHONPATH=/project_root_dir/nuscenes-devkit/python-sdk:$PYTHONPATH
python evaluate_streaming.py --checkpoint /path/to/checkpoint --eval-interval N

here, N is for N * 0.05s interval.

Predicting with different data stream intervals

| Settings | 0.15s | 0.2s | 0.25s | 0.4s | 0.5s | | ------------- | ------- | -------- | -------- | -------- | -------- | | Variable | 53.1/50.0 | 53.7/50.7 | 53.2/50.3 | 50.6/47.4 | 47.6/44.5 |

Test command:

python evaluate_datastream.py --checkpoint /path/to/checkpoint --frame-skip N

here, N is for 20/N interval for lidar input stream interval.

License

All assets and code are under the Apache 2.0 license unless specified otherwise.

Citation

Please consider citing our paper if the project helps your research with the following BibTex:

@inproceedings{shi2024streamingflow,
  title={StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation},
  author={Shi, Yining and Jiang, Kun and Wang, Ke and Li, Jiusi and Wang, Yunlong and Yang, Mengmeng and Yang, Diange},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={14833--14842},
  year={2024}
}

Acknowledgements

Thanks to prior excellent open source projects:

Related Skills

node-connect

347.0k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

107.8k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

347.0k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

347.0k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。