U4D
[CVPR 2026] U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences
In this work, we introduce U4D, an uncertainty-aware framework for 4D LiDAR world modeling. The main contributions are:
- We introduce the first uncertainty-aware LiDAR generation framework that explicitly models spatial difficulty to enhance reliability in 4D world modeling.
- We design a two-stage hard-to-easy generation paradigm that reconstructs uncertain regions first and then completes the full scene under these priors.
- We develop a Mixture of Spatio-Temporal (MoST) block that ensures temporal consistency across frames by adaptively balancing spatial geometry and temporal dynamics.
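The hard-to-easy ordering described above can be illustrated with a toy sketch. This is not the U4D implementation: the function names, the `hard_fraction` parameter, the top-fraction selection rule, and the stub generators are all assumptions made for illustration. It only shows the control flow of picking uncertain cells first, generating them, and then completing the rest of the scene conditioned on that result.

```python
from typing import Callable, List, Optional, Tuple

Grid = List[List[float]]
Mask = List[List[bool]]
Partial = List[List[Optional[float]]]


def hard_region_mask(uncertainty: Grid, hard_fraction: float = 0.25) -> Mask:
    """Mark the most uncertain cells as 'hard' (hypothetical selection rule).

    Cells tied with the threshold value are all marked, so slightly more
    than `hard_fraction` of the cells may be selected.
    """
    flat = sorted((u for row in uncertainty for u in row), reverse=True)
    k = max(1, int(round(hard_fraction * len(flat))))
    threshold = flat[k - 1]
    return [[u >= threshold for u in row] for row in uncertainty]


def two_stage_generate(
    uncertainty: Grid,
    generate_hard: Callable[[Mask], Partial],
    generate_easy: Callable[[Partial, Mask], Grid],
    hard_fraction: float = 0.25,
) -> Tuple[Grid, Mask]:
    """Run stage 1 on hard cells, then stage 2 on the remaining cells."""
    mask = hard_region_mask(uncertainty, hard_fraction)
    # Stage 1: values for hard cells only (None elsewhere).
    prior = generate_hard(mask)
    # Stage 2: full scene, conditioned on the stage-1 prior.
    full = generate_easy(prior, mask)
    # Keep stage-1 values in hard cells, stage-2 values everywhere else.
    scene = [
        [prior[i][j] if mask[i][j] else full[i][j] for j in range(len(row))]
        for i, row in enumerate(mask)
    ]
    return scene, mask


# Stub generators standing in for learned models (assumptions, not U4D code).
def stub_hard(mask: Mask) -> Partial:
    return [[1.0 if m else None for m in row] for row in mask]


def stub_easy(prior: Partial, mask: Mask) -> Grid:
    return [[0.0 for _ in row] for row in mask]


scene, mask = two_stage_generate(
    [[0.9, 0.1], [0.2, 0.8]], stub_hard, stub_easy, hard_fraction=0.5
)
print(mask)   # [[True, False], [False, True]]
print(scene)  # [[1.0, 0.0], [0.0, 1.0]]
```

In a real system the uncertainty map and both generators would be learned components; here they are replaced by fixed stubs so the ordering of the two stages is easy to follow.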
:books: Citation
If you find this work helpful for your research, please consider citing our paper:
@article{xu2025U4D,
title = {{U4D}: Uncertainty-Aware {4D} World Modeling from {LiDAR} Sequences},
author = {Xu, Xiang and Liang, Ao and Liu, Youquan and Li, Linfeng and Kong, Lingdong and Liu, Ziwei and Liu, Qingshan},
journal = {arXiv preprint arXiv:2512.02982},
year = {2025}
}
Updates
- [12.2025] - The technical report of U4D is available on arXiv.
License
This work is released under the <a rel="license" href="https://www.apache.org/licenses/LICENSE-2.0">Apache License, Version 2.0</a>, although some specific implementations in this codebase may be covered by other licenses. Please refer to LICENSE.md for details if you intend to use our code for commercial purposes.
Acknowledgements
This work is developed based on the R2DM codebase.
