DAAAM
Describe Anything, Anywhere, at Any Moment (DAAAM), a novel approach to real-time, large-scale, spatio-temporal memory
Install / Use
/learn @MIT-SPARK/DAAAMREADME
CVPR 2026: DAAAM - Describe Anything, Anywhere, at Any Moment
[arXiv] [Project Page]
<p align="center"> <img src="assets/Title_Figure_compressed.drawio.png" alt="DAAAM Overview"/> </p>Real-time foundation-model-first robot mapping: SAM segmentation + BotSort tracking + VLM grounding feed into Hydra to build 3D Dynamic Scene Graphs on the fly.
Key contributions:
- Novel optimization-based frontend for semantic descriptions from localized captioning models
- Hierarchical 4D scene graph construction with real-time performance
- State-of-the-art results on NaVQA and SG3D benchmarks
The ROS 2 interface lives in DAAAM-ROS.
Installation | Running | Codebase | DAAAM-ROS
Paper
If you use this code in your work, please cite the following paper:
Nicolas Gorlo, Lukas Schmid, and Luca Carlone, "Describe Anything, Anywhere, at Any Moment". arXiv preprint arXiv:2512.00565, 2025.
@article{Gorlo2025DAAAM,
title={Describe Anything Anywhere At Any Moment},
author={Nicolas Gorlo and Lukas Schmid and Luca Carlone},
year={2025},
eprint={2512.00565},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.00565}
}
This work was supported by the ARL DCIST program and the ONR RAPID program.
Related Skills
node-connect
345.4kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
104.6kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
345.4kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
345.4kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
