CIGOcc
CIGOcc: Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion
Install / Use
/learn @VitaLemonTea1/CIGOccREADME
CIGOcc: Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion
<div style="text-align: justify">This is the code of CIGOcc.
Camera-based occupancy prediction is a main- stream approach for 3D perception in autonomous driving, aiming to infer complete 3D scene geometry and semantics from 2D images. Almost existing methods focus on improv- ing performance through structural modifications, such as lightweight backbones and complex cascaded frameworks, with good yet limited performance. Few studies explore from the perspective of representation fusion, leaving the rich diversity of features in 2D images underutilized. Motivated by this, we propose CIGOcc, a two-stage occupancy prediction framework based on multi-level representation fusion. CIGOcc extracts segmentation, graphics, and depth features from an input image and introduces a deformable multi-level fusion mechanism to fuse these three multi-level features. Additionally, CIGOcc incorporates knowledge distilled from SAM to further enhance prediction accuracy. Without increasing training costs, CIGOcc achieves state-of-the-art performance on the SemanticKITTI benchmark.
<p align="center"> <img src="figs/pipeline.png" width="1000"> </p>Getting Start
Our code will be released soon.
Qualitative Results
<p align="center"> <img src="figs/visualization.png" width="1000"> </p>Related Skills
node-connect
351.8kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
110.9kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
351.8kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
351.8kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
Security Score
Audited on Jan 20, 2026
