ZoomNet
ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection
Install / Use
/learn @detectRecog/ZoomNetREADME
ZoomNet(AAAI2020 oral)
This is the repository for paper "ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection".
The pixel-wise annotations on the KITTI trainval set is available via:
Sample code for processing the provided annotations.
import pickle
import numpy as np
def load_pickle(filename):
with open(filename, 'rb') as f:
obj = pickle.load(f)
return obj
info = load_pickle(‘000002.pkl) # info.keys(): dict_keys(['data_idx', 'objects', 'calib', 'imw', 'imh', 'instL', 'instR’]). ‘InstL’ contains the pixel-wise inst_id(1-channel), depth(1-channel), part location (3-channel).
# sample code for pixel-wise depth annotation
pkl_objects = info['objects']
calib = info['calib']
f = calib['P'][0, 0]
bl = (calib['P'][0, 3] - calib['P3'][0, 3]) / f
f_bl = f * bl
inst_map_left_ = np.concatenate([np.expand_dims(el.toarray(), -1) for el in info['instL']], axis=-1)
# convert depth to disp
dispMapL = f_bl / inst_map_left_[:, :, 1].copy()
dispMapL[np.isinf(dispMapL)] = 0
print(dispMap.shape)
The code for generating pixel-wise annotations and ZoomNet (pytorch) needs to be organised before release. A version on paddle-paddle is also expected to be released. However, I’m currently working on a workshop on CVPR and thus delayed the release of code. I'm sorry about that.
If you are benefited from this paper, please cite our paper as follows:
@inproceedings{xu2020zoomnet,
title={ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection},
author={Xu, Zhenbo and Zhang, Wei and Ye, Xiaoqing and Tan, Xiao and Yang, Wei and Wen, Shilei and Ding, Errui and Meng, Ajin and Huang, Liusheng},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={2},
pages={7},
year={2020}
}
Related Skills
node-connect
351.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
110.6kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
351.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
351.2kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
Security Score
Audited on Mar 5, 2026
