ZoomNet

ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection


ZoomNet (AAAI 2020 oral)

This is the repository for the paper "ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection".

The pixel-wise annotations on the KITTI trainval set are available via:

Sample code for processing the provided annotations:

import pickle
import numpy as np

def load_pickle(filename):
    with open(filename, 'rb') as f:
        obj = pickle.load(f)
    return obj

info = load_pickle('000002.pkl')
# info.keys(): dict_keys(['data_idx', 'objects', 'calib', 'imw', 'imh', 'instL', 'instR'])
# 'instL' contains the pixel-wise inst_id (1 channel), depth (1 channel), and part location (3 channels).

# sample code for pixel-wise depth annotation
pkl_objects = info['objects']
calib = info['calib']
f = calib['P'][0, 0]  # focal length in pixels
bl = (calib['P'][0, 3] - calib['P3'][0, 3]) / f  # stereo baseline
f_bl = f * bl
# convert each channel to a dense array and stack them along the last axis
inst_map_left_ = np.concatenate([np.expand_dims(el.toarray(), -1) for el in info['instL']], axis=-1)
# convert depth to disp; pixels with zero depth produce inf and are reset to 0
dispMapL = f_bl / inst_map_left_[:, :, 1].copy()
dispMapL[np.isinf(dispMapL)] = 0
print(dispMapL.shape)
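
The depth-to-disparity step in the snippet above follows disp = f * baseline / depth. A minimal, self-contained sketch of that conversion is shown below. The function name `depth_to_disparity`, its parameter names, and the numeric values are illustrative assumptions and are not part of the released annotations or code. Instead of dividing by zero and clearing the resulting inf values afterwards, this variant only divides where the depth is positive.

```python
import numpy as np

def depth_to_disparity(depth, focal_px, baseline_m):
    """Convert a depth map to a disparity map: disp = focal * baseline / depth.

    Pixels with non-positive depth (no annotation) get disparity 0.
    """
    depth = np.asarray(depth, dtype=np.float64)
    disp = np.zeros_like(depth)
    valid = depth > 0
    disp[valid] = focal_px * baseline_m / depth[valid]
    return disp

# Synthetic values for illustration only (not taken from a KITTI calibration file).
depth = np.array([[0.0, 10.0], [20.0, 40.0]])
disp = depth_to_disparity(depth, focal_px=720.0, baseline_m=0.5)
print(disp)
```

With the real annotations, `depth` would be the depth channel `inst_map_left_[:, :, 1]`, and `focal_px` and `baseline_m` would be the `f` and `bl` values computed from `calib`.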

The code for generating the pixel-wise annotations and the ZoomNet implementation (PyTorch) needs to be organised before release. A PaddlePaddle version is also expected to be released. However, I am currently working on a CVPR workshop, which has delayed the code release. I'm sorry about that.

If you benefit from this paper, please cite it as follows:

@inproceedings{xu2020zoomnet,
  title={ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection},
  author={Xu, Zhenbo and Zhang, Wei and Ye, Xiaoqing and Tan, Xiao and Yang, Wei and Wen, Shilei and Ding, Errui and Meng, Ajin and Huang, Liusheng},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={2},
  pages={7},
  year={2020}
}
