Pvcnn
[NeurIPS 2019, Spotlight] Point-Voxel CNN for Efficient 3D Deep Learning
Install / Use
/learn @mit-han-lab/PvcnnREADME
PVCNN: Point-Voxel CNN for Efficient 3D Deep Learning
NVIDIA Jetson Community Project Spotlight
@inproceedings{liu2019pvcnn,
title={Point-Voxel CNN for Efficient 3D Deep Learning},
author={Liu, Zhijian and Tang, Haotian and Lin, Yujun and Han, Song},
booktitle={Advances in Neural Information Processing Systems},
year={2019}
}
Prerequisites
The code is built with following libraries (see requirements.txt):
Data Preparation
S3DIS
We follow the data pre-processing in PointCNN.
The code for preprocessing the S3DIS dataset is located in data/s3dis/.
One should first download the dataset from here, then run
python data/s3dis/prepare_data.py -d [path to unzipped dataset dir]
ShapeNet
We follow the data pre-processing in PointNet2. Please run the following command to down the dataset
./data/shapenet/download.sh
KITTI
For Frustum-PointNet backbone, we follow the data pre-processing in Frustum-Pointnets. One should first download the ground truth labels from here, then run
unzip data_object_label_2.zip
mv training/label_2 data/kitti/ground_truth
./data/kitti/frustum/download.sh
Code
The core code to implement PVConv is modules/pvconv.py. Its key idea costs only a few lines of code:
voxel_features, voxel_coords = voxelize(features, coords)
voxel_features = voxel_layers(voxel_features)
voxel_features = trilinear_devoxelize(voxel_features, voxel_coords, resolution)
fused_features = voxel_features + point_layers(features)
Pretrained Models
Here we provide some of the pretrained models. The accuracy might vary a little bit compared to the paper, since we re-train some of the models for reproducibility.
S3DIS
We compare PVCNN against the PointNet, 3D-UNet and PointCNN performance as reported in the following table. The accuracy is tested following PointCNN. The list is keeping updated.
| Models | Overall Acc | mIoU | | :---------------------------------------------------------------------------------------------------------: | :---------: | :----------: | | PointNet | 82.54 | 42.97 | | PointNet (Reproduce) | 80.46 | 44.03 | | PVCNN (0.125 x C) | 82.79 | 48.75 | | PVCNN (0.25 x C) | 85.00 | 53.08 | | 3D-UNet | 85.12 | 54.93 | | PVCNN | 86.47 | 56.64 | | PointCNN | 85.91 | 57.26 | | PVCNN++ (0.5 x C) | 86.88 | 58.30 | | PVCNN++ | 87.48 | 59.02 |
ShapeNet
We compare PVCNN against the PointNet, PointNet++, 3D-UNet, Spider CNN and PointCNN performance as reported in the following table. The accuracy is tested following PointNet. The list is keeping updated.
| Models | mIoU | | :-------------------------------------------------------------------------------------------------------------: | :----------: | | PointNet (Reproduce) | 83.5 | | PointNet | 83.7 | | 3D-UNet | 84.6 | | PVCNN (0.25 x C) | 84.9 | | PointNet++ SSG (Reproduce) | 85.1 | | PointNet++ MSG | 85.1 | | PVCNN (0.25 x C, DML) | 85.1 | | SpiderCNN | 85.3 | | PointNet++ MSG (Reproduce) | 85.3 | | PVCNN (0.5 x C) | 85.5 | | PVCNN | 85.8 | | PointCNN | 86.1 | | PVCNN (DML) | 86.1 |
KITTI
We compare PVCNN (Efficient Version in the paper) against PointNets performance as reported in the following table. The accuracy is tested on val set following Frustum PointNets using modified code from kitti-object-eval-python. Since there is random sampling in Frustum Pointnets, random seed will influence the evaluation. All results provided by us are the average of 20 measurements with different seeds, and the best one of 20 measurements is shown in the parentheses. The list is keeping updated.
| Models | Car | Car | Car | Pedestrian | Pedestrian | Pedestrian | Cyclist | Cyclist | Cyclist | |:--------------------------------------------------------------------------------------------------------------------:|:-----------------:|:-----------------:|:-----------------:|:-----------------:|:-----------------:|:-----------------:|:-----------------:|:-----------------:|:-----------------:| | | Easy | Moderate | Hard | Easy | Moderate | Hard | Easy | Moderate | Hard | | Frustum PointNet | 83.26 | 69.28 | 62.56 | - | - | - | - | - | - | | Frustum PointNet (Reproduce) | 85.24 (85.17) | 71.63 (71.56) | 63.79 (63.78) | 66.44 (66.83) | 56.90 (57.20) | 50.43 (50.54) | 77.14 (78.16) | 56.46 (57.41) | 52.79 (53.66) | | Frustum PointNet++ | 83.76 | 70.92 | 63.65 | 70.00 | 61.32 | 53.59 | 77.15 | 56.49 | 53.37 | | Frustum PointNet++ (Reproduce) | 84.72 (84.46) | 71.99 (71.95) | 64.20 (64.13) | 68.40 (69.27) | 60.03 (60.80) | 52.61 (53.19) | 75.56 (79.41) | 56.74 (58.65) | 53.33 (54.82) | | Frustum PVCNN (Efficient) | 85.25 (85.30) | 72.12 (72.22) | 64.24 (64.36) | 70.60 (70.60) | 61.24 (61.35) | 56.25 (56.38) | 78.10 (79.79) | 57.45 (58.72) | 53.65 (54.81) |
Testing Pretrained Models
For example, to test the downloaded pretrained models on S3DIS, one can run
python train.py [config-file] --devices [gpu-ids] --evaluate --configs.evaluate.best_checkpoint_path [path to the model checkpoint]
For instance, to evaluate PVCNN on GPU 0,1 (with 4096 points on Area 5 of S3DIS), one can run
python train.py configs/s3dis/pvcnn/area5.py --devices 0,1 --evaluate --configs.evaluate.best_checkpoint_path s3dis.pvcnn.area5.c1.pth.tar
Specially, for Frustum KITTI evaluation, one can specify the number of measurements to eliminate the random seed effects,
