RPVNet
This job is Non-official PyTorch implementation of the Range-Point-Voxel Funsion Network for lidar point cloud segmentation.
Install / Use
/learn @GuoPingPan/RPVNetREADME
RPVNet
Introdution
This is an Non-official implement of RPVNet:A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation. It is a fusion network based on three kinds of representation including Range image,Point Clouds and Voxels. For each Range or Voxel branch, it consist of a stem, four upblocks and four downblock. And point branch only uses four simple mlps bringing great efficiency and ability to extract fine-grained geometric features. What's more, Gated Fusion Module(GFM) was designed to adaptively measurse the importance of feature for each branch.
<div align="center"> <a href="https://github.com/GuoPingPan/RPVNet"><img width="500px" height="auto" src="demo/rpvnet.png"></a> </div> </br></br>More details for this job for searching this paper.
Chinese comments Version: https://github.com/GuoPingPan/RPVNet/tree/chinese
Dependencies
- python 3
- pyyaml
- argparse
- torch
- torchvision
- tqdm
- numpy
- torchsparse
</br>torchsparse can be install by flowing this
Data Preparation
SemanticKITTI
Please follow the instructions from here to download the SemanticKITTI dataset (both KITTI Odometry dataset and SemanticKITTI labels) and extract all the files in the sequences folder to /dataset/semantic-kitti. You shall see 22 folders 00, 01, …, 21; each with subfolders named velodyne and labels.
dataset
- sequences
- 00
- velodyne
- 000000.bin
- 000001.bin
- ...
- labels
- 000000.label
- 000001.label
- ...
- poses.txt
</br>
Quick Start
- ensure the dependencies
- train
python train.py -d dataset/sequences --log <path to save model> [--ckpt <path to pretrained model> --freeze_layers --device <cpu or cuda>]
- test
python inference.py -d dataset/sequences -ckpt <path to model> [--device <cpu or cuda>]
</br>
Reference
- paper: RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation
- spvnas: https://github.com/mit-han-lab/spvnas
- SalsaNext: https://github.com/TiagoCortinhal/SalsaNext
- kprnet: https://github.com/DeyvidKochanov-TomTom/kprnet
Related Skills
node-connect
348.0kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
108.8kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
348.0kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
348.0kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
