RPVNet

This is an unofficial PyTorch implementation of the Range-Point-Voxel Fusion Network for LiDAR point cloud segmentation.

Introduction

This is an unofficial implementation of RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation. It is a fusion network built on three representations of the same scan: range images, point clouds, and voxels. The range branch and the voxel branch each consist of a stem, four down blocks, and four up blocks. The point branch uses only four simple MLPs, which keeps it efficient while still extracting fine-grained geometric features. In addition, a Gated Fusion Module (GFM) adaptively measures the importance of each branch's features.
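To make the gating idea concrete, here is a minimal NumPy sketch of a gated fusion step. It is an illustration under stated assumptions, not the code in this repository: it assumes the three branches have already been mapped to per-point features of the same shape, and it scores each branch with a single linear projection followed by a softmax across branches. The names `gated_fusion`, `branch_feats`, and `branch_weights` are hypothetical.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def gated_fusion(features, weights):
    """Fuse per-point features from several branches with learned gates.

    features: list of (N, C) arrays, one per branch (assumed already aligned per point).
    weights:  list of (C, 1) arrays, one hypothetical scoring projection per branch.
    Returns the fused (N, C) features and the (N, B) gate values.
    """
    # One scalar importance score per point and per branch: shape (N, B).
    scores = np.concatenate([f @ w for f, w in zip(features, weights)], axis=1)
    # Normalise the scores across branches so the gates of each point sum to 1.
    gates = softmax(scores, axis=1)
    # Weighted sum of the branch features: (N, B, 1) * (N, B, C) summed over B.
    stacked = np.stack(features, axis=1)
    fused = (gates[:, :, None] * stacked).sum(axis=1)
    return fused, gates


rng = np.random.default_rng(0)
n_points, channels = 5, 8
# Stand-ins for range, point, and voxel features gathered at the same points.
branch_feats = [rng.normal(size=(n_points, channels)) for _ in range(3)]
branch_weights = [rng.normal(size=(channels, 1)) for _ in range(3)]
fused, gates = gated_fusion(branch_feats, branch_weights)
print(fused.shape)
```

Because the gates are normalised per point, a point where one branch is more informative can lean on that branch without changing the feature dimension.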

For more details, please refer to the paper listed under Reference.

Version with Chinese comments: https://github.com/GuoPingPan/RPVNet/tree/chinese

Dependencies

python 3
pyyaml
argparse
torch
torchvision
tqdm
numpy
torchsparse

torchsparse can be installed by following its official installation instructions.
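As a quick sanity check before training, the following sketch reports which of the packages listed above cannot be found in the current environment. It is a convenience script and not part of this repository; the helper name `missing_modules` is hypothetical, and the list uses import names, so pyyaml appears as `yaml`.

```python
import importlib.util

# Import names for the dependencies listed above (pyyaml is imported as "yaml").
REQUIRED_MODULES = [
    "yaml",
    "argparse",
    "torch",
    "torchvision",
    "tqdm",
    "numpy",
    "torchsparse",
]


def missing_modules(names):
    """Return the module names that cannot be located in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    print("missing:", ", ".join(missing) if missing else "none")
```

The check only confirms that each module can be located; it does not verify versions or CUDA compatibility.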

Data Preparation

SemanticKITTI

Please follow the instructions on the SemanticKITTI website to download the SemanticKITTI dataset (both the KITTI Odometry dataset and the SemanticKITTI labels) and extract all the files in the sequences folder to /dataset/semantic-kitti. You should see 22 folders 00, 01, …, 21, each with subfolders named velodyne and labels.

dataset
 - sequences
  - 00
    - velodyne
      - 000000.bin
      - 000001.bin
      - ...
    - labels
      - 000000.label
      - 000001.label
      - ...
    - poses.txt
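For orientation, the files in this layout can be read with NumPy as sketched below. This follows the public SemanticKITTI format (each point stored as four float32 values, each label as one uint32 whose lower 16 bits hold the semantic class and upper 16 bits the instance id). It is a standalone sketch, not the loader used by this repository, and the function names `load_scan` and `load_labels` are hypothetical.

```python
import numpy as np


def load_scan(bin_path):
    """Read a velodyne .bin file as an (N, 4) float32 array: x, y, z, remission."""
    return np.fromfile(bin_path, dtype=np.float32).reshape(-1, 4)


def load_labels(label_path):
    """Read a .label file and split each uint32 into semantic and instance ids."""
    raw = np.fromfile(label_path, dtype=np.uint32)
    semantic = raw & 0xFFFF   # lower 16 bits: semantic class id
    instance = raw >> 16      # upper 16 bits: instance id
    return semantic, instance
```

With the layout above, a call such as `load_scan("dataset/sequences/00/velodyne/000000.bin")` should return one row per LiDAR point, and `load_labels` on the matching `.label` file should return arrays of the same length.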

Quick Start

Ensure the dependencies are installed.
Train:

python train.py -d dataset/sequences --log <path to save model> [--ckpt <path to pretrained model> --freeze_layers --device <cpu or cuda>]

Test:

python inference.py -d dataset/sequences -ckpt <path to model> [--device <cpu or cuda>]

Reference

paper: RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation
spvnas: https://github.com/mit-han-lab/spvnas
SalsaNext: https://github.com/TiagoCortinhal/SalsaNext
kprnet: https://github.com/DeyvidKochanov-TomTom/kprnet
