FreSCo

Frequency-Domain Scan Context for Robust LiDAR-based Place Recognition with Translation and Rotation Invariance.

Generate Convert Improve

Install / Use

/learn @soytony/FreSCo

About this skill

Quality Score

0/100

README

FreSCo

Frequency-Domain Scan Context for Robust LiDAR-based Place Recognition with Translation and Rotation Invariance.

News

This work is now accepted for ICARCV 2022.

1. Introducing FreSCo

Place recognition is a task expected to recognize a place when revisiting it using data from sensors like Camera or LiDAR. Existing works on hand-crafted global descriptors for LiDAR-based place recognition show remarkable performance, yet are usually viewpoint-dependent. The capibility of viewpoint invariance is important, as when revists occur, one cannot assure the new viewpoint around the same place does not deviate from the former one.

FreSCo is a novel global descriptor inspired by Scan Context and LiDAR Iris, but provides both translation invariance and rotation invariance at the same time.

2. How to Use

2.1 Data Preprocessing

Note that we separated the preprocessing part of the code from the project, and users may find more about how to preprocess the data from publicly available datasets (e.g. MulRan, KITTI, and Oxford Radar) at Point-Cloud-Preprocessing-Tools.

2.2 Getting Your Data Ready

Make sure the <keyframes_root_dir> directory is organized as follows:

<keyframes_root_dir>
├ ... 
├ non_ground_point_cloud/ <- folder for ground-removed point clouds in pcd format
├ output_multi_bev/ <- folder for multi-layer BEV images(NOT used currently)
└ output_single_bev <- folder for single-layer BEV images

2.3 Run Estimations

Check the bvm_dir and gt_pose_filename in ./fresco-matlab/test4XXX.m.

XXX is the sequence name, can be KASIT03, KITTI08 or Oxford.

Make sure they look like this:

bvm_dir = '<keyframes_root_dir>/output_single_bev/csv/';
gt_pose_filename = '<keyframes_root_dir>/keyframe_pose.csv';

Remember to replace <keyframes_root_dir> with the actual directory path.

Now run this script. It first computes the descriptors (if not cached), then computes the similarities, and finally estimate the performance in different threshold settings.

After the computations are done, it visualizes the retrieval results on the trajectory and shows precision-recall curve.

3. Performance

Details can be found in the paper.

3.1 Precision-Recall

<img src="./results_preview/pr_on_KAIST03.svg" width=300 /> <br> <div align="center" style="center;font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">KAIST03</div> | <img src="./results_preview/pr_on_KITTI08.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">KITTI 08</div> | <img src="./results_preview/pr_on_Oxford_01-11-13-24.svg" width=300 /><br><div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">Oxford</div> ---|---|---

3.2 Visualizations

3.2.1 MulRan KAIST 03

<img src="./results_preview/viz_fresco_KAIST03.svg" width=300 /> <br> <div align="center" style="justify-content:center;font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:700">FreSCo</div> | <img src="./results_preview/viz_scancontext_KAIST03.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">Scan Context</div> | <img src="./results_preview/viz_lidariris_KAIST03.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">LiDAR Iris</div> | <img src="./results_preview/viz_lidarorb_KAIST03.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">LiDAR ORB</div> ---|---|---|---

3.2.2 KITTI 08

<img src="./results_preview/viz_fresco_KITTI08.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:700">FreSCo</div> | <img src="./results_preview/viz_scancontext_KITTI08.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">Scan Context</div> | <img src="./results_preview/viz_lidariris_KITTI08.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">LiDAR Iris</div> | <img src="./results_preview/viz_lidarorb_KITTI08.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">LiDAR ORB</div> ---|---|---|---

3.2.3 Oxford Robotcar Radar

<img src="./results_preview/viz_fresco_Oxford.svg" width=300 /><br><div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:700">FreSCo</div> | <img src="./results_preview/viz_scancontext_Oxford.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">Scan Context</div> | <img src="./results_preview/viz_lidariris_Oxford.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">LiDAR Iris</div> | <img src="./results_preview/viz_lidarorb_Oxford.svg" width=300 /> <br> <div align="center" style="font-size:14px;color:#808080;font-family:'Helvetica Neue';font-weight:400">LiDAR ORB</div> ---|---|---|---

4. Citation

If you find this work useful to your research, please cite our paper (PDF):

@article{fan2022fresco,
  title={FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance},
  author={Fan, Yongzhi and Du, Xin and Luo, Lun and Shen, Jizhong},
  journal={arXiv preprint arXiv:2206.12628},
  year={2022}
}

5. TO-DOs

[ ] Re-implement the main framework in C++;
[ ] Try creating the keys using VLAD / NetVLAD;
[ ] Integrate the framework to popular LiDAR-Inertial SLAM projects.

Related Skills

node-connect

343.3k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

92.1k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

343.3k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

343.3k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。