SGP
[CVPR 2021 Oral] Self-supervised Geometric Perception
Install / Use
/learn @theNded/SGPREADME
SGP: Self-supervised Geometric Perception
[CVPR 2021 Oral] Self-supervised Geometric Perception https://arxiv.org/abs/2103.03114
Introduction
In short, SGP is, to the best of our knowledge, the first general framework for feature learning in geometric perception without any supervision from ground-truth geometric labels.
SGP runs in an EM fashion. It iteratively performs robust estimation of the geometric models to generate pseudo-labels, and feature learning under the supervision of the noisy pseudo-labels.
<img src="assets/overview.png" alt="overview" width="600"/>We applied SGP to camera pose estimation and point cloud registration, demonstrating performance that is on par or even superior to supervised oracles in large-scale real datasets.
Camera pose estimation
Deep image features like CAPS can be trained with relative pose labels generated by 5pt-RANSAC, bootstraped with the handcrafted SIFT feature. They can be later used in robust relative camera pose estimation.
<div float="left"> <img src="assets/caps-megadepth.png" width="350" /> <img src="assets/caps-scannet.png" width="350" /> </div>Point cloud registration
Deep 3D features like FCGF can be trained with relative pose labels generated by 3pt-RANSAC, bootstraped by the handcrafted FPFH feature. They can be later used in robust point cloud registration.
<div float="left"> <img src="assets/fpfh-3dmatch.png" width="350" /> <img src="assets/fcgf-3dmatch.png" width="350" /> </div>Code
Please see code/ for detailed intructions about how to use the code base.
Citation
@inproceedings{yang2021sgp,
title={Self-supervised Geometric Perception},
author={Yang, Heng and Dong, Wei and Carlone, Luca and Koltun, Vladlen},
booktitle={CVPR},
year={2021}
}
Related Skills
node-connect
339.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
83.9kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
339.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
83.9kCommit, push, and open a PR
