<h2 align="center">Multi-View Azimuth Stereo via Tangent Space Consistency</h2> <h4 align="center"> <a href="https://xucao-42.github.io/homepage/"><strong>Xu Cao</strong></a> · <a href="https://sites.google.com/view/hiroaki-santo/"><strong>Hiroaki Santo</strong></a> · <a href="http://cvl.ist.osaka-u.ac.jp/user/okura/"><strong>Fumio Okura</strong></a> · <a href="http://www-infobiz.ist.osaka-u.ac.jp/en/member/matsushita/"><strong>Yasuyuki Matsushita</strong></a> </h3> <h4 align="center"><a href="https://cvpr2023.thecvf.com">CVPR 2023 </a></h3> <p align="center"> <br> <a href="https://arxiv.org/abs/2303.16447"> <img src='https://img.shields.io/badge/arXiv-Paper-981E32?style=for-the-badge&Color=B31B1B' alt='arXiv PDF'> </a> <a href='https://xucao-42.github.io/mvas_homepage/'> <img src='https://img.shields.io/badge/MVAS-Project Page-5468FF?style=for-the-badge' alt='Project Page'></a> </p> <div align="center"> <img src="./media/teaser.png" alt="Teaser" width="100%"> 3D reconstruction only using surface azimuth maps. </div>

Quick Start

Our code was tested on Ubuntu18.04 with Python 3.9, PyTorch 1.12, and Cuda 11.3. Follow these steps to reproduce our environment and results.

git clone https://github.com/xucao-42/mvas.git
cd mvas

conda env create -f environment.yml 
conda activate mvas

mkdir ./data
cd ./code

<details><summary>Train on DiLiGenT-MV data (280 MB)</summary>

Download data from Google Drive and extract it under data folder. Run

python exp_runner.py --config configs/diligent_mv.conf

</details> <details><summary>Train on SymPS data (5.7 GB)</summary>

Download data from Google Drive and extract it under data folder.

Run


python exp_runner.py --config configs/symps_gargoyle.conf
# or you can try symps_house.conf and symps_moai.conf

</details> <details><summary>Train on PANDORA data (2.7 GB)</summary>

Download data from Google Drive and extract it under data folder. Run

python exp_runner.py --config configs/pandora.conf

</details>

Results will be saved in results/$obj_name/$exp_time.

Data structure

<details><summary> DiLiGenT-MV </summary>

input_azimuth_maps: These are 16-bit RGBA images where the alpha channel represents the object mask and the RGB channels are identical. Each RGB channel can be converted to azimuth angles within [0, pi] by multiplying it by pi/65535. The azimuth angle is measured clockwise from the x-axis, which points to the right, and is consistent with OpenCV convention (x-axis to the right, y-axis downward). The azimuth maps do not need to be stored in the range [0, 2π], as our method is π-invariant.
vis_azimuth_maps: These are for visualization purposes only and are not used during training.
normal_maps: These are the normal maps used to create the input azimuth maps. We applied SDPS-Net independently in each view to obtain the normal maps.
params.json: This file is from PS-NeRF preprocessing and contains the camera intrinsic parameters, as the normal and azimuth maps are cropped to 400 x 400.
Calib_Results.mat: This file is from the original DiLiGenT-MV dataset and provides the camera extrinsic information.

</details> <details><summary> SymPS </summary>

input_azimuth_maps: These are 16-bit gray-scale images. The pixel values can be converted to azimuth angles within [0, pi] by multiplying them by pi/65535. The azimuth angle is measured clockwise from the x-axis, which points to the right, and is consistent with OpenCV convention (x-axis to the right, y-axis downward). The azimuth maps do not need to be stored in the range [0, 2π], as our method is π-invariant.
mask: Binary masks indicating the object silhouettes.
sparse: This folder contains Colmap-calibrated camera intrinsic and extrinsic information.
images_SfM: These are images used for structure from motion in Colmap.

</details> <details><summary> PANDORA </summary>

input_azimuth_maps: These are 16-bit RGBA images where the alpha channel represents the object mask and the RGB channels are identical. Each RGB channel can be converted to azimuth angles within [0, pi] by multiplying it by pi/65535. The azimuth angle is measured clockwise from the x-axis, which points to the right, and is consistent with OpenCV convention (x-axis to the right, y-axis downward). The azimuth maps do not need to be stored in the range [0, 2π], as our method is π-invariant. Note that since PANDORA is a polarization image dataset, the azimuth maps have half-pi ambiguity.
vis_azimuth_maps: These are for visualization purposes only and are not used during training.
sparse: This folder is from PANDORA and contains Colmap-calibrated camera intrinsic and extrinsic information.
images: These are for reference purpose and not used in training.

</details>

Acknowledgement

Our implementation is built upon IDR , and benefits from PS-NeRF and PANDORA.

Bibtex

@inproceedings{mvas2023cao,
title = {Multi-View Azimuth Stereo via Tangent Space Consistency},
author = {Cao, Xu and Santo, Hiroaki and Okura, Fumio and Matsushita, Yasuyuki},
year = {2023},
booktitle = CVPR,
}

Mvas

Install / Use

README

Quick Start

Data structure

Acknowledgement

Bibtex