SDFit

[ICCV 2025] SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image


<h1 align="center">SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image</h1>

<a href="https://anticdimi.github.io/">Dimitrije Antić</a><sup>1</sup>
<a href="https://gpaschalidis.github.io/">Georgios Paschalidis</a><sup>1</sup>
<a href="https://sha2nkt.github.io/">Shashank Tripathi</a><sup>2</sup>
<a href="https://staff.fnwi.uva.nl/th.gevers/">Theo Gevers</a><sup>1</sup>
<a href="https://saidwivedi.in/">Sai Kumar Dwivedi</a><sup>2</sup>
<a href="https://dtzionas.com/">Dimitrios Tzionas</a><sup>1</sup>

<sup>1</sup> University of Amsterdam, The Netherlands
<sup>2</sup> Max Planck Institute for Intelligent Systems, Tübingen, Germany

<h3 align="center">ICCV 2025</h3>

<img src="assets/git/sdfit_teaser.png" alt="Project Teaser" width="600px" />

<a href="https://arxiv.org/abs/2409.16178"> <img src="https://img.shields.io/badge/arXiv-2409.16178-b31b1b.svg?style=for-the-badge" alt="Paper PDF" /> </a>
<a href="https://anticdimi.github.io/sdfit/"> <img src="https://img.shields.io/badge/Project-Page-blue?style=for-the-badge&logo=Google%20chrome&logoColor=white" alt="Project Page" /> </a>

🔗 <a href="#-setup">Setup</a> • <a href="#-running-sdfit">Run</a> • <a href="#-citation">Citation</a> • <a href="#-code-release-status">Status</a>

🔧 Setup

To clone the repository, run:

git clone --recurse-submodules https://github.com/anticdimi/sdfit.git

To use our method, download the pretrained Deep Implicit Templates checkpoints from here and place them in the ./pretrained folder.
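Before going further, it can help to confirm that the checkpoints landed in the expected folder. The following is a minimal sketch and not part of the SDFit repository: the helper name `list_checkpoints` is hypothetical, and because the checkpoint file names and extensions are not specified here, it simply lists every regular file under `./pretrained`.

```python
from pathlib import Path


def list_checkpoints(root="./pretrained"):
    """Return every regular file found under `root`, sorted by path.

    Hypothetical helper for illustration; it makes no assumption about
    checkpoint names or extensions.
    """
    root_path = Path(root)
    if not root_path.is_dir():
        raise FileNotFoundError(f"Checkpoint folder not found: {root_path}")
    # rglob("*") walks subfolders too, in case checkpoints are nested.
    return sorted(p for p in root_path.rglob("*") if p.is_file())


if __name__ == "__main__":
    try:
        files = list_checkpoints()
    except FileNotFoundError as err:
        print(err)
    else:
        print(f"Found {len(files)} file(s) under ./pretrained")
        for path in files:
            print("  ", path)
```

An empty listing or a "folder not found" message suggests the download step was skipped or the files were unpacked elsewhere.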

export CONDA_CMD=mamba  # set to "conda" if you do not use mamba
$CONDA_CMD create -n sdfit_env python=3.10 -y
$CONDA_CMD activate sdfit_env

$CONDA_CMD install -c conda-forge gcc_linux-64=12.2.0 gxx_linux-64=12.2.0 ffmpeg x264 -y

pip install torch==2.3.0+cu121 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

wget https://anaconda.org/pytorch3d/pytorch3d/0.7.8/download/linux-64/pytorch3d-0.7.8-py310_cu121_pyt231.tar.bz2
$CONDA_CMD install pytorch3d-0.7.8-py310_cu121_pyt231.tar.bz2
rm -f pytorch3d-0.7.8-py310_cu121_pyt231.tar.bz2

python assets/test_torch_p3d_install.py
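The repository's `assets/test_torch_p3d_install.py` is the check to rely on. For a quick look at which packages the active environment can see, a rough stand-alone sketch follows. It is not that script, and the helper name `report_versions` is an assumption made for illustration. It reads package metadata only, so nothing is imported and no GPU is touched.

```python
from importlib import metadata


def report_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The package is not installed in the active environment.
            versions[name] = None
    return versions


if __name__ == "__main__":
    wanted = ["torch", "torchvision", "pytorch3d"]
    for name, version in report_versions(wanted).items():
        shown = version if version is not None else "NOT INSTALLED"
        print(f"{name:12s} {shown}")
```

Comparing the printed versions against the pins used in the install commands is a cheap way to catch an environment that was activated incorrectly.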

pip install opencv-python tqdm pre-commit scikit-learn iopath omegaconf plyfile \
    scikit-image tensorboardX h5py tensorboard trimesh matplotlib accelerate \
    huggingface_hub==0.23.0 diffusers==0.21.4 xformers==0.0.26.post1 transformers==4.26.1 \
    git+https://github.com/anticdimi/videoio.git \
    git+https://github.com/NVlabs/nvdiffrast.git

📋 Code Release Status

We will release our code in two stages:

[x] Demo
[ ] Full framework with detailed step-by-step instructions (est. December)

🚀 Running SDFit

To run the demo, download the demo example from here and place it under ${YOUR_DEMO_PATH}.

🧪 Demo

After setting up your environment and downloading the demo example, run:

python -m sdfit cfg/default.yaml data.input_path=${YOUR_DEMO_PATH}
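The `data.input_path=...` argument reads like a dotted-key override of a value in `cfg/default.yaml`. How `sdfit` actually parses it is not documented here, so the sketch below only illustrates the general idea with plain dictionaries, under the assumption that `key.subkey=value` targets a nested config entry. The function name `apply_override` is hypothetical, and values are kept as strings for simplicity.

```python
import copy


def apply_override(cfg, override):
    """Return a copy of `cfg` with a single `a.b.c=value` override applied."""
    dotted_key, sep, value = override.partition("=")
    if not sep or not dotted_key:
        raise ValueError(f"Expected KEY=VALUE, got: {override!r}")

    result = copy.deepcopy(cfg)  # leave the caller's config untouched
    node = result
    *parents, leaf = dotted_key.split(".")
    for part in parents:
        # Create intermediate sections on demand.
        node = node.setdefault(part, {})
        if not isinstance(node, dict):
            raise TypeError(f"Config entry {part!r} is not a section")
    node[leaf] = value
    return result


if __name__ == "__main__":
    base = {"data": {"input_path": None}}
    print(apply_override(base, "data.input_path=/path/to/demo"))
```

Under this reading, the command line above leaves every other setting in the YAML file at its default and replaces only the input path.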

📚 Citation

If you find this code useful for your research, please consider citing:

@inproceedings{antic2025sdfit,
    title     = {{SDFit}: {3D} Object Pose and Shape by Fitting a Morphable {SDF} to a Single Image},
    author    = {Anti\'{c}, Dimitrije and Paschalidis, Georgios and Tripathi, Shashank and Gevers, Theo and Dwivedi, Sai Kumar and Tzionas, Dimitrios},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
}

🙏 Acknowledgements

Our method builds upon prior open-source efforts. We thank the authors for releasing their code and models: Deep Implicit Templates, nvdiffrast, FlexiCubes, OpenShape, and Diff3F.
