TransHuman
Official code for ICCV 2023 paper: "TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering".
Install / Use
/learn @pansanity666/TransHumanREADME
News
05/08/2023The code and the pretrained model are released!23/10/2023We have collected recent advances in Human Avatars in this repo, welcome stars!
<br><br>
TransHuman
Official code for ICCV 2023 paper:
TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering <br> Xiao Pan<sup>1,2</sup>, Zongxin Yang<sup>1</sup>, Jianxin Ma<sup>2</sup>, Chang Zhou<sup>2</sup>, Yi Yang<sup>1</sup> <br> <sup>1</sup> ReLER Lab, CCAI, Zhejiang University; <sup>2</sup> Alibaba DAMO Academy
[Project Page | arXiv]
- We present a brand-new framework named TransHuman for generalizable neural human rendering, which learns the painted SMPL under the canonical space and captures the global relationships between human parts with transformers.
- We achieve SOTA performance on various settings and datasets.
- We also have better efficiency.
<br><br>
Environment
We test with:
- python==3.6.12
- pytorch==1.10.2
- cuda==11.3
# under TransHuman dir
# create conda environment
conda create -n transhuman python=3.6
conda activate transhuman
# make sure that the pytorch cuda is consistent with the system cuda
# install pytorch via conda
https://pytorch.org/get-started/locally/
# install requirements
pip install -r requirements.txt
# install pytorch3d (we build from source)
https://github.com/facebookresearch/pytorch3d/blob/main/INSTALL.md
<br><br>
Dataset Preparation
ZJU-MoCap
Please follow NHP to prepare the ZJU-MoCap dataset.
The final structure of data folder should be:
# under TransHuman dir
-data
-smplx
- smpl
- ...
-zju_mocap
- CoreView_313
- ...
-zju_rasterization
- CoreView_313
- ...
TODO: Code for more datasets are coming.
<br><br>
Training
sh ./scripts/train.sh
The checkpoints will be saved under ./data/trained_model/transhuman/$EXP_NAME .
<br><br>
Evaluation
sh ./scripts/test.sh $GPU_NUMBER $EPOCH_NUMBER $EXP_NAME
The config of different settings are provided in test.sh. Modify them as you need.
For reproducing the results in the paper, please download the official checkpoints from here.
Put it under ./data/trained_model/transhuman/official, and run:
sh ./scripts/test.sh 0 2100 official
<br><br>
Visualization
Free-viewpoint video
<p float="left" align="middle"> <img src="./docs/static/images/313_dyn_demo.gif" width="30%"/> <img src="./docs/static/images/390_dyn_demo.gif" width="30%"/> <img src="./docs/static/images/396_dyn_demo.gif" width="30%"/> </p>-
Render the free-viewpoint frames via running:
sh ./scripts/video.sh $GPU_NUMBER $EPOCH_NUMBER $EXP_NAMEThe rendered frames will be saved under
./data/perform/$EXP_NAME. -
Use
gen_freeview_video.pyfor getting the final video.
Mech reconstruction
<p float="left" align="middle"> <img src="./docs/static/images/mesh_390.gif" width="40%"/> <img src="./docs/static/images/mesh_313.gif" width="40%"/> </p>-
Extract the mesh via running:
sh ./script/mesh.sh $GPU_NUMBER $EPOCH_NUMBER $EXP_NAMEThe meshes will be saved under
./data/mesh/$EXP_NAME. -
Render the meshes using
render_mesh_dynamic.py. The rendered frames will also be saved under./data/mesh/$EXP_NAME -
Use
gen_freeview_video.pyfor getting the final video.
<br><br>
Citation
If you find our work useful, please kindly cite:
@InProceedings{Pan_2023_ICCV,
author = {Pan, Xiao and Yang, Zongxin and Ma, Jianxin and Zhou, Chang and Yang, Yi},
title = {TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2023},
pages = {3544-3555}
}
<br><br>
Contact
For questions, feel free to contact xiaopan@zju.edu.cn.
<br><br>
Acknowledgments
This project is mainly based on the code from NHP and humannerf. We also thank Sida Peng of Zhejiang University for helpful discussions on details of ZJU-MoCap dataset.
Related Skills
node-connect
343.3kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
92.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
343.3kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
343.3kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
