Mask2Hand
PyTorch Implementation of "Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow"
Install / Use
/learn @lijenchang/Mask2HandREADME
Mask2Hand
PyTorch implementation of "Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow",
Li-Jen Chang, Yu-Cheng Liao, Chia-Hui Lin, Shih-Fang Yang-Mao, and Hwann-Tzong Chen
APSIPA ASC 2023
[arXiv] [Paper]
Environment Setup
- Create the environment from the provided
environment.ymlfile.cd Mask2Hand conda env create -f environment.yml conda activate pytorch3d - Download the pretrained model from Dropbox link and put it in the directory
checkpointusing the following commands.mkdir -p ./checkpoint wget -O ./checkpoint/model_pretrained.pth https://www.dropbox.com/s/mujjj8ov5e8r9ok/model_pretrained.pth?dl=1 - Download FreiHAND Dataset v2 from the official website and unzip it into the directory
dataset/freihand/.mkdir -p ./dataset/freihand cd dataset wget https://lmb.informatik.uni-freiburg.de/data/freihand/FreiHAND_pub_v2.zip unzip -q ./FreiHAND_pub_v2.zip -d ./freihand cd .. - Download the file of dataset splits from this link and put it into
dataset/freihand/.
Run a Demo
- If you want to use GPU, run
CUDA_VISIBLE_DEVICES=0 python demo.py - Otherwise, run
python demo.py - The demo results will be saved in the directory
demo_output.
Evaluation
- Calculate the error of the predicted hand joints and meshes
CUDA_VISIBLE_DEVICES=0 python test.py - Calculate the mIoU between the ground-truth and projected silhouettes
CUDA_VISIBLE_DEVICES=0 python test_iou.py
Training
Run the following script to train a model from scratch.
CUDA_VISIBLE_DEVICES=0 python train.py
Citation
@article{chang2022mask2hand,
author={Li-Jen Chang and Yu-Cheng Liao and Chia-Hui Lin and Hwann-Tzong Chen},
title={Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow},
journal={CoRR},
volume={abs/2205.15553},
year={2022}
}
Acknowledgement
The PyTorch implementation of MANO comes from GrabNet and some visualization utilities are modified from CMR.
Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
last30days-skill
16.5kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
sec-edgar-agentkit
10AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.
