STAG
The official repository for Staged Contact-Aware Global Human Motion Forecasting [BMVC2023 oral].

Overview
We propose STAG (STAGed contact-aware global human motion forecasting), a novel three-stage pipeline for predicting global human motion in a 3D environment. The first stage models the scene and the human's interaction with it as contact points. The second stage forecasts the human trajectory within the scene, predicting the coarse motion of the body as a whole. The third and last stage matches plausible fine-grained joint motion to the trajectory, taking the estimated contacts into account.
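The three stages above can be sketched as a simple data flow. Everything here is an illustrative assumption (function names, tensor shapes, and the placeholder logic inside each stage are not the repository's API); it only shows how contacts feed the trajectory, and the trajectory feeds the pose stage.

```python
# Hypothetical sketch of STAG's three-stage data flow. Function names, shapes,
# and the placeholder logic are illustrative assumptions, not the repo's code.
import numpy as np

def stage1_contact_points(scene_points, past_poses):
    """Stage 1: estimate contact points between the body and the scene.
    Placeholder: pick the scene points nearest to the last observed pose."""
    last_joints = past_poses[-1]                              # (J, 3)
    dists = np.linalg.norm(scene_points[:, None] - last_joints[None], axis=-1)
    return scene_points[dists.min(axis=1).argsort()[:4]]      # 4 candidates

def stage2_trajectory(past_poses, contacts, horizon):
    """Stage 2: forecast the coarse global trajectory (root positions).
    Placeholder: drift the root toward the mean contact point."""
    root = past_poses[-1].mean(axis=0)                        # joint mean as root
    target = contacts.mean(axis=0)
    steps = np.linspace(0.0, 1.0, horizon)[:, None]
    return root[None] + steps * (target - root)[None]         # (horizon, 3)

def stage3_pose(past_poses, trajectory):
    """Stage 3: attach fine joint motion to the forecast trajectory.
    Placeholder: translate the last observed pose along the trajectory."""
    last = past_poses[-1]
    offsets = last - last.mean(axis=0)
    return trajectory[:, None] + offsets[None]                # (horizon, J, 3)

rng = np.random.default_rng(0)
scene = rng.normal(size=(500, 3))                             # scene point cloud
past = rng.normal(size=(10, 21, 3))                           # 10 frames, 21 joints

contacts = stage1_contact_points(scene, past)
traj = stage2_trajectory(past, contacts, horizon=30)
poses = stage3_pose(past, traj)
print(poses.shape)  # (30, 21, 3)
```

The real stages are learned models (see the training configs under Usage); this sketch only fixes the interfaces between them.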

Setup
Run the following commands to create an environment and install the required packages.
conda create -n stag
conda activate stag
pip install -r requirements.txt
Datasets
GTA-IM
To obtain the original dataset please contact the authors of Long-term Human Motion Prediction with Scene Context.
- Data pre-processing. After downloading the original GTA-IM Dataset (the 30-FPS version), unzip every file and put it in the data folder. Then run the script below to process the dataset.
python datasets/process_gta_dataset.py
The file structure of the data folder should look like this:
data
├── GTA-IM-Dataset
│   ├── 2020-06-03-13-31-46
│   │   ├── 0000.jpg
│   │   ├── 0000.png
│   │   ├── 0000_id.png
│   │   ├── ...
│   ├── 2020-06-04-23-14-02
│   ├── ...
│
└── data_v2_downsample0.02
    ├── 2020-06-04-23-14-02_r013_sf0.npz
    ├── 2020-06-04-23-08-16_r013_sf0.npz
    └── 2020-06-04-23-08-16_r013_sf1.npz
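The processed sequences are plain .npz archives, so they can be inspected with NumPy before training. The key names below are illustrative assumptions, not the dataset's actual schema (a stand-in file is created so the snippet is self-contained); on a real file under data/data_v2_downsample0.02/, call `npz.files` to list its true keys.

```python
import numpy as np

# Stand-in for a processed sequence such as
# data/data_v2_downsample0.02/2020-06-04-23-14-02_r013_sf0.npz.
# The keys "points" and "joints" are illustrative, NOT the real schema.
np.savez("example_sf0.npz",
         points=np.zeros((1000, 3)),     # hypothetical scene point cloud
         joints=np.zeros((60, 21, 3)))   # hypothetical joint sequence

with np.load("example_sf0.npz") as npz:
    print(sorted(npz.files))             # ['joints', 'points']
    for name in npz.files:
        print(name, npz[name].shape)
```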
Usage
To train each stage of the pipeline, run the following commands:
# Stage 1 -> contact points
python s1_cont_est_train.py --cfg gta_stage1_PVCNN2_DCT_CONT_GCN --is_amp --gpus 1
# Stage 2 -> Trajectory Forecasting
python s2_traj_train.py --cfg gta_stage2_TRAJ_GCN --is_amp --gpus 1
# Stage 3 -> Pose Forecasting
python s3_pose_train.py --cfg gta_stage3_GCN_POSE --is_amp --gpus 1
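The three training commands above share a common flag set. The argparse sketch below mirrors those flags purely for illustration (the real scripts' parsers may define more options or different defaults): `--cfg` selects a named config, `--is_amp` enables mixed-precision training, and `--gpus` sets the GPU count.

```python
# Illustrative mirror of the shared CLI flags; the actual parsers in the
# s*_train.py scripts may differ.
import argparse

def build_parser():
    p = argparse.ArgumentParser(description="STAG stage training (sketch)")
    p.add_argument("--cfg", required=True,
                   help="config name, e.g. gta_stage2_TRAJ_GCN")
    p.add_argument("--is_amp", action="store_true",
                   help="enable automatic mixed precision")
    p.add_argument("--gpus", type=int, default=1,
                   help="number of GPUs to train on")
    return p

args = build_parser().parse_args(
    ["--cfg", "gta_stage2_TRAJ_GCN", "--is_amp", "--gpus", "1"])
print(args.cfg, args.is_amp, args.gpus)
```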
To test STAG, run one of the following commands:
# Using GT contact points
python s3_pose_test.py --cfg_cont gta_stage1_PVCNN2_DCT_CONT_GCN --cfg gta_stage3_GCN_POSE --iter 50 --iter_cont 50
# Using estimated contact points
python s3_pose_test.py --cfg_cont gta_stage1_PVCNN2_DCT_CONT --cfg gta_stage3_GCN_POSE --iter 50 --iter_cont 50 --w_est_cont
Notes
- [ ] Release the code for the CU-Mocap dataset.
- [ ] Add the model checkpoints.
Reference
@article{Scofano2023StagedCG,
title={Staged Contact-Aware Global Human Motion Forecasting},
author={Luca Scofano and Alessio Sampieri and Elisabeth Schiele and Edoardo De Matteis and Laura Leal-Taix{\'e} and Fabio Galasso},
journal={ArXiv},
year={2023},
volume={abs/2309.08947},
url={https://api.semanticscholar.org/CorpusID:262043278}
}
Acknowledgements
Our code is based on the following repositories:
