SkillAgentSearch skills...

Mojito

Official repository of paper "Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens".

Install / Use

/learn @koyui/Mojito
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

<h1 align="center">Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens</h1> <p align="center"> <a href="https://cunkaixin.netlify.app" target="_blank">Ziwei Shan</a><sup>1,*</sup>, <a href="https://tropinoneh.github.io/profile/" target="_blank">Yaoyu He</a><sup>1,*</sup>, <a href="https://afterjourney00.github.io/" target="_blank">Chengfeng Zhao</a><sup>1,*,&dagger;</sup>, <a href="https://alt-js.github.io/" target="_blank">Jiashen Du</a><sup>1</sup>, <br> <a href="https://zhanglele12138.github.io/" target="_blank">Jingyan Zhang</a><sup>1</sup>, <a href="https://scholar.google.com/citations?user=YvwsqvYAAAAJ&hl=en" target="_blank">Qixuan Zhang</a><sup>1,2</sup>, <a href="https://scholar.google.com/citations?user=R9L_AfQAAAAJ&hl=en" target="_blank">Jingyi Yu</a><sup>1,&Dagger;</sup>, <a href="https://www.xu-lan.com/" target="_blank">Lan Xu</a><sup>1,&Dagger;</sup> </p> <p align="center"> <sup>1</sup>ShanghaiTech University&nbsp;&nbsp; <sup>2</sup>Deemos Technology <br> <i><sup>*</sup>Equal contribution</i> <br> <i><sup>&dagger;</sup>Project lead</i><i> &nbsp;&nbsp; <sup>&Dagger;</sup>Corresponding author</i> </p> <p align="center"> <a href="https://arxiv.org/abs/"><img src='https://img.shields.io/badge/arXiv-Paper-red?logo=arxiv&logoColor=white' alt='arXiv'></a> <a href='https://koyui.github.io/mojito/'><img src='https://img.shields.io/badge/Project_Page-Website-green?logo=googlechrome&logoColor=white' alt='Project Page'></a> </p> <div align="center"> <img width="900px" src="./assets/teaser.png"/> </div>

🚀 Getting Started

1. Environment Setup

We tested our environment on Ubuntu 20.04 LTS and Windows 11 with CUDA 12.1.

conda create python=3.10 --name mojito
conda activate mojito

conda install pytorch==2.5.0 torchvision==0.20.0 torchaudio==2.5.0 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -r requirements.txt

# ignore deepspeed installation if using Win 11
DS_BUILD_OPS=1 DS_BUILD_CUTLASS_OPS=0 DS_BUILD_RAGGED_DEVICE_OPS=0 DS_BUILD_EVOFORMER_ATTN=0 pip install deepspeed

conda install -c fvcore -c iopath -c conda-forge fvcore iopath
pip install "git+https://github.com/facebookresearch/pytorch3d.git@stable"

pip install "fastapi[standard]"

2. Prepare Body Model and Weights

Download SMPL-H (the extended SMPL+H model) and put the models under body_model/ folder. The structure of body_model/ folder should be:

body_model/
|--body_model.py
|--utils.py
|--smplh/
|----info.txt
|----LICENSE.txt
|----female/
|------model.npz
|----male/
|------model.npz
|----neutral/
|------model.npz

3. Download pretrained imu tokenizer model

We are releasing the IMU tokenizer model mojito_imu_tokenizer.pth. To set up:

  1. Download the model checkpoint.
  2. Create a checkpoints/ directory in your project if it doesn't exist.
  3. Place the downloaded file in checkpoints/mojito_imu_tokenizer.pth.

4. Example

Run the processing script

python -m example --cfg configs/config_imu_tokenizer.yaml --nodebug

🏄‍♂️ Contributors

<!-- ## 📖 Citation -->

📖 Citation

If you find our code or paper helps, please consider citing:

@article{shan2025mojito,
  title   = {Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens},
  author  = {Shan, Ziwei and He, Yaoyu and Du, Jiashen and Zhao, Chengfeng and Zhang, Jingyan and 
             Zhang, Qixuan and Yu, Jingyi and Xu, Lan},
  journal = {arXiv preprint arXiv:},
  year    = {2025}
}

Acknowledgments

Thanks to the following work that we refer to and benefit from:

  • MotionGPT: the overall framework;
  • Qwen2: the causal language model;
  • EgoEgo: the SMPL-H body model script;
  • TransPose: the data pre-processing of TotalCapture dataset;
  • SmoothNet: SMPL pose smoother

Licenses

<a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-nc-sa/4.0/80x15.png" /></a><br />This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/">Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License</a>.

View on GitHub
GitHub Stars21
CategoryDevelopment
Updated19d ago
Forks0

Languages

Python

Security Score

75/100

Audited on Mar 12, 2026

No findings