AdaAT
Project of "Adaptive Affine Transformation: A Simple and Effective Operation for Spatial Misaligned Image Generation"
Install / Use
/learn @MRzzm/AdaATREADME
Adaptive Affine Transformation: A Simple and Effective Operation for Spatial Misaligned Image Generation. (accepted in MM2022)

Paper Demo video Supplementary materials
Face Reenactment & Talking Face
The 3DMM model can not be released, we will replace it with FLAME in the future.
Face visually dubbing
The source code is in here.
Person image generation
Download resources (pretrained model etc.)
Download resources in Google drive, including:
- person_epoch_30.pth: Pretrained model on deep fashion dataset stopped in 30 epoch.
- person_epoch_40.pth: Pretrained model on deep fashion dataset stopped in 40 epoch (has better performance than "person_epoch_30.pth").
- test_image_person_deepFashion_30epoch.zip: Inference images of "person_epoch_30.pth" on deep fashion test data for convenient comparisons.
- test_image_person_deepFashion_40epoch.zip: Inference images of "person_epoch_40.pth" on deep fashion test data for convenient comparisons.
- example_person_source_img.jpg: Source example image for person image generation.
- example_person_souce_kp.txt: Source example key points for person image generation.
- example_person_target_kp.txt: Target example key points for person image generation.
- example_person_inference_img.jpg: Inference example image for person image generation.
- fasion_train_data.json: Training json file of deep fashion dataset.\
Train on deep fashion dataset
- Download deep fashion dataset from here. We use the dataset as same as in SelectionGAN.
- Unzip the dataset.
- run
python train_person_image.py --train_data=./assert/fasion_train_data.json --train_img_dir=./deepFashion/fashion_data/train
Inference
To inference one person image from one source person image, source key points and target key points, run
python inference_person_image.py --inference_model_path=./assert/person_epoch_30.pth --source_img_path=./assert/example_person_source_img.jpg --source_kp_path=./assert/example_person_souce_kp.txt --target_kp_path=./assert/example_person_target_kp.txt --res_person_path=./assert/example_person_inference_img.jpg
Compute metrics
To compute the metrics of SSIM and LIPIS on deep fashion test data, run
python compute_metrics.py --inference_img_dir --real_img_dir=./deepFashion/fashion_data/test --task_type=person
Citation
If you use AdaAT operator in your work, please cite
@inproceedings{zhang2022adaptive,
title={Adaptive Affine Transformation: A Simple and Effective Operation for Spatial Misaligned Image Generation},
author={Zhang, Zhimeng and Ding, Yu},
booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
pages={1167--1176},
year={2022}
}
Acknowledgement
The basic modules are borrowed from first-order-model, thanks for their contributions.
Related Skills
qqbot-channel
347.0kQQ 频道管理技能。查询频道列表、子频道、成员、发帖、公告、日程等操作。使用 qqbot_channel_api 工具代理 QQ 开放平台 HTTP 接口,自动处理 Token 鉴权。当用户需要查看频道、管理子频道、查询成员、发布帖子/公告/日程时使用。
docs-writer
100.1k`docs-writer` skill instructions As an expert technical writer and editor for the Gemini CLI project, you produce accurate, clear, and consistent documentation. When asked to write, edit, or revie
model-usage
347.0kUse CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Design
Campus Second-Hand Trading Platform \- General Design Document (v5.0 \- React Architecture \- Complete Final Version)1\. System Overall Design 1.1. Project Overview This project aims t
