SAFM
[CVPR 2022] Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis
Install / Use
/learn @cszy98/SAFMREADME
Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis
Introduction
The source code for our paper "Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis" (CVPR 2022)

Our Framework

Quick Start
Installation
git clone https://github.com/cszy98/SAFM.git
cd SAFM
pip install -r requirements.txt
cd models/counter
python setup.py install
Data Preparation
Follow the dataset preparation process in SPADE. Besides, we get the instance maps of ADE20K from instancesegmentation.
Testing and Evaluate
The pretrained models can be downloaded from GoogleDrive.
python test.py --name [experiment_name] --dataset_mode [dataset] --gpu_ids 0 --batchSize 2 --dataroot [path to dataroot] --which_epoch best --instance_root [path to instance maps]
Training
python train.py --name [experiment_name] --dataset_mode [dataset] --batchSize 4 --dataroot [path to dataroot] --instance_root [path to instance maps] --save_epoch_freq 5 --niter 100 --niter_decay 100
Acknowledgments
This code borrows heavily from SPADE.
Citation
If you find our work useful in your research or publication, please cite:
@article{lv2022semantic,
title={Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis},
author={Lv, Zhengyao and Li, Xiaoming and Niu, Zhenxing and Cao, Bing and Zuo, Wangmeng},
booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
year = {2022}
}
Contact
Please send email to cszy98@gmail.com
Related Skills
node-connect
347.9kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
108.7kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
347.9kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
347.9kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
