SAFM

[CVPR 2022] Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis

Introduction

This repository contains the source code for our paper "Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis" (CVPR 2022).

[Figure: fullcmpv1]

Our Framework

[Figure: overview of the framework]

Quick Start

Installation

git clone https://github.com/cszy98/SAFM.git
cd SAFM
pip install -r requirements.txt
cd models/counter
python setup.py install

Data Preparation

Prepare the datasets by following the dataset preparation process in SPADE. In addition, we obtain the ADE20K instance maps from instancesegmentation.
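Since both testing and training take an --instance_root argument, it can help to confirm that every label map has a matching instance map before launching a run. The following is a minimal sketch, not part of this repository: the helper name find_missing_instance_maps is hypothetical, and it assumes that instance maps are PNG files sharing the same file stem as the label maps, which may not match the actual layout.

```python
from pathlib import Path


def find_missing_instance_maps(label_dir, instance_dir, suffix=".png"):
    """Return the sorted stems of label maps with no same-named instance map.

    Assumption (not confirmed by the repository): label maps and instance
    maps are stored as flat directories of files sharing the same stem.
    """
    label_stems = {p.stem for p in Path(label_dir).glob(f"*{suffix}")}
    instance_stems = {p.stem for p in Path(instance_dir).glob(f"*{suffix}")}
    return sorted(label_stems - instance_stems)
```

An empty return value means every label map found in label_dir has a counterpart in instance_dir under this naming assumption.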

Testing and Evaluation

The pretrained models can be downloaded from GoogleDrive.

python test.py --name [experiment_name] --dataset_mode [dataset] --gpu_ids 0 --batchSize 2 --dataroot [path to dataroot] --which_epoch best --instance_root [path to instance maps]

Training

python train.py --name [experiment_name] --dataset_mode [dataset] --batchSize 4 --dataroot [path to dataroot] --instance_root [path to instance maps] --save_epoch_freq 5 --niter 100 --niter_decay 100
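The --niter and --niter_decay flags are inherited from SPADE, where the learning rate stays constant for the first niter epochs and then decays linearly to zero over the next niter_decay epochs, so the command above trains for 200 epochs in total. The sketch below illustrates that schedule. It assumes this repository keeps SPADE's behavior unchanged and uses SPADE's default base learning rate of 0.0002; the function name is hypothetical, and the sketch ignores any separate scaling of generator and discriminator rates.

```python
def learning_rate_after_epoch(epoch, base_lr=2e-4, niter=100, niter_decay=100):
    """Approximate learning rate once `epoch` epochs have finished.

    Assumption: SPADE-style schedule, constant for `niter` epochs and then
    linear decay to zero across `niter_decay` further epochs.
    """
    if epoch <= niter:
        return base_lr
    decayed_epochs = min(epoch - niter, niter_decay)
    return base_lr * (1.0 - decayed_epochs / niter_decay)


# Total number of training epochs for --niter 100 --niter_decay 100.
total_epochs = 100 + 100
```

Under these assumptions the rate is unchanged through epoch 100, is halved by epoch 150, and reaches zero at epoch 200.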

Acknowledgments

This code borrows heavily from SPADE.

Citation

If you find our work useful in your research or publication, please cite:

@inproceedings{lv2022semantic,
  title={Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis},
  author={Lv, Zhengyao and Li, Xiaoming and Niu, Zhenxing and Cao, Bing and Zuo, Wangmeng},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
  year = {2022}
}

Contact

For questions, please send an email to cszy98@gmail.com.
