PaddleGAN

PaddlePaddle GAN library, with a range of applications including First Order motion transfer, Wav2Lip, image restoration, image editing, photo2cartoon, image style transfer, and GPEN.


PaddleGAN provides developers with high-performance implementations of classic and SOTA Generative Adversarial Networks, and helps them quickly build, train, and deploy GANs for academic, entertainment, and industrial use.

The Generative Adversarial Network (GAN) was praised by Yann LeCun, "the Father of Convolutional Networks", as "one of the most interesting ideas in the field of computer science in the past decade". It is one of the research areas in deep learning that draws the most attention from AI researchers.

🎪 Hot Activities

2021.4.15~4.22

GAN 7-Day Course Camp: senior research developers from Baidu help you learn basic and advanced GAN knowledge in 7 days!

Course videos and related materials: https://aistudio.baidu.com/aistudio/course/introduce/16651

🚀 Recent Updates

👶 Young or Old? StyleGAN V2 Face Editing: Time Machine! 👨‍🦳
- Online Tutorials
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/146649047-765ec085-0a2c-4c88-9527-744836448651.gif' width='200'/> </div>
🔥 Latest Release: PP-MSVSR 🔥
- Video Super Resolution SOTA models
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/144848981-00c6ad21-0702-4381-9544-becb227ed9f0.gif' width='300'/> </div>
😍 Boy or Girl? StyleGAN V2 Face Editing: Changing Genders! 😍
- Online Tutorials
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/141226707-58bd661e-2102-4fb7-8e18-c794a6b59ee8.gif' width='300'/> </div>
👩‍🚀 A Space Odyssey: LapStyle image translation takes you on a trip around the universe 👨‍🚀
- Online Tutorials
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/133392621-9a552c46-841b-4fe4-bb24-7b0cbf86616c.gif' width='250'/> <img src='https://user-images.githubusercontent.com/48054808/133392630-c5329c4c-bc10-406e-a853-812a2b1f0fa6.gif' width='250'/> <img src='https://user-images.githubusercontent.com/48054808/133392652-f4811b1e-0676-4402-808b-a4c96c611368.gif' width='250'/> </div>
🧙‍♂️ Latest Creative Project: create a magic, dynamic profile photo for your Hogwarts student ID 🧙‍♀️
- Online Tutorials
  <div align='center'> <img src='https://ai-studio-static-online.cdn.bcebos.com/da1c51844ac048aa8d4fa3151be95215eee75d8bb488409d92ec17285b227c2c' width='200'/> </div>
💞 New Face Morphing feature 💞: merge any two faces and give the merged face any facial expression!
- Tutorials: https://aistudio.baidu.com/aistudio/projectdetail/2254031
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/128299870-66a73bb3-57a4-4985-aadc-8ddeab048145.gif' width='200'/> </div>
A new version of the First Order Motion model has been released with two notable features:
- High resolution 512x512
- Face Enhancement
- Tutorials: https://github.com/PaddlePaddle/PaddleGAN/blob/develop/docs/zh_CN/tutorials/motion_driving.md
New image translation capability, converting photos into oil-painting style:
- Complete tutorials for deployment: https://github.com/wzmsltw/PaintTransformer
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/129904830-8b87e310-ea51-4aff-b29b-88920ee82447.png' width='500'/> </div>

Document Tutorial

Installation

Environment requirements:
- PaddlePaddle >= 2.1.0
- Python >= 3.6
- CUDA >= 10.1
Full installation tutorial
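
The requirements listed above can be checked before installing anything else. The following is a minimal sketch, not part of PaddleGAN: the helper names `parse_version`, `meets_minimum`, and `check_environment` are hypothetical, and the script only assumes that an installed PaddlePaddle exposes its version as `paddle.__version__`. It does not check CUDA, and a development build that reports a placeholder version would be flagged as too old.

```python
import sys


def parse_version(text):
    """Turn '2.1.0' or '2.4.0rc1' into a tuple of three ints.

    Non-numeric suffixes such as 'rc1' are ignored (an assumption made
    for simplicity; this is not a full version-spec parser).
    """
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True if the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)


def check_environment():
    """Report whether Python and PaddlePaddle meet the listed minimums."""
    report = {"python": sys.version_info[:2] >= (3, 6)}
    try:
        import paddle  # only present if PaddlePaddle is installed
    except ImportError:
        report["paddlepaddle"] = False
    else:
        report["paddlepaddle"] = meets_minimum(paddle.__version__, "2.1.0")
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print("{}: {}".format(name, "OK" if ok else "missing or too old"))
```

Running the script prints one line per requirement. For the CUDA requirement, refer to the full installation tutorial.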

Starter Tutorial

Model Tutorial

Pixel2Pixel
CycleGAN
LapStyle
PSGAN
First Order Motion Model
FaceParsing
AnimeGANv2
U-GAT-IT
Photo2Cartoon
Wav2Lip
Single Image Super Resolution(SISR)
- Including: RealSR, ESRGAN, LESRCNN, PAN, DRN
Video Super Resolution(VSR)
- Including: ⭐ PP-MSVSR ⭐, EDVR, BasicVSR, BasicVSR++
StyleGAN2
Pixel2Style2Pixel
StarGANv2
MPR Net
FaceEnhancement
PReNet
SwinIR
InvDN
AOT-GAN
NAFNet
GFPGAN
GPEN

Composite Application

Video restore

Online Tutorial

You can run these projects in AI Studio to learn how to use the models above:

| Online Tutorial | Link |
|--------------|-----------|
| Motion Driving: multi-person "Mai-ha-hi" | Click and Try |
| Restore video of Beijing from a hundred years ago | Click and Try |
| Motion Driving: when "Su Daqiang" sings "unravel" | Click and Try |

Examples

Face Morphing

Image Translation

Old video restore

Motion driving

Super resolution

Makeup shifter

Face cartoonization

Realistic face cartoonization

Photo animation

Lip-syncing

NEW: try out the Lip-Syncing web demo on Hugging Face Spaces, built with Gradio.

Changelog

v2.1.0 (2021.12.8)
- Release the video super-resolution model PP-MSVSR and multiple pre-trained weights
- Release several SOTA video super-resolution models and their pre-trained weights, such as BasicVSR, IconVSR, and BasicVSR++
- Release a lightweight motion-driving model (model size compressed from 229M to 10.1M) and optimize the fusion effect
- Release high-resolution FOMM and Wav2Lip pre-trained models
- Release several applications based on StyleGANv2, such as face inversion, face fusion, and face editing
- Release LapStyle, a style transfer model developed by Baidu, along with its applications, and launch an experience page on the official website
- Release the lightweight image super-resolution model PAN
v2.0.0 (2021.6.2)
- Release First Order Motion model and multiple

