PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
English | 简体中文
PaddleGAN
PaddleGAN provides developers with high-performance implementations of classic and SOTA Generative Adversarial Networks, and helps them quickly build, train, and deploy GANs for academic, entertainment, and industrial use.
The GAN (Generative Adversarial Network) was praised by Yann LeCun, "the father of convolutional networks", as "one of the most interesting ideas in the field of computer science in the past decade". It is one of the research areas in deep learning that AI researchers follow most closely.
<div align='center'> <img src='./docs/imgs/ppgan.jpg'> </div>
🎪 Hot Activities
- 2021.4.15~4.22
  GAN 7 Days Course Camp: senior research developers from Baidu help you learn basic and advanced GAN knowledge in 7 days!
  Course videos and related materials: https://aistudio.baidu.com/aistudio/course/introduce/16651
🚀 Recent Updates
- 👶 Young or Old?: StyleGAN V2 Face Editing-Time Machine! 👨‍🦳
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/146649047-765ec085-0a2c-4c88-9527-744836448651.gif' width='200'/> </div>
- 🔥 Latest Release: PP-MSVSR 🔥
  - Video Super Resolution SOTA models
- 😍 Boy or Girl?: StyleGAN V2 Face Editing-Changing genders! 😍
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/141226707-58bd661e-2102-4fb7-8e18-c794a6b59ee8.gif' width='300'/> </div>
- 👩‍🚀 A Space Odyssey: LapStyle image translation takes you on a trip around the universe 👨‍🚀
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/133392621-9a552c46-841b-4fe4-bb24-7b0cbf86616c.gif' width='250'/> <img src='https://user-images.githubusercontent.com/48054808/133392630-c5329c4c-bc10-406e-a853-812a2b1f0fa6.gif' width='250'/> <img src='https://user-images.githubusercontent.com/48054808/133392652-f4811b1e-0676-4402-808b-a4c96c611368.gif' width='250'/> </div>
- 🧙‍♂️ Latest Creative Project: create a magic/dynamic profile for your student ID at Hogwarts 🧙‍♀️
  <div align='center'> <img src='https://ai-studio-static-online.cdn.bcebos.com/da1c51844ac048aa8d4fa3151be95215eee75d8bb488409d92ec17285b227c2c' width='200'/> </div>
- 💞 Added Face Morphing function 💞: you can perfectly merge any two faces and give the new face any facial expression!
  - Tutorials: https://aistudio.baidu.com/aistudio/projectdetail/2254031
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/128299870-66a73bb3-57a4-4985-aadc-8ddeab048145.gif' width='200'/> </div>
- Published a new version of the First Order Motion model with two impressive features:
  - High resolution 512x512
  - Face Enhancement
  - Tutorials: https://github.com/PaddlePaddle/PaddleGAN/blob/develop/docs/zh_CN/tutorials/motion_driving.md
- New image translation ability: transfer a photo into oil painting style
  - Complete tutorials for deployment: https://github.com/wzmsltw/PaintTransformer
  <div align='center'> <img src='https://user-images.githubusercontent.com/48054808/129904830-8b87e310-ea51-4aff-b29b-88920ee82447.png' width='500'/> </div>
Document Tutorial
Installation
- Environment dependencies:
  - PaddlePaddle >= 2.1.0
  - Python >= 3.6
  - CUDA >= 10.1
- Full installation tutorial
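Before following the full installation tutorial, it can help to confirm that the local environment meets the minimum versions listed above. The script below is a minimal sketch and not part of PaddleGAN: the helper names `parse_version`, `meets_minimum`, and `check_environment` are made up for this example, and it assumes only that an installed PaddlePaddle exposes its version as `paddle.__version__`. It does not check CUDA, and a locally built development version of PaddlePaddle may report a version string that this simple comparison treats as too old.

```python
import sys

# Minimum versions copied from the dependency list above.
MIN_PYTHON = (3, 6)
MIN_PADDLE = (2, 1, 0)


def parse_version(text):
    """Turn '2.1.0' or '2.4.2.post117' into a tuple of its leading integers."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component, e.g. 'post117'
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare two version tuples after padding the shorter one with zeros."""
    width = max(len(installed), len(minimum))
    padded_installed = tuple(installed) + (0,) * (width - len(installed))
    padded_minimum = tuple(minimum) + (0,) * (width - len(minimum))
    return padded_installed >= padded_minimum


def check_environment():
    """Return a dict mapping each dependency to True if it meets the minimum."""
    report = {"python": meets_minimum(tuple(sys.version_info[:2]), MIN_PYTHON)}
    try:
        import paddle  # only importable if PaddlePaddle is installed
    except ImportError:
        report["paddle"] = False
    else:
        report["paddle"] = meets_minimum(
            parse_version(paddle.__version__), MIN_PADDLE
        )
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'ok' if ok else 'below minimum or missing'}")
```

Running the script prints one line per dependency. A `paddle` entry marked as below minimum or missing means PaddlePaddle still needs to be installed or upgraded before PaddleGAN is set up.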
Starter Tutorial
Model Tutorial
- Pixel2Pixel
- CycleGAN
- LapStyle
- PSGAN
- First Order Motion Model
- FaceParsing
- AnimeGANv2
- U-GAT-IT
- Photo2Cartoon
- Wav2Lip
- Single Image Super Resolution (SISR)
  - Including: RealSR, ESRGAN, LESRCNN, PAN, DRN
- Video Super Resolution (VSR)
  - Including: ⭐ PP-MSVSR ⭐, EDVR, BasicVSR, BasicVSR++
- StyleGAN2
- Pixel2Style2Pixel
- StarGANv2
- MPR Net
- FaceEnhancement
- PReNet
- SwinIR
- InvDN
- AOT-GAN
- NAFNet
- GFPGAN
- GPEN
Composite Application
Online Tutorial
You can run these projects in AI Studio to learn how to use the models above:

|Online Tutorial | link |
|--------------|-----------|
|Motion Driving-multi-personal "Mai-ha-hi" | Click and Try |
|Restore a video of Beijing from a hundred years ago|Click and Try|
|Motion Driving-When "Su Daqiang" sings "unravel" |Click and Try|
Examples
Face Morphing
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/129020371-75de20d1-705b-44b1-8254-e09710124244.gif' width='700' /> </div>
Image Translation
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119464966-d5c1c000-bd75-11eb-9696-9bb75357229f.gif' width='700' height='200'/> </div>
Old video restore
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119469496-fc81f580-bd79-11eb-865a-5e38482b1ae8.gif' width='700'/> </div>
Motion driving
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119469551-0a377b00-bd7a-11eb-9117-e4871c8fb9c0.gif' width='700'> </div>
Super resolution
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119469753-3e12a080-bd7a-11eb-9cde-4fa01b3201ab.png' width='700' height='250'/> </div>
Makeup shifter
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119469834-4ff44380-bd7a-11eb-93b6-05b705dcfbf2.png' width='700' height='250'/> </div>
Face cartoonization
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119469952-6bf7e500-bd7a-11eb-89ad-9a78b10bd4ab.png' width='700' height='250'/> </div>
Realistic face cartoonization
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119470028-7f0ab500-bd7a-11eb-88e9-78a6b9e2e319.png' width='700' height='250'/> </div>
Photo animation
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119470099-9184ee80-bd7a-11eb-8b12-c9400fe01266.png' width='700' height='250'/> </div>
Lip-syncing
<div align='center'> <img src='https://user-images.githubusercontent.com/48054808/119470166-a6618200-bd7a-11eb-9f98-58052ce21b14.gif' width='700'> </div>
NEW try out the Lip-Syncing web demo on Huggingface Spaces using Gradio:
Changelog
- v2.1.0 (2021.12.8)
  - Released the video super-resolution model PP-MSVSR and multiple pre-trained weights
  - Released several SOTA video super-resolution models and their pre-trained weights, such as BasicVSR, IconVSR and BasicVSR++
  - Released a lightweight motion-driving model (model size compressed from 229M to 10.1M) and optimized the fusion effect
  - Released high-resolution FOMM and Wav2Lip pre-trained models
  - Released several interesting applications based on StyleGANv2, such as face inversion, face fusion and face editing
  - Released LapStyle, Baidu's self-developed style transfer model, along with its interesting applications, and launched the official website experience page
  - Released a lightweight image super-resolution model PAN
- v2.0.0 (2021.6.2)
  - Released the First Order Motion model and multiple
