DragDiffusion

Unofficial implementation of DragDiffusion

Generate Convert Improve

Install / Use

/learn @Advocate99/DragDiffusion

About this skill

Quality Score

0/100

README

DragDiffusion

This is an unofficial code for DragDiffusion.

We show the DragDiffusion in a proof-of-concept way where we present the clean structured code of per-image optimization.

We hope the implementation of the principles helps.

The performances are not comparable with the paper's, and considering the performances, we do not include the GUI version yet.

Environment

conda env create -f environment.yml
conda activate diff

How-to

Put the image file in the ./finetune_data/ and finetune the SD-v1.5 with LoRA.

python dreambooth_lora.py --pretrained_model_name_or_path 'runwayml/stable-diffusion-v1-5' --instance_data_dir './finetune_data/' --instance_prompt 'xxy5syt00' --num_train_epochs 200 --checkpointing_steps 200 --output_dir 'lora-200'

Latent optimization.
```
python run_drag.py
```

Acknowledgement

Developed based on official version of DragGAN, unofficial version of DragGAN, and DIFT.

Related Skills

node-connect

349.2k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

109.5k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

349.2k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

349.2k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。