DragDiffusion
Unofficial implementation of DragDiffusion
Install / Use
/learn @Advocate99/DragDiffusionREADME
DragDiffusion
This is an unofficial code for DragDiffusion.
We show the DragDiffusion in a proof-of-concept way where we present the clean structured code of per-image optimization.
We hope the implementation of the principles helps.
The performances are not comparable with the paper's, and considering the performances, we do not include the GUI version yet.
<img src="assets/demo_case.jpg" width="500" alt="Demo case of Our Implementation"/>Environment
conda env create -f environment.yml
conda activate diff
How-to
-
Put the image file in the
./finetune_data/and finetune the SD-v1.5 with LoRA.python dreambooth_lora.py --pretrained_model_name_or_path 'runwayml/stable-diffusion-v1-5' --instance_data_dir './finetune_data/' --instance_prompt 'xxy5syt00' --num_train_epochs 200 --checkpointing_steps 200 --output_dir 'lora-200' -
Latent optimization.
python run_drag.py
Acknowledgement
- Developed based on official version of DragGAN, unofficial version of DragGAN, and DIFT.
Related Skills
node-connect
349.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.5kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
349.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
349.2kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
