DiffuseMix
Official PyTorch implementation of DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)
DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)
<p align="center"> <img src="https://i.imgur.com/waxVImv.png" alt="DiffusMix"> </p>
<p align="center"> <img src="images/diffuseMix_flower102.png" alt="DiffusMix Treasure"> </p>
📢 Latest Updates
- Mar-11-24: The DiffuseMix paper is released on arXiv. 🔥🔥
🚀 Getting Started
Set up the Anaconda environment using the environment.yml file.
conda env create --name DiffuseMix --file=environment.yml
conda remove -n DiffuseMix --all # In case the environment installation failed
📝 List of Prompts
Below is the list of prompts. If your accuracy is low, you can use all of the prompts to improve performance. Each prompt takes time to generate images, so it is best to start with two prompts and then increase the number.
prompts = ["Autumn", "snowy", "watercolor art", "sunset", "rainbow", "aurora",
           "mosaic", "ukiyo-e", "a sketch with crayon"]
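The advice to start with two prompts and grow from there can be sketched as a small generator. This is only an illustration: the helper name `growing_prompt_sets` and the step size are assumptions, not part of the repository, and which prompts to try first is up to you (here they are simply taken in list order).

```python
# Prompt list copied from the README.
PROMPTS = ["Autumn", "snowy", "watercolor art", "sunset", "rainbow", "aurora",
           "mosaic", "ukiyo-e", "a sketch with crayon"]


def growing_prompt_sets(prompts, start=2, step=2):
    """Yield progressively larger prefixes of `prompts` (hypothetical helper).

    The first subset has `start` prompts; each later one adds up to `step`
    more, until the whole list has been yielded once.
    """
    if start < 1 or step < 1:
        raise ValueError("start and step must be at least 1")
    n = min(start, len(prompts))
    while True:
        yield prompts[:n]
        if n >= len(prompts):
            break
        n = min(n + step, len(prompts))


if __name__ == "__main__":
    for subset in growing_prompt_sets(PROMPTS):
        # Comma-separated form, as used by the --prompts argument of main.py.
        print(len(subset), ",".join(subset))
```

Each printed line is one candidate value for `--prompts`; rerun the augmentation with the next, larger set only if accuracy is still too low.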
📁 Dataset Structure
train
 └─── class 1
       └───── n04355338_22023.jpg
 └─── class 2
       └───── n03786901_5410.jpg
 └─── ...
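Before running the augmentation, it can help to confirm that the training directory follows the layout above (one sub-folder per class, images directly inside). The following is a minimal sketch using only the standard library; the function name and the set of accepted file extensions are assumptions and are not taken from the repository.

```python
import sys
from pathlib import Path

# Assumption: which file extensions count as images.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def count_images_per_class(train_dir):
    """Return {class_folder_name: image_count} for a train/<class>/<image> layout."""
    counts = {}
    for class_dir in sorted(p for p in Path(train_dir).iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1
            for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python check_layout.py PATH_TO_TRAIN   (script name is hypothetical)
    for name, n in count_images_per_class(sys.argv[1]).items():
        print(f"{name}: {n} images")
```

A class that reports 0 images usually means the files are nested one level too deep or use an extension outside the assumed set.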
✨ DiffuseMix Augmentation
To introduce structural complexity, download the fractal image dataset (Fractal Dataset) and pass its location via --fractal_dir:
python3 main.py --train_dir PATH --fractal_dir PATH --prompts "sunset,Autumn"
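If you drive the augmentation from a script, the command above can be assembled as an argument list instead of a hand-written shell string. This is a sketch under stated assumptions: `build_command` is a hypothetical helper, the example paths are placeholders, and only the three flags shown in this README are used.

```python
import shlex
import sys


def build_command(train_dir, fractal_dir, prompts, python=sys.executable):
    """Build the argv list for the README's main.py call (hypothetical helper)."""
    return [
        python, "main.py",
        "--train_dir", str(train_dir),
        "--fractal_dir", str(fractal_dir),
        # main.py takes the prompts as one comma-separated string.
        "--prompts", ",".join(prompts),
    ]


if __name__ == "__main__":
    # Placeholder paths; replace them with your own directories.
    cmd = build_command("data/train", "data/fractal", ["sunset", "Autumn"])
    print(shlex.join(cmd))  # Shell-quoted preview of the command.
    # To actually run it from the repository root:
    #   import subprocess
    #   subprocess.run(cmd, check=True)
```

Passing a list to `subprocess.run` avoids shell quoting problems with prompts that contain spaces, such as "watercolor art".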
💬 Citation
If you find our work useful in your research, please consider citing our papers:
@inproceedings{diffuseMix2024,
title={DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models},
author={Islam, Khawar and Zaheer, Muhammad Zaigham and Mahmood, Arif and Nandakumar, Karthik},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2024}
}
@article{islam2024genmix,
title={Genmix: effective data augmentation with generative diffusion model image editing},
author={Islam, Khawar and Zaheer, Muhammad Zaigham and Mahmood, Arif and Nandakumar, Karthik and Akhtar, Naveed},
journal={arXiv preprint arXiv:2412.02366},
year={2024}
}
@inproceedings{islam2025context,
title={Context-Guided Responsible Data Augmentation with Diffusion Models},
author={Islam, Khawar and Akhtar, Naveed},
booktitle={ICLR 2025 Workshop on Navigating and Addressing Data Problems for Foundation Models}
}
❤️ Acknowledgment
I am grateful to the author of Adversarial-AutoMixup (@JinXins) for providing the source and target images, which saved me a lot of time. I am also very thankful to the author of IPMix (@hzlsaber) for presenting their method's figures clearly, which greatly aided my paper, and for their timely responses to my concerns. Lastly, my thanks go to the author of GuidedMixup (@3neutronstar) for their insights on datasets and method outputs.
