DeepHarmonization
Demo code of the paper: "Deep Image Harmonization", Y.-H. Tsai, X. Shen, Z. Lin, K. Sunkavalli, X. Lu and M.-H. Yang, CVPR 2017
Install / Use
/learn @wasidennis/DeepHarmonizationREADME
Deep Image Harmonization

Project webpage: https://sites.google.com/site/yihsuantsai/research/cvpr17-harmonization <br /> Contact: Yi-Hsuan Tsai (wasidennis at gmail dot com)
Paper
Deep Image Harmonization <br /> Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu and Ming-Hsuan Yang <br /> IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
This is the authors' demo code described in the above paper. Please cite our paper if you find it useful for your research.
- One re-implementation of our dataset and model: https://github.com/bcmi/Image_Harmonization_Datasets
Installation and Usage
-
Download and unzip the code.
-
Install Caffe: http://caffe.berkeleyvision.org/.
-
Download the pre-trained caffe model and move it under the model folder.
-
Run
demo.pyon real composite images (including our test set collected in the paper).
Evaluation Set
- Download our complete set of real composite images, including our harmonization results here.
Note
The model, code and dataset are available for non-commercial research purposes only.
Log
- 03/2017: demo code released
- 05/2017: complete evaluation set released
Related Skills
YC-Killer
2.7kA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
groundhog
400Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
last30days-skill
19.1kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
