
GFM

[IJCV 2022] Bridging Composite and Real: Towards End-to-end Deep Image Matting

Install / Use

/learn @JizhiziLi/GFM
README

<h1 align="center">Bridging Composite and Real: Towards End-to-end Deep Image Matting [IJCV-2022]</h1> <p align="center"> <a href="https://arxiv.org/pdf/2010.16188.pdf"><img src="demo/src/icon/arXiv-Paper.svg" ></a> <a href="https://link.springer.com/article/10.1007/s11263-021-01541-0"><img src="demo/src/icon/publication-Paper.svg" ></a> <a href="https://colab.research.google.com/drive/1EaQ5h4u9Q_MmDSFTDmFG0ZOeSsFuRTsJ?usp=sharing"><img src="demo/src/icon/colab-badge.svg"></a> <a href="https://opensource.org/licenses/MIT"><img src="demo/src/icon/license-MIT.svg"></a> <a href="https://paperswithcode.com/sota/image-matting-on-am-2k"><img src="https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/end-to-end-animal-image-matting/image-matting-on-am-2k"></a> <a href="https://www.youtube.com/watch?v=FJPm4YQOEyo"><img src="demo/src/icon/youtube-demo.svg"></a> <a href="https://www.bilibili.com/video/BV1X34y1o7nK"><img src="demo/src/icon/bilibili-demo.svg"></a> <!-- <a href="https://paperswithcode.com/sota/image-matting-on-aim-500?p=end-to-end-animal-image-matting"><img src="https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/end-to-end-animal-image-matting/image-matting-on-aim-500"></a> --> </p> <h4 align="center">This is the official repository of the paper <a href="https://arxiv.org/abs/2010.16188">Bridging Composite and Real: Towards End-to-end Deep Image Matting</a>.</h4> <h5 align="center"><em>Jizhizi Li<sup>1&#8727;</sup>, Jing Zhang<sup>1&#8727;</sup>, Stephen J. 
Maybank<sup>2</sup>, and Dacheng Tao<sup>1</sup></em></h5> <h6 align="center">1 The University of Sydney, Sydney, Australia; 2 Birkbeck College, University of London, U.K.</h6> <h6 align="center">IJCV 2022 (DOI 10.1007/s11263-021-01541-0)</h6> <p align="center"> <a href="#demo-on-google-colab">Google Colab Demo</a> | <a href="#introduction">Introduction</a> | <a href="#gfm">GFM</a> | <a href="#am-2k">AM-2k</a> | <a href="#bg-20k">BG-20k</a> | <a href="#results-demo">Results Demo</a> | <a href="https://github.com/JizhiziLi/GFM/tree/master/core">Train and Test</a> | <a href="#inference-code---how-to-test-on-your-images">Inference Code</a> | <a href="#statement">Statement</a> </p>

<img src="demo/src/homepage/spring.gif" width="25%"><img src="demo/src/homepage/summer.gif" width="25%"><img src="demo/src/homepage/autumn.gif" width="25%"><img src="demo/src/homepage/winter.gif" width="25%">


<h3><strong><i>🚀 News</i></strong></h3>

[2021-11-12]: The training code, test code, and all pretrained models have been released on this code-base page.

[2021-10-22]: The paper has been accepted by the International Journal of Computer Vision (IJCV)! 🎉

[2021-09-21]: The datasets <a href="#am-2k"><strong>AM-2k</strong></a> and <a href="#bg-20k"><strong>BG-20k</strong></a> can now be <strong>openly accessed</strong> via the links below (on both Google Drive and Baidu Wangpan)! Please follow the dataset release agreements to gain access. Due to privacy concerns, the dataset <strong>PM-10k</strong> will be published after privacy-preserving processing as part of the project Privacy-Preserving Portrait Matting (ACM MM 21). Please refer to that repo for access and updates.

| Dataset | <p>Dataset Link<br>(Google Drive)</p> | <p>Dataset Link<br>(Baidu Wangpan 百度网盘)</p> | Dataset Release Agreement |
| :----: | :----: | :----: | :----: |
| <strong>AM-2k</strong> | Link | Link (pw: 29r1) | Agreement (MIT License) |
| <strong>BG-20k</strong> | Link | Link (pw: dffp) | Agreement (MIT License) |

[2020-11-17]: Created a <a href="https://colab.research.google.com/drive/1EaQ5h4u9Q_MmDSFTDmFG0ZOeSsFuRTsJ?usp=sharing"><strong>Google Colab</strong></a> demo for users who want to try the method online.

[2020-11-03]: Published the <a href="#inference-code-how-to-test-on-your-images">inference code</a> and a pretrained model that can be used to test on your own animal images.

[2020-10-27]: Published a video demo (YouTube | bilibili | Google Drive) covering the motivation, the network, the datasets, and test results on an animal video.

Demo on Google Colab

<p align="justify"> For those who do not have a GPU in their environment, or who simply want a quick try online, our <a href="https://colab.research.google.com/drive/1EaQ5h4u9Q_MmDSFTDmFG0ZOeSsFuRTsJ?usp=sharing">Google Colab</a> demo makes it easy to generate results for your own images.</p>

<a href="https://colab.research.google.com/drive/1EaQ5h4u9Q_MmDSFTDmFG0ZOeSsFuRTsJ?usp=sharing"><img src="https://colab.research.google.com/assets/colab-badge.svg"></a>

Introduction

<p align="justify">This repository contains the code, datasets, models, test results, and a video demo for the paper <a href="https://arxiv.org/pdf/2010.16188.pdf">Bridging Composite and Real: Towards End-to-end Deep Image Matting</a>. We propose a novel Glance and Focus Matting network (<strong>GFM</strong>), which employs a shared encoder and two separate decoders to learn both tasks in a collaborative manner for end-to-end image matting. We also establish a novel Animal Matting dataset (<strong>AM-2k</strong>) to serve the end-to-end matting task. Furthermore, we systematically investigate the domain gap between composite images and natural images, and propose a carefully designed composite route, <strong>RSSN</strong>, along with a large-scale high-resolution background dataset (<strong>BG-20k</strong>) that provides better candidates for composition.</p> <p align="justify">We have released the training code, the test code, the datasets, and the pretrained models on this <a href="https://github.com/JizhiziLi/GFM/tree/master/core">code-base page</a>. We have also prepared a <a href="https://colab.research.google.com/drive/1EaQ5h4u9Q_MmDSFTDmFG0ZOeSsFuRTsJ?usp=sharing">Google Colab</a> demo and <a href="#inference-code-how-to-test-on-your-images"><i>inference code</i></a> so you can test our pretrained models on your own sample images. For the datasets <strong>AM-2k</strong> and <strong>BG-20k</strong>, please follow the sections <a href="#am-2k"><i>AM-2k</i></a> and <a href="#bg-20k"><i>BG-20k</i></a> for access. We also provide a video demo (<a href="https://www.youtube.com/watch?v=FJPm4YQOEyo">YouTube</a> | <a href="https://www.bilibili.com/video/BV1X34y1o7nK">bilibili</a>) illustrating the motivation, the network, the datasets, and the test results on an animal video.</p>
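Composition-based training data of the kind studied here is built with the standard matting equation, I = αF + (1 − α)B, pasting a foreground onto a background image. The snippet below is a minimal NumPy sketch of that equation using toy arrays; it is only the basic compositing step, not the repository's full RSSN route, which additionally addresses composition artifacts between composite and real images.

```python
import numpy as np

def composite(fg, alpha, bg):
    """Standard matting composition: I = alpha * F + (1 - alpha) * B.
    fg, bg: HxWx3 float arrays in [0, 1]; alpha: HxW float array in [0, 1]."""
    a = alpha[..., None]                 # add a channel axis for broadcasting
    return a * fg + (1.0 - a) * bg

fg = np.ones((4, 4, 3)) * 0.8            # toy foreground
bg = np.zeros((4, 4, 3))                 # toy background (stand-in for a BG-20k image)
alpha = np.full((4, 4), 0.5)             # uniform semi-transparency
img = composite(fg, alpha, bg)           # every pixel becomes 0.5 * 0.8 = 0.4
```

In the paper's setting, `bg` would be a high-resolution image sampled from BG-20k rather than a low-quality or reused background, which is part of what narrows the composite-to-real domain gap.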

GFM

The architecture of our proposed end-to-end method <strong>GFM</strong> is illustrated below. We adopt three kinds of <em>Representation of Semantic and Transition Area</em> (<strong>RoSTa</strong>) within our method: -TT, -FT, and -BT.
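In the TT setting, the glance decoder's target is a 3-class trimap (background / transition / foreground) derived from the ground-truth alpha matte. The sketch below illustrates one common way to build such a trimap, by eroding the definite foreground and background regions so that a transition band remains; the kernel size `k` and the naive shift-based erosion are illustrative choices, not the repository's actual preprocessing.

```python
import numpy as np

def alpha_to_trimap(alpha, k=2):
    """Derive a 3-class trimap (0=background, 128=transition, 255=foreground)
    from an alpha matte in [0, 1] by eroding the definite regions."""
    fg = alpha >= 1.0          # definitely foreground
    bg = alpha <= 0.0          # definitely background

    def erode(mask, k):
        # naive square-kernel erosion via shifted minima (illustration only)
        out = mask.copy()
        for dy in range(-k, k + 1):
            for dx in range(-k, k + 1):
                out &= np.roll(np.roll(mask, dy, axis=0), dx, axis=1)
        return out

    trimap = np.full(alpha.shape, 128, dtype=np.uint8)  # default: transition
    trimap[erode(bg, k)] = 0
    trimap[erode(fg, k)] = 255
    return trimap

alpha = np.zeros((8, 8))
alpha[2:6, 2:6] = 1.0
alpha[2, 2] = 0.5            # one soft (partially transparent) pixel
tri = alpha_to_trimap(alpha, k=1)
```

The -FT and -BT representations would instead mark the foreground or background side together with the transition band, but follow the same erode-to-a-band idea.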

We trained GFM with four backbones: -(r) (ResNet-34), -(d) (DenseNet-121), -(r2b) (ResNet-34 with 2 extra blocks), and -(r') (ResNet-101). The trained model for each backbone can be downloaded via the links listed below.

| Model | GFM(d)-TT | GFM(r)-TT | GFM(r)-FT | GFM(r)-BT | GFM(r2b)-TT | GFM(r')-TT | GFM(d)-RIM |
| :----: | :----: | :----: | :----: | :----: | :----: | :----: | :----: |
| Google Drive | Link | Link | Link | Link | Link | Link | Link |
| <p>Baidu Wangpan<br>(百度网盘)</p> | <p><a href="https://pan.baidu.com/s/1AzuMphkNtt5-fJh-VqPnCA">Link</a><br>(pw: l6bd)</p> | <p><a href="https://pan.baidu.com/s/14TfNWeDGzXm4w91eHWu28w">Link</a><br>(pw: svcv)</p> | <p><a href="https://pan.baidu.com/s/1GmaXfiWbK09X4zhsRgooBg">Link</a><br>(pw: jfli)</p> | <p><a href="https://pan.baidu.com/s/1oaT5R8GnMW-zbbCwie1SlA">Link</a><br>(pw: 80k8)</p> | <p><a href="https://pan.baidu.com/s/1yfRGgI9QFUW9jb878AXHTg">Link</a><br>(pw: 34hf)</p> | <p><a href="https://pan.baidu.com/s/1aKUEB1MYIDbt-8iOHq67zQ">Link</a><br>(pw: 7p8j)</p> | <p><a href="https://pan.baidu.com/s/1V-YjxUsyzUsTRO8m6JoR_Q">Link</a><br>(pw: mrf7)</p> |
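At inference time, GFM merges the two decoder outputs via Collaborative Matting: inside the predicted transition region, the final alpha comes from the focus decoder; elsewhere, it comes from the glance decoder's semantic prediction. The NumPy sketch below shows this fusion rule for the TT representation; the class encoding (0=background, 1=transition, 2=foreground) is an assumption for illustration and may differ from the repository's actual code.

```python
import numpy as np

def collaborative_matting(glance_seg, focus_alpha):
    """Fuse the two decoder outputs (TT setting): the focus decoder's alpha
    is used inside the predicted transition region, and the glance decoder's
    hard foreground/background decision is used everywhere else.
    glance_seg: HxW int array with 0=bg, 1=transition, 2=fg.
    focus_alpha: HxW float array in [0, 1]."""
    alpha = np.where(glance_seg == 2, 1.0, 0.0)      # definite regions
    alpha = np.where(glance_seg == 1, focus_alpha, alpha)
    return alpha

seg = np.array([[0, 1],
                [1, 2]])
focus = np.array([[0.9, 0.3],
                  [0.6, 0.1]])
out = collaborative_matting(seg, focus)
# out is [[0.0, 0.3], [0.6, 1.0]]: focus values survive only in the
# transition cells, while definite bg/fg override the focus prediction.
```

This division of labor is what lets the glance decoder handle global semantics while the focus decoder concentrates on fine detail in the transition band.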

AM-2k

Our proposed <strong>AM-2k</strong> contains 2,000 high-resolution natural animal images from 20 categories, along with manually labeled alpha mattes. Some examples are shown below; more can be viewed in the video demo (YouTube | bilibili | Google Drive).

<strong>AM-2k</strong> can be accessed via Google Drive or Baidu Wangpan (pw: 29r1); see the dataset table above for the download links and the release agreement.
