DANet
DANet: Divergent Activation for Weakly Supervised Object Localization,in ICCV 2019
Install / Use
/learn @xuehaolan/DANetREADME
DANet
DANet: Divergent Activation for Weakly Supervised Object Localization,in ICCV 2019
Introduction
We propose a divergent activation (DA) approach, and target at learning complementary and discriminative visual patterns for image classification and weakly supervised object localization from the perspective of discrepancy. To this end, we design hierarchical divergent activation (HDA), which leverages the semantic discrepancy to spread feature activation, implicitly. We also propose discrepant divergent activation (DDA), which pursues object extent by learning mutually exclusive visual patterns, explicitly.

Getting started
Install
-
Clone this repo:
DANet_ROOT=/path/to/clone/DANet git clone --recursive https://github.com/xuehaolan/DANet $DANet_ROOT cd $DANet_ROOT -
Create an Anaconda environment with python2.7 and PyTorch>=0.4.0
Data preparation
Download the images of CUB-200-2011 dataset and place the data at $DANet_ROOT/data/CUB-200-2011
Train and test
-
Train
cd scripts sh train_DA_cub.sh -
Test
Download the pretrained model at GoogleDrive(https://drive.google.com/open?id=114zDDJe6zYbRjashntibbN1D4DM0E1uw)
cd scripts sh val_DA_cub.sh -
Note
- Using modified VGG(vgg_DA_p) could achieve higher performance for both CAM and DANet.
- The valiation code uses simple thresholding, using localization method provided by CAM may gets greater localization results.
Visualization
<img src='figs/results_CUB.png' align="center" width="400px"> <img src='figs/results_ILSVRC.png' align="center" width="400px">
Acknowledgement
In this project, we reimplemented CHR on PyTorch based on SPG.
Citation
Please consider citing our paper in your publications if the project helps your research.
@inproceedings{xue2019danet,
title={Danet: Divergent activation for weakly supervised object localization},
author={Xue, Haolan and Liu, Chang and Wan, Fang and Jiao, Jianbin and Ji, Xiangyang and Ye, Qixiang},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
pages={6589--6598},
year={2019}
}
Related Skills
node-connect
340.5kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
84.2kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
340.5kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
84.2kCommit, push, and open a PR
