DifficultyAwareEmbedding
A difficulty-aware embedding of complementary deep networks for image classification
Install / Use
/learn @qychen13/DifficultyAwareEmbeddingREADME
DifficultyAwareEmbedding
A difficulty-aware embedding of complementary deep networks for image classification It is the code released for Embedding Complementary Deep Networks for Image Classification(CVPR2019).
<p align='center'> <img src='./imgs/imagenet.png' width='400'> </p>Citation
If you find the paper or repository is useful, please kindly cite:
@InProceedings{Chen_2019_CVPR,
author = {Chen, Qiuyu and Zhang, Wei and Yu, Jun and Fan, Jianping},
title = {Embedding Complementary Deep Networks for Image Classification},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2019}
}
Setup/Prerequisite
- install the prerequisite python packages
pip install -r requirements.txt
- Run the Visdom server for monitoring the training/test process
visdom
<p align='center'>
<img src='./imgs/ScreenShot-Training.png' width='400'>
</p>
<p align='center'>
<img src='./imgs/ScreenShot-Test.png' width='400'>
</p>
Training
- Download the resnet50 pretrained model from official torchvision
wget -P checkpoints/test-resnet50 https://download.pytorch.org/models/resnet50-19c8e357.pth
- Run the embedding algrithms for the following rounds
# for the second round
./scripts/trainval_resnet50.sh
Test
-
Download the resnet50 pretrained model and put it under checkpoints/resnet folder
-
Run the test script
./scripts/test_resnet50.sh
Related Skills
node-connect
351.4kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
110.7kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
351.4kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
351.4kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
