LearnableISDA
[IEEE TIP] Fine-grained Recognition with Learnable Semantic Data Augmentation
Install / Use
/learn @LeapLabTHU/LearnableISDAREADME
Fine-grained Recognition with Learnable Semantic Data Augmentation
accepted by IEEE Transactions on Image Processing (IEEE TIP)
Authors: Yifan Pu*, Yizeng Han*, Yulin Wang, Junlan Feng, Chao Deng, Gao Huang#.
*: Equal contribution, #: Corresponding author.
Get Started
-
prepare environment
conda create --name learnable_isda python=3.8 conda activate learnable_isda pip install torch==2.0.0 torchvision==0.15.1 --index-url https://download.pytorch.org/whl/cu118 pip install scipy pandas matplotlib imageio -
prepare data
Download CUB-200-2011 from the official website
-
prepare pretrained checkpoint
mkdir pretrained_models cd pretrained_models wget https://download.pytorch.org/models/resnet50-0676ba61.pth cd ..
Usage
training
CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py \
--data_root YOUR_DATA_PATH --output_dir_root ./ --output_dir output/ \
--model_type resnet50 --pretrained_dir ./pretrained_models/resnet50-0676ba61.pth \
--dataset CUB_200_2011 --train_batch_size 128 --lr 3e-2 --eval_batch_size 64 --workers 1 \
--meta_lr 1e-3 --meta_net_hidden_size 512 --meta_net_num_layers 1 --lambda_0 10.0 \
--epochs 100 --warmup_epochs 5;
Citation
If you find our work is useful in your research, please consider citing:
@article{pu2023fine,
title={Fine-grained recognition with learnable semantic data augmentation},
author={Pu, Yifan and Han, Yizeng and Wang, Yulin and Feng, Junlan and Deng, Chao and Huang, Gao},
journal={IEEE Transactions on Image Processing},
year={2023}
}
Contact
If you have any questions, please feel free to contact the authors.
Yifan Pu: pyf20@mails.tsinghua.edu.cn, yifanpu98@126.com.
Yizeng Han: hanyz18@mails.tsinghua.edu.cn, yizeng38@gmail.com.
Related Skills
node-connect
340.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
84.1kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
340.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
84.1kCommit, push, and open a PR
