CIBA

[BMVC 2023] Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning

Generate Convert Improve

Install / Use

/learn @KuofengGao/CIBA

About this skill

Quality Score

0/100

README

Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning

This repository provides the pytorch implementatin of our BMVC 2023 work: Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning.

Abstract

A backdoored deep hashing model is expected to behave normally on original query images and return the images with the target label when a specific trigger pattern presents. To this end, we propose the confusing perturbations-induced backdoor attack (CIBA). It injects a small number of poisoned images with the correct label into the training data, which makes the attack hard to be detected. To craft the poisoned images, we first propose the confusing perturbations to disturb the hashing code learning. As such, the hashing model can learn more about the trigger. The confusing perturbations are imperceptible and generated by optimizing the intra-class dispersion and inter-class shift in the Hamming space. We then employ the targeted adversarial patch as the backdoor trigger to improve the attack performance. We have conducted extensive experiments to verify the effectiveness of our proposed CIBA

Installation

This code is tested on our local environment (python=3.7), and we recommend you to use anaconda to create a vitural environment:

conda create -n CIBA python=3.7

Then, activate the environment:

conda activate CIBA

Install PyTorch:

pip install torch==1.4.0 torchvision==0.5.0

Data Preparation

Please download the ImageNet dataset.
We give the list of training, database and query images in data_prepare/imagenet/train.txt, data_prepare/imagenet/database.txt and data_prepare/imagenet/query.txt. Note that replace the corresponding paths with yours.

Get Started

Pre-trained model

You should first train the model on the clean datasets. The model will be saved to models/<dataset>_<arch>_<n-bits>_backdoor.

python train.py --arch vgg11 --dataset imagenet --n-bits 48 --gpu-id 0

Generate the trigger pattern

The trigger pattern will be saved to <path>/<target_label>/<trigger_size>. We have provided five target labels and the trigger pattern in our experiments.

python generate_trigger_pattern.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --trigger_size 24 --target_label yurt --gpu-id 0

Backdoor attack

We craft poisoned images by adding trigger and perturbations to the images with the target label. Then, train the model on the poisoned dataset and test the backdoored model. The backdoored model will be saved to <path>/<target_label>/<trigger_size>/<poison_num>/<pert><clambda>.

Four backdoor attacks in our paper can be run as follows.

"Tri"

python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert non --gpu-id 0

"Tri+Noise"

python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert noise --gpu-id 0

"Tri+Adv"

python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert confusing --clambda 0 --gpu-id 0

"CIBA"

python backdoor_attack.py --path models/imagenet_vgg11_48_backdoor --arch vgg11 --dataset imagenet --n-bits 48 --poison_num 60 --trigger_size 24 --target_label yurt --pert confusing --clambda 0.8 --gpu-id 0

Citation

@inproceedings{gao2023ciba,
  title={Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning},
  author={Gao, Kuofeng and Bai, Jiawang and Chen, Bin and Wu, Dongxian and Xia, Shu-Tao},
  booktitle={BMVC},
  year={2023}
}

Acknowledgements

This respository is mainly based on DTHA. Thanks for their wonderful works!

Related Skills

node-connect

349.2k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

109.5k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

349.2k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

349.2k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。