Efficient Class Incremental Learning for Object Detection

Efficient Class Incremental Learning for Object Detection
Abstract
Get Started
- Installation
- Download COCO
Experiments
- MS-COCO
  - 40+40
  - 70+10
Training
- Normal training
- Incremental Training
Citation
Contact
Contributors

Abstract

Object Detection, a critical task in computer vision, involves identifying and localizing items within an image. Continual Object Detection (COD) extends this by incrementally introducing training samples for different object categories, posing challenges due to limited access to past data and Catastrophic Forgetting. Traditional techniques like Knowledge Distillation and Exemplar Replay often fall short, and models with large parameter counts prolong training, creating computational constraints. To address these issues, this study proposes the Efficient Continual Detection Transformer (ECOD), which leverages an efficient pretrained detector for generalization, pseudo-labeling for new data, and knowledge distillation on attention layers. LoRA improves parameter efficiency, reducing the parameters needed for fine-tuning while maintaining high performance. Extensive experiments on the COCO dataset validate this approach, demonstrating its superiority over state-of-the-art methods with only 3% of the trainable parameters, thus advancing the field of COD.

Figure: Architecture. Schematic of the ECOD framework.

Get Started

Installation

git clone https://github.com/tuanlda78202/cod.git && cd cod
pip install -q -r requirements.txt

Download COCO

mkdir coco && cd coco 
wget http://images.cocodataset.org/zips/train2017.zip && unzip train2017 && rm train2017.zip
wget http://images.cocodataset.org/zips/val2017.zip && unzip val2017 && rm val2017.zip
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip && unzip annotations_trainval2017 && rm annotations_trainval2017.zip
cd ..
# Note: Set your COCO path in `configs/dataset/coco_detection.yml`
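
Before pointing `configs/dataset/coco_detection.yml` at the dataset, it can help to confirm that the download commands produced the usual COCO 2017 layout. The following is a minimal sketch. The helper name `missing_coco_paths` is hypothetical and not part of this repository, and it checks only the two image folders and the two `instances_*` annotation files that the official archives unpack.

```python
from pathlib import Path

# Paths the official COCO 2017 archives unpack to, relative to the dataset root.
EXPECTED_COCO_PATHS = (
    "train2017",
    "val2017",
    "annotations/instances_train2017.json",
    "annotations/instances_val2017.json",
)


def missing_coco_paths(root):
    """Return the expected COCO entries that are absent under `root`."""
    root = Path(root)
    return [rel for rel in EXPECTED_COCO_PATHS if not (root / rel).exists()]


if __name__ == "__main__":
    # "coco" is the directory created by the download commands.
    missing = missing_coco_paths("coco")
    print("COCO layout looks complete" if not missing else f"Missing: {missing}")
```

An empty result means the four entries exist. It does not validate the contents of the archives.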

Experiments

MS-COCO

40+40

| Method | Baseline | AP | AP50 | AP75 | APS | APM | APL |
|:---------------|:----------------|------:|------:|------:|------:|------:|------:|
| LWF | GFLv1 | 17.2 | 45.0 | 18.6 | 7.9 | 18.4 | 24.3 |
| RILOD | GFLv1 | 29.9 | 45.0 | 32.9 | 18.5 | 33.0 | 40.5 |
| SID | GFLv1 | 34.0 | 51.4 | 36.3 | 18.4 | 38.4 | 44.9 |
| ERD | GFLv1 | 36.9 | 54.5 | 39.6 | 21.3 | 40.3 | 47.3 |
| CL-DETR | Deformable DETR | 42.0 | 60.1 | 51.2 | 24.0 | 48.4 | 55.6 |
| SDDGR | Deformable DETR | 43.0 | 62.1 | 47.1 | 24.9 | 46.9 | 57.0 |
| ECOD (Ours) | RT-DETR | 47.1 | 63.6 | 51.2 | 30.0 | 50.8 | 61.7 |
| Relative Improv. (%) | - | 9.5 | 2.4 | 0.0 | 20.5 | 5.0 | 8.3 |
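
The last row of the table appears to be the relative gain of ECOD over the best previous method in each column. That reading is an assumption, since the README does not state the formula. Under it, the AP and APS entries can be reproduced from the rows above as follows (the function name `relative_improvement` is illustrative):

```python
def relative_improvement(ours, best_previous):
    """Relative gain of `ours` over `best_previous`, in percent."""
    return (ours - best_previous) / best_previous * 100.0


# Values copied from the 40+40 table: ECOD vs. SDDGR,
# which is the best earlier method on both AP and APS.
ap_gain = relative_improvement(47.1, 43.0)
aps_gain = relative_improvement(30.0, 24.9)

print(f"AP:  {ap_gain:.1f}%")
print(f"APS: {aps_gain:.1f}%")
```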

70+10

| Method | Baseline | AP | AP50 | AP75 | APS | APM | APL |
|:---------------|:----------------|------:|------:|------:|------:|------:|------:|
| LWF | GFLv1 | 7.1 | 12.4 | 7.0 | 4.8 | 9.5 | 10.0 |
| RILOD | GFLv1 | 24.5 | 37.9 | 25.7 | 14.2 | 27.4 | 36.4 |
| MMA | - | 30.2 | 52.1 | - | - | - | - |
| ABR | - | 31.1 | 52.9 | 32.7 | - | - | - |
| SID | GFLv1 | 32.8 | 49.9 | 35.0 | 17.1 | 36.9 | 44.5 |
| ERD | GFLv1 | 34.9 | 51.9 | 35.7 | 17.4 | 38.8 | 45.4 |
| CL-DETR | Deformable DETR | 35.8 | 53.5 | 39.5 | 19.4 | 43.0 | 48.6 |
| SDDGR | Deformable DETR | 38.6 | 56.2 | 42.1 | 22.3 | 43.5 | 51.4 |
| VLM-PL | Deformable DETR | 39.8 | 58.2 | 43.2 | 22.4 | 43.5 | 51.6 |
| ECOD (Ours) | RT-DETR | 43.6 | 58.8 | 47.7 | 27.7 | 47.8 | 58.1 |
| Relative Improv. (%) | - | 9.6 | 1.0 | 10.4 | 23.7 | 9.9 | 12.6 |

Training

Normal training

Training

python scripts/train.py -t /path/to/ckpt/objects365

Evaluate

python scripts/train.py -r /path/to/ckpt/training --test-only

Incremental Training

configs/rtdetr/include/dataloader.yml
- data_ratio: the class split, one of 4040, 7010, 402020, 4010101010 (for example, 4040 is the 40+40 setting and 7010 is the 70+10 setting)
- For continual learning (CL), set task_idx $\ge$ 1
- To use a buffer in CL, set buffer_mode = True and choose a buffer_rate
configs/rtdetr/include/rtdetr_r50vd.yml
- In CL mode, set task_idx $\ge$ 1
configs/cl_pipeline.yml
- To use LoRA, set lora_train and lora_val to True
- In CL mode, set lora_cl, pseudo_label and distill_attn to True
- teacher_path: path to the model trained on the previous task
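
The data_ratio values read as concatenated two-digit class counts, for example 4040 for the 40+40 setting and 7010 for the 70+10 setting reported under Experiments. The sketch below decodes a value under that assumption and lists which class indices each task would introduce if classes were assigned in order. Both helper names and the in-order assignment are hypothetical and are not taken from the repository's code.

```python
def split_data_ratio(data_ratio):
    """Decode e.g. 402020 into [40, 20, 20], assuming two digits per task."""
    digits = str(data_ratio)
    if len(digits) % 2 != 0:
        raise ValueError(f"expected an even number of digits, got {digits!r}")
    return [int(digits[i:i + 2]) for i in range(0, len(digits), 2)]


def task_class_ranges(data_ratio):
    """Map task_idx -> range of class indices, assuming in-order assignment."""
    ranges, start = {}, 0
    for task_idx, count in enumerate(split_data_ratio(data_ratio)):
        ranges[task_idx] = range(start, start + count)
        start += count
    return ranges


# The four values listed for data_ratio in dataloader.yml.
for ratio in (4040, 7010, 402020, 4010101010):
    print(ratio, split_data_ratio(ratio))
```

Under this reading, task_idx 0 would be the base task and any task_idx $\ge$ 1 an incremental step, which is consistent with the instruction to set task_idx $\ge$ 1 in CL mode.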

# Note: To clean up WandB, kill any leftover wandb processes
ps aux|grep wandb|grep -v grep | awk '{print $2}'|xargs kill -9

Citation

If you find my work useful in your research, please cite:

@misc{tuanlda78202,
  author = {Le Duc Anh Tuan},
  title = {Efficient Class Incremental Learning for Object Detection},
  year = {2024},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/tuanlda78202/cod}},
  commit = {31d9bed36a06fbc86c3f7b587367cd33a16cc535}
}

Contact

If you have any questions, please feel free to email the authors.

Contributors
