IOD

(TPAMI 2021) iOD: Incremental Object Detection via Meta-Learning

Generate Convert Improve

Install / Use

/learn @JosephKJ/IOD

About this skill

Quality Score

0/100

README

Incremental Object Detection via Meta-Learning

Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

DOI 10.1109/TPAMI.2021.3124133

Early access on IEEE Xplore: https://ieeexplore.ieee.org/document/9599446

arXiv paper: https://arxiv.org/abs/2003.08798

Abstract

In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few efforts have been reported to address this limitation, all of which apply variants of knowledge distillation to avoid catastrophic forgetting.

We note that although distillation helps to retain previous learning, it obstructs fast adaptability to new tasks, which is a critical requirement for incremental learning. In this pursuit, we propose a meta-learning approach that learns to reshape model gradients, such that information across incremental tasks is optimally shared. This ensures a seamless information transfer via a meta-learned gradient preconditioning that minimizes forgetting and maximizes knowledge transfer. In comparison to existing meta-learning methods, our approach is task-agnostic, allows incremental addition of new-classes and scales to high-capacity models for object detection.

We evaluate our approach on a variety of incremental learning settings defined on PASCAL-VOC and MS COCO datasets, where our approach performs favourably well against state-of-the-art methods.

<img src="https://user-images.githubusercontent.com/4231550/145962389-75511c27-3d9f-4dd2-be93-934dcdf4d70c.jpg" width="800" /> Figure: Qualitative results of our incremental object detector trained in a 10+10 setting where the first task contain instances of aeroplane, bicycle, bird, boat, bottle, bus, car, cat, chair and cow, while the second task learns instance from diningtable, dog, horse, motorbike, person, pottedplant, sheep, sofa, train and tvmonitor. Our model is able to detect instances of both tasks alike, without forgetting.

Installation and setup

Install the Detectron2 library that is packages along with this code base. See INSTALL.md.
Download and extract Pascal VOC 2007 to ./datasets/VOC2007/
Use the starter script: run.sh

Trained Models and Logs

| Setting | Reported mAP | Reproduced mAP | Commands | Models and logs | |:-------:|:------------:|:--------------:|:--------:|:---------------:| | 19+1 | 70.2 | 70.4 | run.sh | Google Drive | | 15+5 | 67.8 | 69.6 | run.sh | Google Drive | | 10+10 | 66.3 | 67.3 | run.sh | Google Drive |

Configurations with which the above results were reproduced:

Python version: 3.6.7
PyTorch version: 1.3.0
CUDA version: 11.0
GPUs: 4 x NVIDIA GTX 1080-ti

Acknowledgement

The code is build on top of Detectron2 library.

Citation

If you find our research useful, please consider citing us:

@ARTICLE {joseph2021incremental,
author = {Joseph. KJ and Jathushan. Rajasegaran and Salman. Khan and Fahad. Khan and Vineeth. N Balasubramanian},
journal = {IEEE Transactions on Pattern Analysis & Machine Intelligence},
title = {Incremental Object Detection via Meta-Learning},
year = {2021},
issn = {1939-3539},
doi = {10.1109/TPAMI.2021.3124133},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
month = {nov}
}

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

last30days-skill

13.8k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

000-main-rules

Project Context - Name: Interactive Developer Portfolio - Stack: Next.js (App Router), TypeScript, React, Tailwind CSS, Three.js - Architecture: Component-driven UI with a strict separation of conce

JosephKJ

View profile

View on GitHub

GitHub Stars136

CategoryEducation

Updated8d ago

Forks23

JosephKJ/iOD

Languages

Python

Security Score

100/100

Audited on Mar 20, 2026

No findings