MFDC

Multi-Faceted Distillation of Base-Novel Commonality for Few-shot Object Detection, ECCV 2022

Generate Convert Improve

Install / Use

/learn @shuangw98/MFDC

About this skill

Quality Score

0/100

README

Multi-Faceted Distillation of Base-Novel Commonality for Few-shot Object Detection, ECCV 2022

This repo is built upon DeFRCN, where you can download the datasets and the pre-trained weights.

Requirements

Python == 3.7.10

Pytorch == 1.6.0

Torchvision == 0.7.0

Detectron2 == 0.3

CUDA == 10.1

File Structure

    ├── weight/                   
    |   ├── R-101.pkl              
    |   └── resnet101-5d3b4d8f.pth   
    └── datasets/
        ├── coco/           
        │   ├── annotations/
        │   ├── train2014/
        │   └── val2014/
        ├── cocosplit/
        ├── VOC2007/            
        │   ├── Annotations/
        │   ├── ImageSets/
        │   └── JPEGImages/
        ├── VOC2012/            
        │   ├── Annotations/
        │   ├── ImageSets/
        │   └── JPEGImages/
        └── vocsplit/

Training and Evaluation

For VOC

sh voc_train.sh mfdc SPLIT_ID

For COCO

sh coco_train.sh mfdc

Citation

If you find our code helpful in your research, please cite the following publication:

@inproceedings{wu2022multi,
  title={Multi-faceted Distillation of Base-Novel Commonality for Few-Shot Object Detection},
  author={Wu, Shuang and Pei, Wenjie and Mei, Dianwen and Chen, Fanglin and Tian, Jiandong and Lu, Guangming},
  booktitle={European Conference on Computer Vision},
  pages={578--594},
  year={2022},
  organization={Springer}
}

Contact

Please feel free to contact me (Email: wushuang9811@outlook.com) if you have any questions.

Related Skills

node-connect

347.6k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

108.4k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

347.6k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

347.6k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。