DynamicRouting

Learning Dynamic Routing for Semantic Segmentation

Generate Convert Improve

Install / Use

/learn @Megvii-BaseDetection/DynamicRouting

About this skill

Quality Score

0/100

README

DynamicRouting

This project provides an implementation for "Learning Dynamic Routing for Semantic Segmentation" (CVPR2020 Oral) on PyTorch. For the reason that experiments in the paper were conducted using internal framework, this project reimplements them on dl_lib and reports detailed comparisons below. Some parts of code in dl_lib are based on detectron2.

introduce image

Requirement

Python >= 3.6
- python3 --version
PyTorch >= 1.3
- pip3 install torch torchvision
OpenCV
- pip3 install opencv-python
GCC >= 4.9
- gcc --version

Installation

Make sure that your get at least one gpu when compiling. Run:

git clone https://github.com/yanwei-li/DynamicRouting.git
cd DynamicRouting
sudo python3 setup.py build develop

Usage

Dataset

We use Cityscapes dataset for training and validation. Please refer to datasets/README.md or dataset structure in detectron2 for more details.

Cityscapes Download

Pretrained Model

We give ImageNet pretained models:

Layer16-Fix GoogleDrive
Layer33-Fix GoogleDrive

Training

For example, if you want to train Dynamic Network with Layer16 backbone:

Train from scratch

cd playground/Dynamic/Seg.Layer16
dl_train --num-gpus 4

Use ImageNet pretrain

cd playground/Dynamic/Seg.Layer16.ImageNet
dl_train --num-gpus 4 MODEL.WEIGHTS /path/to/your/save_dir/ckpt.pth

NOTE: Please set FIX_SIZE_FOR_FLOPS to [768,768] and [1024,2048] for training and evaluation, respectively.

Evaluation

You can evaluate the trained or downloaded model:

Evaluate the trained model
```
dl_test --num-gpus 8
```

Evaluate the downloaded model:

dl_test --num-gpus 8 MODEL.WEIGHTS /path/to/your/save_dir/ckpt.pth

NOTE: If your machine does not support such setting, please change settings in config.py to a suitable value.

Performance

Cityscapes val set

Without ImageNet Pretrain:

Methods | Backbone | Iter/K | mIoU (paper) | GFLOPs (paper) | mIoU (ours) | GFLOPs (ours) | Model :--:|:--:|:--:|:--:|:--:|:--:|:--:|:--: Dynamic-A | Layer16 | 186 | 72.8 | 44.9 | 73.9 | 52.5 | GoogleDrive Dynamic-B | Layer16 | 186 | 73.8 | 58.7 | 74.3 | 58.9 | GoogleDrive Dynamic-C | Layer16 | 186 | 74.6 | 66.6 | 74.8 | 59.8 | GoogleDrive Dynamic-Raw | Layer16 | 186 | 76.1 | 119.5 | 76.7 | 114.9 | GoogleDrive Dynamic-Raw | Layer16 | 558 | 78.3 | 113.3 | 78.1 | 114.2 | GoogleDrive

With ImageNet Pretrain:

Methods | Backbone | Iter/K | mIoU (paper) | GFLOPs (paper) | mIoU (ours) | GFLOPs (ours) | Model :--:|:--:|:--:|:--:|:--:|:--:|:--:|:--: Dynamic-Raw | Layer16 | 186 | 78.6 | 119.4 | 78.8 | 117.8 | GoogleDrive Dynamic-Raw | Layer33 | 186 | 79.2 | 242.3 | 79.4 | 243.1 | GoogleDrive

To do

[ ] Faster inference speed
[ ] Support more vision tasks
- [ ] Object detection
- [ ] Instance segmentation
- [ ] Panoptic segmentation

Acknowledgement

Citation

Consider cite the Dynamic Routing in your publications if it helps your research.

@inproceedings{li2020learning,
    title = {Learning Dynamic Routing for Semantic Segmentation},
    author = {Yanwei Li, Lin Song, Yukang Chen, Zeming Li, Xiangyu Zhang, Xingang Wang, Jian Sun},
    booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
    year = {2020}
}

Consider cite this project in your publications if it helps your research.

@misc{DynamicRouting,
    author = {Yanwei Li},
    title = {DynamicRouting},
    howpublished = {\url{https://github.com/yanwei-li/DynamicRouting}},
    year ={2020}
}

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

sec-edgar-agentkit

AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.

last30days-skill

4.5k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary