DAEFormer
[MICCAI 2023 PRIME Workshop] DAE-Former: Dual Attention-guided Efficient Transformer for Medical Image Segmentation
Transformers have recently gained attention in the computer vision domain due to their ability to model long-range dependencies. However, the self-attention mechanism, which is the core part of the Transformer model, usually suffers from quadratic computational complexity with respect to the number of tokens. Many architectures attempt to reduce model complexity by limiting the self-attention mechanism to local regions or by redesigning the tokenization process. In this paper, we propose DAE-Former, a novel method that seeks to provide an alternative perspective by efficiently designing the self-attention mechanism. More specifically, we reformulate the self-attention mechanism to capture both spatial and channel relations across the whole feature dimension while staying computationally efficient. Furthermore, we redesign the skip connection path by including the cross-attention module to ensure the feature reusability and enhance the localization power. Our method outperforms state-of-the-art methods on multi-organ cardiac and skin lesion segmentation datasets, without pre-training weights.
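As a rough illustration of the linear-complexity idea described above, here is a sketch of the "efficient attention" formulation that this line of work builds on, where softmax is applied to queries and keys separately so the token-by-token attention map is never materialized. This is an illustration under our own naming, not the repository's implementation:

```python
# Sketch of efficient (linear) attention: instead of softmax(QK^T)V, which
# costs O(n^2) in the number of tokens n, compute K^T V first, a (d_k x d_v)
# "context" matrix, so the total cost is linear in n.
import numpy as np

def softmax(x, axis):
    e = np.exp(x - x.max(axis=axis, keepdims=True))  # numerically stable
    return e / e.sum(axis=axis, keepdims=True)

def efficient_attention(Q, K, V):
    # Q, K: (n, d_k); V: (n, d_v)
    q = softmax(Q, axis=1)   # normalize each query over the feature axis
    k = softmax(K, axis=0)   # normalize each key over the token axis
    context = k.T @ V        # (d_k, d_v): global context, no (n x n) map
    return q @ context       # (n, d_v)
```

Because each row of `q` and each column of `k` sums to one, every output row is a convex combination of token values, just as in standard attention, but the memory footprint stays linear in the sequence length.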

Updates
- July 25, 2023: Accepted in MICCAI 2023 PRIME Workshop! 🥳
- December 29, 2022: Initial release with arXiv.
Citation
```bibtex
@inproceedings{azad2023dae,
  title={DAE-Former: Dual attention-guided efficient transformer for medical image segmentation},
  author={Azad, Reza and Arimond, Ren{\'e} and Aghdam, Ehsan Khodapanah and Kazerouni, Amirhossein and Merhof, Dorit},
  booktitle={International Workshop on PRedictive Intelligence In MEdicine},
  pages={83--95},
  year={2023},
  organization={Springer}
}
```
How to use
The script train.py contains all the necessary steps for training the network. A list and dataloader for the Synapse dataset are also included.
To load a network, pass the `--module` argument to the train script (`--module <directory>.<module_name>.<class_name>`, e.g. `--module networks.DAEFormer.DAEFormer`).
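For illustration, a `--module <path>.<ClassName>` argument of this form can be resolved to a class with `importlib`. This is a sketch of the idea, not necessarily the exact code in `train.py`:

```python
# Resolve a dotted "--module" argument such as "networks.DAEFormer.DAEFormer"
# into the class object it names.
import importlib

def load_model_class(module_arg: str):
    """Split '<package>.<module>.<ClassName>' into module path and class name,
    import the module, and return the class."""
    module_path, class_name = module_arg.rsplit(".", 1)
    module = importlib.import_module(module_path)
    return getattr(module, class_name)

# Usage (hypothetical): model = load_model_class("networks.DAEFormer.DAEFormer")()
```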
Model weights
You can download the learned weights of DAE-Former from the following table.

| Task | Dataset | Learned weights |
| --- | --- | --- |
| Multi-organ segmentation | Synapse | DAE-Former |
Training and Testing
1. Download the Synapse dataset from here.

2. Run the following command to install the requirements:

   ```
   pip install -r requirements.txt
   ```

3. Run the command below to train DAE-Former on the Synapse dataset:

   ```
   python train.py --root_path ./data/Synapse/train_npz --test_path ./data/Synapse/test_vol_h5 --batch_size 20 --eval_interval 20 --max_epochs 400 --module networks.DAEFormer.DAEFormer
   ```

   - `--root_path` [Train data path]
   - `--test_path` [Test data path]
   - `--eval_interval` [Evaluation interval, in epochs]
   - `--module` [Module name, including path (you can also train your own models)]

4. Run the command below to test DAE-Former on the Synapse dataset:

   ```
   python test.py --volume_path ./data/Synapse/ --output_dir './model_out'
   ```

   - `--volume_path` [Root dir of the test data]
   - `--output_dir` [Directory of your learned weights]
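Before launching a run, it can help to verify that the dataset layout matches the paths passed on the command line. A minimal sketch (the helper name and default path are ours, not part of the repository):

```python
# Sanity-check that the expected Synapse directory layout exists before
# launching training/testing; the subdirectory names mirror the commands above.
import os

def check_synapse_layout(root="./data/Synapse"):
    expected = ["train_npz", "test_vol_h5"]
    missing = [d for d in expected
               if not os.path.isdir(os.path.join(root, d))]
    return missing  # an empty list means the layout matches
```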
Results
Performance comparison on the Synapse multi-organ segmentation dataset.
Query
The implementation was done by René Arimond. For any queries, please contact us for more information.
rene.arimond@lfb.rwth-aachen.de