Dino U-Net

This is the official repository for Dino U-Net: Exploiting High-Fidelity Dense Features from Foundation Models for Medical Image Segmentation.

Teaser image

Dino U-Net: Exploiting High-Fidelity Dense Features from Foundation Models for Medical Image Segmentation.

This repository contains the official implementation of Dino U-Net, a novel architecture for medical image segmentation that integrates a pre-trained DINOv3 foundation model within U-Net architecture. By leveraging the high-fidelity dense features from DINOv3, Dino U-Net achieves state-of-the-art performance on various medical image segmentation tasks.

Features

Foundation Model: Utilizes the powerful DINOv3 as the high-fidelity feature extractor.
Multiple Model Sizes: Supports various DINOv3 model sizes, from ViT-S (~22M params) to ViT-7B (~7B params), allowing flexibility between performance and computational cost.
nnU-Net Integration: Built upon the robust and widely-used nnU-Net framework for data preprocessing, training, and evaluation.
High Performance: Achieves excellent results by transferring knowledge from natural images to medical segmentation tasks.

Supported Models

Dino U-Net supports several DINOv3 model variants, each with different parameter counts and computational requirements:

| Model Name | DINOv3 Backbone | Act. Params | Pre-trained Checkpoint | |-----------------|-----------------|------------|---------------------------------------------------------------| | dinounet_s | ViT-S/16 | ~5M | dinov3_vits16_pretrain_lvd1689m-08c60483.pth | | dinounet_b | ViT-B/16 | ~11M | dinov3_vitb16_pretrain_lvd1689m-73cec8be.pth | | dinounet_l | ViT-L/16 | ~18M | dinov3_vitl16_pretrain_lvd1689m-8aa4cbdd.pth | | dinounet_7b | ViT-7B/16 | ~220M | dinov3_vit7b16_pretrain_lvd1689m-a955f4ea.pth |

Prerequisites

Python 3.8+
PyTorch 1.10+
CUDA-enabled GPU

Installation

Clone the repository:

git clone https://github.com/yifangao112/DinoUNet.git
cd dino-unet

Create a virtual environment conda create -n dinounet python=3.10 -y and activate it conda activate dinounet
Install Pytorch
Install the required packages: It is recommended to create a virtual environment first.
```
pip install -r requirements.txt
```

Install the MultiScaleDeformableAttention module:

cd dinounet/dinov3/eval/segmentation/models/utils/ops
pip install .

Download the pre-trained DINOv3 checkpoints: Download the desired DINOv3 checkpoints from the official repository or another source and place them in the dinounet/checkpoints/ directory.

Dataset Preparation

This project uses the modified nnU-Net framework for data handling. Please format your dataset according to the nnU-Net guidelines.

Structure your dataset as follows:

/path/to/dataset/
├── imagesTr/
│   ├── case001_0000.nii.gz
│   └── ...
├── labelsTr/
│   ├── case001.nii.gz
│   └── ...
└── dataset.json

Set up nnU-Net Environment Variables: nnU-Net uses three environment variables to manage paths for raw data, preprocessed data, and model results.
- nnUNet_raw: Directory for storing raw datasets.
- nnUNet_preprocessed: Directory for storing preprocessed data.
- nnUNet_results: Directory for saving model weights and outputs.
You need to set these variables in your environment. For Linux/macOS, you can add the following lines to your .bashrc or .zshrc file:
```
export nnUNet_raw="/path/to/your/raw_data"
export nnUNet_preprocessed="/path/to/your/preprocessed_data"
export nnUNet_results="/path/to/your/model_results"
```
For more detailed instructions, including for Windows, please see the official nnU-Net documentation.

Training

You can train a Dino U-Net model using the dinounet_training.py script. The script handles data preprocessing, model building, and training.

Usage:

python dinounet_training.py --gpuid <GPU_ID> --model <MODEL_NAME> --datasetid <DATASET_ID> --epoch <NUM_EPOCHS>

Arguments:

--gpuid: The ID of the GPU to use for training (e.g., 0).
--model: The name of the model to train. Choose from dinounet_s, dinounet_b, dinounet_l, dinounet_7b.
--datasetid: The integer ID of your dataset, as registered with nnU-Net.
--epoch: The number of epochs to train for.

Example:

To train the dinounet_s model on dataset ID 9 for 200 epochs on GPU 2:

python dinounet_training.py --gpuid 2 --model dinounet_s --datasetid 9 --epoch 200

The script will automatically:

Preprocess the dataset.
Configure the network architecture.
Train the model.
Save the results and logs to the directory specified by nnUNet_results.

Note:

If you have previously generated nnU-Net plans, please set force_rerun=true for preprocessing to rebuild the plans and avoid using stale caches.

Evaluation

After training, the script automatically proceeds to the evaluation phase. It will compute metrics such as Dice score and Hausdorff Distance on the validation set. The results will be printed to the console and saved in the results folder.

Extending Dino U-Net

See the full extension guide here: extending-dinounet · 中文扩展指南

Acknowledgements

We gratefully acknowledge the following open-source projects that our work builds upon:

nnU-Net (docs, dataset format).
DINOv3 (repo).

DinoUNet

Install / Use

README