# RETFound - A foundation model for retinal images
Official repo including a series of foundation models and applications for retinal images.

- **RETFound-MAE**: RETFound: a foundation model for generalizable disease detection from retinal images.
- **RETFound-DINOv2**: Revealing the Impact of Pre-training Data on Medical Foundation Models.
- **DINOv2**: General-purpose vision foundation models DINOv2 by Meta.
- **DINOv3**: General-purpose vision foundation models DINOv3 by Meta.
Please contact ykzhoua@gmail.com or yukun.zhou.19@ucl.ac.uk if you have questions.
## 📝Key features
- RETFound is pre-trained on 1.6 million retinal images with self-supervised learning
- RETFound has been validated in multiple disease detection tasks
- RETFound can be efficiently adapted to customised tasks
## 🎉News
- 🐉2025/09: A preprint benchmarking DINOv3, DINOv2, and RETFound is available!
- 🐉2025/09: We added the state-of-the-art DINOv3 to the fine-tuning pipeline for retinal applications!
- 🐉2025/02: We organised the model weights on HuggingFace, so no manual downloads are needed!
- 🐉2025/02: Multiple pre-trained weights, including MAE-based and DINOv2-based, have been added!
- 🐉2025/02: We updated package versions, e.g. CUDA 12+ and PyTorch 2.3+!
- 🐉2024/01: The feature vector notebook is now online!
- 🐉2024/01: Data splits and model checkpoints for public datasets are now online!
- 🎄2023/12: A Colab notebook is now online - free GPU & simple operation!
## 🔧Install environment
- Create environment with conda:

```shell
conda create -n retfound python=3.11.0 -y
conda activate retfound
```

- Install dependencies:

```shell
pip install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu121
git clone https://github.com/rmaphoh/RETFound/
cd RETFound
pip install -r requirements.txt
pip install ipykernel
python -m ipykernel install --user --name retfound --display-name "Python (retfound)"
```
## 🌱Fine-tuning with RETFound weights
- Get access to the pre-trained models on HuggingFace (register an account and fill in the access form).
- Log in to your HuggingFace account with a token, which can be created and copied from your account settings:

```shell
huggingface-cli login --token YOUR_HUGGINGFACE_TOKEN
```

Optional: if your machine or server cannot reach HuggingFace due to network restrictions, run the command below (do not run it if you have normal access):

```shell
export HF_ENDPOINT=https://hf-mirror.com
```
- To fine-tune DINOv2 or DINOv3, visit their GitHub repositories, download the model weights, and place them in the RETFound folder.
- Organise your data into this directory structure (public datasets used in this study can be downloaded here):

```
├── data folder
    ├── train
    │   ├── class_a
    │   ├── class_b
    │   └── class_c
    ├── val
    │   ├── class_a
    │   ├── class_b
    │   └── class_c
    └── test
        ├── class_a
        ├── class_b
        └── class_c
```
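The layout above can be created with a short shell sketch. The root folder name `data` and the `class_*` names are placeholders; substitute your own dataset root and class labels:

```shell
# Create the expected split/class layout (placeholder names; adapt to your dataset)
DATA_ROOT="data"
for split in train val test; do
  for cls in class_a class_b class_c; do
    mkdir -p "${DATA_ROOT}/${split}/${cls}"
  done
done
```

Copy each image into the class folder matching its label; the folder names are used as the class labels.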
- Start fine-tuning by running:

```shell
sh train.sh
```

In `train.sh`, the model can be selected by changing the variables `MODEL`, `MODEL_ARCH`, and `FINETUNE`:
RETFound:

| MODEL           | MODEL_ARCH      | FINETUNE                 | SIZE  |
|-----------------|-----------------|--------------------------|-------|
| RETFound_mae    | retfound_mae    | RETFound_mae_natureCFP   | ~300M |
| RETFound_mae    | retfound_mae    | RETFound_mae_natureOCT   | ~300M |
| RETFound_mae    | retfound_mae    | RETFound_mae_meh         | ~300M |
| RETFound_mae    | retfound_mae    | RETFound_mae_shanghai    | ~300M |
| RETFound_dinov2 | retfound_dinov2 | RETFound_dinov2_meh      | ~300M |
| RETFound_dinov2 | retfound_dinov2 | RETFound_dinov2_shanghai | ~300M |
DINOv3:

| MODEL  | MODEL_ARCH        | FINETUNE                       | SIZE  |
|--------|-------------------|--------------------------------|-------|
| Dinov3 | dinov3_vits16     | dinov3_vits16_pretrain.pth     | ~21M  |
| Dinov3 | dinov3_vits16plus | dinov3_vits16plus_pretrain.pth | ~29M  |
| Dinov3 | dinov3_vitb16     | dinov3_vitb16_pretrain.pth     | ~86M  |
| Dinov3 | dinov3_vitl16     | dinov3_vitl16_pretrain.pth     | ~300M |
| Dinov3 | dinov3_vith16plus | dinov3_vith16plus_pretrain.pth | ~840M |
| Dinov3 | dinov3_vit7b16    | dinov3_vit7b16_pretrain.pth    | ~6.7B |
DINOv2:

| MODEL  | MODEL_ARCH    | FINETUNE                   | SIZE  |
|--------|---------------|----------------------------|-------|
| Dinov2 | dinov2_vits14 | dinov2_vits14_pretrain.pth | ~21M  |
| Dinov2 | dinov2_vitb14 | dinov2_vitb14_pretrain.pth | ~86M  |
| Dinov2 | dinov2_vitl14 | dinov2_vitl14_pretrain.pth | ~300M |
| Dinov2 | dinov2_vitg14 | dinov2_vitg14_pretrain.pth | ~1.1B |
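For example, switching the pipeline to the smallest DINOv3 backbone means setting the three variables in `train.sh` to one row of the table (a sketch; it assumes `dinov3_vits16_pretrain.pth` has already been downloaded into the RETFound folder):

```shell
# Select DINOv3 ViT-S/16 (values taken from the DINOv3 table above)
MODEL="Dinov3"
MODEL_ARCH="dinov3_vits16"
FINETUNE="dinov3_vits16_pretrain.pth"
```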
Change `DATA_PATH` to your dataset directory:

```shell
# ==== Model settings ====
# adaptation: {finetune, lp}
ADAPTATION="finetune"
MODEL="RETFound_dinov2"
MODEL_ARCH="retfound_dinov2"
FINETUNE="RETFound_dinov2_meh"

# ==== Data settings ====
# change the dataset name and corresponding class number
DATASET="MESSIDOR2"
NUM_CLASS=5
# =======================
DATA_PATH="PATH TO THE DATASET"

TASK="${MODEL_ARCH}_${DATASET}_${ADAPTATION}"

torchrun --nproc_per_node=1 --master_port=48766 main_finetune.py \
  --model "${MODEL}" \
  --model_arch "${MODEL_ARCH}" \
  --finetune "${FINETUNE}" \
  --savemodel \
  --global_pool \
  --batch_size 24 \
  --world_size 1 \
  --epochs 50 \
  --nb_classes "${NUM_CLASS}" \
  --data_path "${DATA_PATH}" \
  --input_size 224 \
  --task "${TASK}" \
  --adaptation "${ADAPTATION}"
```
- For evaluation only (download data and model checkpoints here; change `DATA_PATH` below):

```shell
# ==== Model settings (match training) ====
ADAPTATION="finetune"
MODEL="RETFound_dinov2"
MODEL_ARCH="retfound_dinov2"
FINETUNE="RETFound_dinov2_meh"

# ==== Data settings (match training) ====
DATASET="MESSIDOR2"
NUM_CLASS=5
# =======================
DATA_PATH="PATH TO THE DATASET"

TASK="${MODEL_ARCH}_${DATASET}_${ADAPTATION}"

# Path to the trained checkpoint (adjust if you saved elsewhere)
CKPT="./output_dir/${TASK}/checkpoint-best.pth"

# ==== Evaluation only ====
torchrun --nproc_per_node=1 --master_port=48766 main_finetune.py \
  --model "${MODEL}" \
  --model_arch "${MODEL_ARCH}" \
  --savemodel \
  --global_pool \
  --batch_size 128 \
  --world_size 1 \
  --nb_classes "${NUM_CLASS}" \
  --data_path "${DATA_PATH}" \
  --input_size 224 \
  --task "${TASK}" \
  --adaptation "${ADAPTATION}" \
  --eval \
  --resume "${CKPT}"
```
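Before launching evaluation, it can help to confirm that the checkpoint path actually resolves. A minimal sketch, assuming the same `TASK` naming scheme as `train.sh` (the `TASK` value below is an example and must match your own training run):

```shell
# Build the checkpoint path the same way train.sh does, and warn if it is missing
TASK="retfound_dinov2_MESSIDOR2_finetune"
CKPT="./output_dir/${TASK}/checkpoint-best.pth"
if [ ! -f "${CKPT}" ]; then
  echo "checkpoint not found: ${CKPT}"
fi
```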
## 📃Citation

If you find this repository useful, please consider citing this paper:

```
@article{zhou2023foundation,
  title={A foundation model for generalizable disease detection from retinal images},
  author={Zhou, Yukun and Chia, Mark A and Wagn
```