SkillAgentSearch skills...

FairSeg

[ICLR 2024] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Install / Use

/learn @Harvard-Ophthalmology-AI-Lab/FairSeg
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

FairSeg

[ICLR 24] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

by Yu Tian*, Min Shi*, Yan Luo*, Ava Kouhana, Tobias Elze, and Mengyu Wang.

<img width="1081" alt="Screenshot 2024-01-20 at 9 24 39 AM" src="https://github.com/Harvard-Ophthalmology-AI-Lab/Harvard-FairSeg/assets/19222962/176cd0d2-f3ec-4ac2-a0cb-65d66574f25b">

Download Harvard-FairSeg Dataset

  • The Harvard-FairSeg dataset can be accessed via this link. If you have any questions, please email harvardophai@gmail.com and harvardairobotics@gmail.com.

  • If you cannot directly download the Harvard-FairSeg dataset, please request access in the above Google Drive link, we will make sure to grant you access within 3-5 days.

  • Please refer to each of the folders for FairSeg with SAMed and TransUNet, respectively.

  • Note that, the modifier word “Harvard” only indicates that our dataset is from the Department of Ophthalmology of Harvard Medical School and does not imply an endorsement, sponsorship, or assumption of responsibility by either Harvard University or Harvard Medical School as a legal identity.

  • CVer中文讲解

Dataset Description

This dataset can only be used for non-commercial research purposes. At no time, the dataset shall be used for clinical decisions or patient care. The data use license is CC BY-NC-ND 4.0.

The dataset containing 10,000 patients includes 10,000 Scanning laser ophthalmoscopy (SLO) fundus images. The disc and cup masks, patient age, gender, race, ethnicity, language, and marital status information are also included in the data. Under the folder "ReadMe", the file "data_summary.csv" provides an overview of our data.

10,000 SLO fundus images with pixel-wise disc and cup masks are in the Google Drive folder: data_00001.npz data_00002.npz ... data_10000.npz

NPZ files have the following keys:

slo_fundus: Scanning laser ophthalmoscopy (SLO) fundus image
disc_cup_mask: disc and cup masks for the corresponding SLO fundus image
age: patient's age
gender: 0 - Female, 1 - Male
race: 0 - Asian, 1 - Black, 2 - White
ethnicity: 0 - Non-Hispanic, 1 - Hispanic, -1 - Unknown
language: 0 - English, 1 - Spanish, 2 - Others, -1 - Unknown
maritalstatus: 0 - Married or Partnered, 1 - Single, 2 - Divorced, 3 - Widowed, 4 - Legally Separated, -1 - Unknown

Acknowledgement and Citation

If you find this repository useful for your research, please consider citing our paper:

@inproceedings{tianfairseg,
  title={FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling},
  author={Tian, Yu and Shi, Min and Luo, Yan and Kouhana, Ava and Elze, Tobias and Wang, Mengyu},
  booktitle={The Twelfth International Conference on Learning Representations}
}

Related Skills

View on GitHub
GitHub Stars93
CategoryEducation
Updated21d ago
Forks16

Languages

Python

Security Score

80/100

Audited on Mar 11, 2026

No findings