OrganSegC2F: a coarse-to-fine organ segmentation framework

version 1.11 - Dec 3 2017 - by Yuyin Zhou and Lingxi Xie

Please note: an improved version of OrganSegC2F named OrganSegRSTN is available: https://github.com/198808xc/OrganSegRSTN

It outperforms OrganSegC2F by 84.50% vs. 82.37% on the NIH pancreas segmentation dataset.

Also NOTE: some functions have been optimized in the repository OrganSegRSTN, but not yet been transferred here.

I will do these things in the near future - they do not impact performance, but will make the testing process MUCH faster.

Yuyin Zhou is the main contributor to this repository.

Yuyin Zhou proposed the algorithm, created the framework and implemented main functions. Lingxi Xie later wrapped up these codes for release.

If you use our codes, please cite our paper accordingly:

Yuyin Zhou, Lingxi Xie, Wei Shen, Yan Wang, Elliot K. Fishman, Alan L. Yuille, "A Fixed-Point Model for Pancreas Segmentation in Abdominal CT Scans", in International Conference on MICCAI, Quebec City, Quebec, Canada, 2017.

https://arxiv.org/abs/1612.08230

http://lingxixie.com/Projects/OrganSegC2F.html

All the materials released in this library can ONLY be used for RESEARCH purposes.

The authors and their institution (JHU/JHMI) preserve the copyright and all legal rights of these codes.

Before you start, please note that there is a LAZY MODE, which allows you to run the entire framework with ONE click. Check the contents before Section 4.3 for details.

1. Introduction

OrganSegC2F is a code package for our paper: Yuyin Zhou, Lingxi Xie, Wei Shen, Yan Wang, Elliot K. Fishman, Alan L. Yuille, "A Fixed-Point Model for Pancreas Segmentation in Abdominal CT Scans", in International Conference on MICCAI, Quebec City, Quebec, 2017.

OrganSegC2F is a segmentation framework designed for 3D volumes. It was originally designed for segmenting abdominal organs in CT scans, but we believe that it can also be used for other purposes, such as brain tissue segmentation in fMRI-scanned images.

OrganSegC2F is based on the state-of-the-art deep learning techniques. This code package is to be used with CAFFE, a deep learning library. We make use of the python interface of CAFFE, named pyCAFFE.

It is highly recommended to use one or more modern GPUs for computation. Using CPUs will take at least 50x more time in computation.

2. File List

| Folder/File |:-------------------------- | README.txt | | | DATA2NPY/ | dicom2npy.py | nii2npy.py | | | DiceLossLayer/ | dice_loss_layer.hpp | dice_loss_layer.cpp | | | OrganSegC2F/ | coarse2fine_testing.py | coarse_fusion.py | coarse_surgery.py | coarse_testing.py | coarse_training.py | DataC.py | DataF.py | fine_surgery.py | fine_training.py | init.py | oracle_fusion.py | oracle_testing.py | run.sh | utils.py | | | OrganSegC2F/prototxts | deploy_1.prototxt | deploy_3.prototxt | training_C1.prototxt | training_C3.prototxt | training_F1.prototxt | training_F3.prototxt | Description | |:---------------------------------------------------- | | the README file | | | codes to transfer the NIH dataset into NPY format | | transferring image data (DICOM) into NPY format | | transferring label data (NII) into NPY format | | | CPU implementation of the Dice loss layer | | the header file | | the CPU implementation | | | primary codes of OrganSegC2F | | the coarse-to-fine testing process | | the coarse-scaled fusion process | | the surgery function for coarse-scaled training | | the coarse-scaled testing process | | the coarse-scaled training process | | the data layer in the coarse-scaled training | | the data layer in the fine-scaled training | | the surgery function for fine-scaled training | | the fine-scaled training process | | the initialization functions | | the fusion process with oracle information | | the testing process with oracle information | | the main program to be called in bash shell | | the common functions | | | primary codes of OrganSegC2F | | the prototxt file for 1-slice testing | | the prototxt file for 3-slice testing | | the prototxt file for 1-slice coarse-scaled training | | the prototxt file for 3-slice coarse-scaled training | | the prototxt file for 1-slice fine-scaled training | | the prototxt file for 3-slice fine-scaled training |

3. Installation

3.1 Prerequisites

3.1.1 Please make sure that your computer is equipped with modern GPUs that support CUDA.

Without them, you will need 50x more time in both training and testing stages.

3.1.2 Please also make sure that python (we are using 2.7) is installed.

3.2 CAFFE and pyCAFFE

3.2.1 Download a CAFFE library from http://caffe.berkeleyvision.org/ .

Suppose your CAFFE root directory is $CAFFE_PATH.

3.2.2 Place the files of Dice loss layer at the correct position.

dice_loss_layer.hpp -> $CAFFE_PATH/include/caffe/layers/
dice_loss_layer.cpp -> $CAFFE_PATH/src/caffe/layers/

3.2.3 Make CAFFE and pyCAFFE.

4. Usage

Please follow these steps to reproduce our results on the NIH pancreas segmentation dataset.

NOTE: Here we only provide basic steps to run our codes on the NIH dataset. For more detailed analysis and empirical guidelines for parameter setting (this is very important especially when you are using our codes on other datasets), please refer to our technical report (check our webpage for updates).

4.1 Data preparation

4.1.1 Download NIH data from https://wiki.cancerimagingarchive.net/display/Public/Pancreas-CT .

You should be able to download image and label data individually.
Suppose your data directory is $RAW_PATH:
    The image data are organized as $RAW_PATH/DOI/PANCREAS_00XX/A_LONG_CODE/A_LONG_CODE/ .
    The label data are organized as $RAW_PATH/TCIA_pancreas_labels-TIMESTAMP/label00XX.nii.gz .

4.1.2 Use our codes to transfer these data into NPY format.

Put dicom2npy.py under $RAW_PATH, and run: python dicom2npy.py .
    The transferred data should be put under $RAW_PATH/images/
Put nii2npy.py under $RAW_PATH, and run: python nii2npy.py .
    The transferred data should be put under $RAW_PATH/labels/

4.1.3 Suppose your directory to store experimental data is $DATA_PATH:

Put $CAFFE_PATH under $DATA_PATH/libs/
Put images/ under $DATA_PATH/
Put labels/ under $DATA_PATH/

NOTE: If you use other path(s), please modify the variable(s) in run.sh accordingly.

4.2 Initialization (requires: 4.1)

4.2.1 Check run.sh and set $DATA_PATH accordingly.

4.2.2 Set $ENABLE_INITIALIZATION=1 and run this script.

Several folders will be created under $DATA_PATH:
    $DATA_PATH/images_X|Y|Z: the sliced image data (data are sliced for faster I/O).
    $DATA_PATH/labels_X|Y|Z: the sliced label data (data are sliced for faster I/O).
    $DATA_PATH/lists: used for storing training, testing and slice lists.
    $DATA_PATH/logs: used for storing log files during the training process.
    $DATA_PATH/models: used for storing models (snapshots) during the training process.
    $DATA_PATH/prototxts: used for storing prototxts (called by training and testing nets).
    $DATA_PATH/results: used for storing testing results (volumes and text results).
According to the I/O speed of your hard drive, the time cost may vary.
    For a typical HDD, around 20 seconds are required for a 512x512x300 volume.
This process needs to be executed only once.

NOTE: if you are using another dataset which contains multiple targets,
    you can modify the variables "ORGAN_NUMBER" and "ORGAN_ID" in run.sh,
    as well as the "is_organ" function in utils.py to define your mapping function flexibly.

LAZY MODE!

You can run all the following modules with one execution!

a) Enable everything (except initialization) in the beginning part.
b) Set all the "PLANE" variables as "A" (4 in total) in the following part.
c) Run this manuscript!

4.3 Coarse-scaled training (requires: 4.2)

4.3.1 Check run.sh and set $COARSE_TRAINING_PLANE and $COARSE_TRAINING_GPU.

You need to run X|Y|Z planes individually, so you can use 3 GPUs in parallel.
You can also set COARSE_TRAINING_PLANE=A, so that three planes are trained orderly in one GPU.

4.3.2 Set $ENABLE_COARSE_TRAINING=1 and run this script.

The following folders/files will be created:
    Under $DATA_PATH/logs/, a log file named by training information.
    Under $DATA_PATH/models/snapshots/, a folder named by training information.
        Snapshots and solver-states wil

OrganSegC2F

Install / Use

README