This directory contains preprocessing scripts for all datasets used in DeepSparse.
data/
base/ Shared utilities (projector, dataset, saver, utils)
atlas-mini/ Pretraining dataset (AbdomenAtlas1.0Mini)
PANORAMA/ Finetuning — abdomen (PANORAMA challenge)
PENGWIN/ Finetuning — pelvis (PENGWIN challenge)
LUNA16_v2/ Finetuning — lung (LUNA16)
ToothFairy/ Finetuning — tooth (ToothFairy challenge)
projector.py— TIGRE-based cone-beam CT forward projectordataset.py— Generic Dataset class: resample → crop/pad → normalize → block conversionsaver.py— Saves processed CT, blocks, and projections with consistent folder structureutils.py— SimpleITK load/save helpers
The dataset_img_dir_dict in code/datasets/base.py maps training dataset names to these folders:
| Training name | Folder |
|---|---|
atlas-mini |
atlas-mini/ |
abdomen |
PANORAMA/ |
pelvis |
PENGWIN/ |
luna |
LUNA16_v2/ |
tooth |
ToothFairy/ |
pip install SimpleITK scipy numpy tigre matplotlib tqdm
See each dataset's README.md for dataset-specific download and preprocessing instructions.