CvT2DistilGPT2
Improving Chest X-Ray Report Generation by Leveraging Warm-Starting
- This repository houses the implementation of CvT2DistilGPT2 from [1].
- CvT2DistilGPT2 is an encoder-to-decoder model that was developed for chest X-ray report generation.
- Checkpoints for CvT2DistilGPT2 on MIMIC-CXR and IU X-Ray are available.
- This implementation could be adapted for any image captioning task by modifying the datamodule.
Installation
The required packages are located in requirements.txt. It is recommended that these are installed in a virtualenv:
python3 -m venv --system-site-packages venv
source venv/bin/activate
pip install --upgrade pip
pip install --upgrade -r requirements.txt --no-cache-dir
Datasets
For MIMIC-CXR:
-  Download MIMIC-CXR-JPG from: https://physionet.org/content/mimic-cxr-jpg/2.0.0/
-  Place in dataset/mimic_cxr_jpgsuch thatdataset/mimic_cxr_jpg/physionet.org/files/mimic-cxr-jpg/2.0.0/files.
-  Download the Chen et al. labels for MIMIC-CXR from: https://drive.google.com/file/d/1DS6NYirOXQf8qYieSVMvqNwuOlgAbM_E/view?usp=sharing
-  Place annotations.jsonindataset/mimic_cxr_chen
For IU X-Ray:
- Download the Chen et al. labels and the chest X-rays in pngformat for IU X-Ray from:https://drive.google.com/file/d/1c0BXEuDy8Cmm2jfN0YYGkQxFZd2ZIoLg/view
- Place files into dataset/iu_x-ray_chensuch thatdataset/iu_x-ray_chen/annotations.jsonanddataset/iu_x-ray_chen/images.
#####Note: the dataset directory can be changed for each task with the variable dataset_dir in task/mimic_cxr_jpg_chen/paths.yaml and task/mimic_cxr_jpg_chen/paths.yaml
Checkpoints
The checkpoints for MIMIC-CXR and IU X-Ray can be found at (the download link is located at the top right): https://doi.org/10.25919/hbqx-2p71. Place the checkpoints in the experiment directory for each version of each task, e.g., experiment/mimic_cxr_jpg_chen/cvt_21_to_gpt2_scst/epoch=0-val_chen_cider=0.410965.ckpt #####Note: the experiment directory can be changed for each task with the variable exp_dir in task/mimic_cxr_jpg_chen/paths.yaml and task/mimic_cxr_jpg_chen/paths.yaml
Instructions
-  The model configurations for each task can be found in its configdirectory, e.g.task/mimic_cxr_jpg_chen/config.
-  A job for a model is described in the tasks jobs.yamlfile, e.g.task/mimic_cxr_jpg_chen/jobs.yaml.
-  To test the CvT2DistilGPT2 + SCST checkpoint, set task/mimic_cxr_jpg_chen/jobs.yamlto (default):cvt_21_to_distilgpt2_scst: train: 0 test: 1 debug: 0 num_nodes: 1 num_gpus: 1 num_workers: 5
-  To train CvT2DistilGPT2 with teacher forcing and then test, set task/mimic_cxr_jpg_chen/jobs.yamlto:cvt_21_to_distilgpt2: train: 1 test: 1 debug: 0 num_nodes: 1 num_gpus: 1 num_workers: 5or with Slurm: cvt_21_to_distilgpt2: train: 1 test: 1 debug: 0 num_nodes: 1 num_gpus: 1 num_workers: 5 resumable: 1 sbatch: 1 time_limit: 1-00:00:00
-  To run the job: python3 main.py --task mimic_cxr_jpg_chen 
#####Note: data from the job will be saved in the experiment directory.
