Using Clinical Drug Representations for Improving Mortality and Length of Stay Predictions
Usage
- Clone the code to local.
https://github.com/tanlab/MIMIC-III-Clinical-Drug-Representations.git
cd MIMIC-III-Clinical-Drug-Representations
-
Run MIMIC-Extract Pipeline as explained in https://github.com/MLforHealth/MIMIC_Extract.
-
Copy the output file of MIMIC-Extract Pipeline named
all_hourly_data.h5tomimic-extractfolder. -
Copy the
ADMISSIONS.csv,PRESCRIPTIONS.csv,ICUSTAYS.csvfiles intomimic-iiifolder. -
Run
01-MIMIC-III-Drugs-Names-To-Pubchem-ID.ipynbto convert MIMIC-III Drug names into Pubchem ID. -
Download drug information via FDA. https://www.fda.gov/drugs/drug-approvals-and-databases/national-drug-code-directory.
-
Run
02-Create-Cohort.ipynbto select correct drugs for patients and create the final cohort. -
Run
03-Embeddings.ipynbto get embeddings of drugs. -
Run
04-Timeseries.ipynbto run timeseries baseline model to predict 4 different clinical tasks. -
Run
05-ECFP-1024-TimeSeries.ipynbto run ECFP multimodal baseline to predict 4 different clinical tasks. -
Run
6-Smiles-Transformer-TimeSeries.ipynbto run Transformers multimodal baseline to predict 4 different clinical tasks.
References
Download the MIMIC-III dataset via https://mimic.physionet.org/
MIMIC-Extract implementation: https://github.com/MLforHealth/MIMIC_Extract