SurvTRACE: Transformers for Survival Analysis with Competing Events

Overview

This repo provides the implementation of SurvTRACE for survival analysis. It is easy to use with only the following code:

from survtrace.dataset import load_data
from survtrace.model import SurvTraceSingle
from survtrace import Evaluator
from survtrace import Trainer
from survtrace import STConfig

# use METABRIC dataset
STConfig['data'] = 'metabric'
df, df_train, df_y_train, df_test, df_y_test, df_val, df_y_val = load_data(STConfig)

# initialize model
model = SurvTraceSingle(STConfig)

# execute training
trainer = Trainer(model)
trainer.fit((df_train, df_y_train), (df_val, df_y_val))

# evaluating
evaluator = Evaluator(df, df_train.index)
evaluator.eval(model, (df_test, df_y_test))

print("done!")

🔥 See the demo

Please refer to experiment_metabric.ipynb and experiment_support.ipynb!

🔥 How to configure the environment

Use our pre-saved conda environment!

conda env create --name survtrace --file=survtrace.yml
conda activate survtrace

or install from requirements.txt:

pip3 install -r requirements.txt

🔥 How to get SEER data

  1. Go to https://seer.cancer.gov/data/ and submit a data request to SEER following the guide there.

  2. After completing step one, you should receive access to the SEER*Stat software. Open it and sign in with the username and password sent by SEER.

  3. Use SEER*Stat to open the ./data/seer.sl file, then click the 'execute' icon to query the SEER database. This yields a csv file.

  4. Move the csv file to ./data/seer_raw.csv, then run the python script process_seer.py, as

    python process_seer.py

    This produces the processed SEER data named seer_processed.csv. A quick sanity check of the processed file is sketched below.
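As a quick sanity check (a minimal sketch, assuming process_seer.py writes its output to ./data/seer_processed.csv as described above), you can load the processed file with pandas and inspect it:

import pandas as pd

# load the processed SEER file produced by process_seer.py
df_seer = pd.read_csv('./data/seer_processed.csv')

# inspect the number of patients and the processed columns
print(df_seer.shape)
print(df_seer.columns.tolist())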

📝 Functions

  • Single-event survival analysis
  • Competing-events survival analysis (see the sketch below)
  • Multi-task learning
  • Automatic hyperparameter grid search
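
For competing events, the workflow mirrors the single-event quick-start above. The sketch below is illustrative only: it assumes a SurvTraceMulti class as the multi-event counterpart of SurvTraceSingle, and that 'seer' is a dataset key accepted by load_data for the processed SEER data; check survtrace/model.py and survtrace/dataset.py for the exact names.

from survtrace.dataset import load_data
from survtrace.model import SurvTraceMulti   # assumed multi-event counterpart of SurvTraceSingle
from survtrace import Evaluator
from survtrace import Trainer
from survtrace import STConfig

# use the processed SEER data (dataset key assumed; see survtrace/dataset.py)
STConfig['data'] = 'seer'
df, df_train, df_y_train, df_test, df_y_test, df_val, df_y_val = load_data(STConfig)

# initialize the competing-events model
model = SurvTraceMulti(STConfig)

# train exactly as in the single-event example
trainer = Trainer(model)
trainer.fit((df_train, df_y_train), (df_val, df_y_val))

# evaluate on the held-out test split
evaluator = Evaluator(df, df_train.index)
evaluator.eval(model, (df_test, df_y_test))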

😄 If you find this work interesting, please consider citing this paper:

@article{wang2021survtrace,
      title={Surv{TRACE}: Transformers for Survival Analysis with Competing Events}, 
      author={Zifeng Wang and Jimeng Sun},
      year={2021},
      eprint={2110.00855},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}