KNIGHT

The official repository holding the data for the ISBI 2022 KNIGHT Challenge

About

The KNIGHT Challenge asks teams to develop models to classify patients with kidney tumors in terms of their "risk score" as defined by the recently-release American Urological Association (AUA) Guidelines for Renal Masses. KNIGHT makes use of the imaging and clinical data from the MICCAI KiTS21 Challenge.

Accessing the Data

A JSON file with each patient's clinical data lives in this repository at knight/data/knight.json. The imaging associated with each of the 300 patients can be downloaded with the knight/scripts/get_imaging.py script (requires Python 3).

If you wish to make use of the segmentations used for the KiTS21 challenge, you can access those by cloning the official KiTS21 repository.

The prediction target for the KNIGHT challenge is the attribute entitled "aua_risk_group" in the knight.json file. The primary task is a binary classification between the two higher-risk groups ("high_risk" and "very_high_risk") versus the three lower-risk groups ("benign", "low_risk", and "intermediate_risk"). A secondary task is the five-way classification problem for each group individually.

Participants are encouraged to make use of the clinical data as well as the imaging in order to make their predictions. The following clinical attributes will be made available at inference time for cases in the test set.

"age_at_nephrectomy"
"gender"
"body_mass_index"
"comorbidities"
"smoking_history"
"age_when_quit_smoking"
"pack_years"
"chewing_tobacco_use"
"alcohol_use"
"last_preop_egfr"
"radiographic_size"
"voxel_spacing"

All other attributes will NOT be made available and participants should not train models that take as inputs any clinical attributes not listed above.

The official repository of the ISBI 2022 KNIGHT Challenge

Related tags

Overview

KNIGHT

About

Accessing the Data

Owner

Nicholas Heller

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

Python module (C extension and plain python) implementing Aho-Corasick algorithm

Auto translate textbox from Japanese to English or Indonesia

Levenshtein and Hamming distance computation

TweebankNLP - Pre-trained Tweet NLP Pipeline (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Models + Tweebank-NER

Code for "Finetuning Pretrained Transformers into Variational Autoencoders"

KoBERT - Korean BERT pre-trained cased (KoBERT)

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).

TEACh is a dataset of human-human interactive dialogues to complete tasks in a simulated household environment.

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Blender addon - Scrub timeline from viewport with a shortcut

Practical Machine Learning with Python

Tracking Progress in Natural Language Processing

Tevatron is a simple and efficient toolkit for training and running dense retrievers with deep language models.

A library for finding knowledge neurons in pretrained transformer models.

Top2Vec is an algorithm for topic modeling and semantic search.

Unsupervised text tokenizer focused on computational efficiency

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

Suite of 500 procedurally-generated NLP tasks to study language model adaptability

Training code of Spatial Time Memory Network. Semi-supervised video object segmentation.