The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

Last update: Mar 31, 2022

Related tags

Deep Learning D-REX

Overview

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

How do I cite D-REX?

For now, cite the Arxiv paper

@article{albalak2021drex,
      title={D-REX: Dialogue Relation Extraction with Explanations}, 
      author={Alon Albalak and Varun Embar and Yi-Lin Tuan and Lise Getoor and William Yang Wang},
      journal={arXiv preprint arXiv:2109.05126},
      year={2021},
}

To train the full system:

GPU=0
bash train_drex_system.sh $GPU

Notes:

The training script is set up to work with an NVIDIA Titan RTX (24Gb memory, mixed-precision)
To train on a GPU with less memory, adjust the GPU_BATCH_SIZE parameter in train_drex_system.sh to match your memory limit.
Training the full system takes ~24 hours on a single NVIDIA Titan RTX

To test the trained system:

GPU=0
bash test_drex_system.sh $GPU

To train/test individual modules:

Relation Extraction Model -

Training:

GPU=0
MODEL_PATH=relation_extraction_model
mkdir $MODEL_PATH
CUDA_VISIBLE_DEVICES=$GPU python3 train_relation_extraction_model.py \
    --model_class=relation_extraction_roberta \
    --model_name_or_path=roberta-base \
    --base_model=roberta-base \
    --effective_batch_size=30 \
    --gpu_batch_size=30 \
    --fp16 \
    --output_dir=$MODEL_PATH \
    --relation_extraction_pretraining \
    > $MODEL_PATH/train_outputs.log

Testing:

GPU=0
MODEL_PATH=relation_extraction_model
BEST_MODEL=$(ls $MODEL_PATH/F1* -d | sort -r | head -n 1)
THRESHOLD1=$(echo $BEST_MODEL | grep -o "T1.....")
THRESHOLD1=${THRESHOLD1: -2}
THRESHOLD2=$(echo $BEST_MODEL | grep -o "T2.....")
THRESHOLD2=${THRESHOLD2: -2}
CUDA_VISIBLE_DEVICES=0 python3 test_relation_extraction_model.py \
    --model_class=relation_extraction_roberta \
    --model_name_or_path=$BEST_MODEL \
    --base_model=roberta-base \
    --relation_extraction_pretraining \
    --threshold1=$THRESHOLD1 \
    --threshold2=$THRESHOLD2 \
    --data_split=test

Explanation Extraction Model -

Training:

GPU=0
MODEL_PATH=explanation_extraction_model
mkdir $MODEL_PATH
CUDA_VISIBLE_DEVICES=$GPU python3 train_explanation_policy.py \
    --model_class=explanation_policy_roberta \
    --model_name_or_path=roberta-base \
    --base_model=roberta-base \
    --effective_batch_size=30 \
    --gpu_batch_size=30 \
    --fp16 \
    --output_dir=$MODEL_PATH \
    --explanation_policy_pretraining \
    > $MODEL_PATH/train_outputs.log

Testing:

GPU=0
MODEL_PATH=explanation_extraction_model
BEST_MODEL=$(ls $MODEL_PATH/F1* -d | sort -r | head -n 1)
CUDA_VISIBLE_DEVICES=$GPU python3 test_explanation_policy.py \
    --model_class=explanation_policy_roberta \
    --model_name_or_path=$BEST_MODEL \
    --base_model=roberta-base \
    --explanation_policy_pretraining \
    --data_split=test

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

Related tags

Overview

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

How do I cite D-REX?

To train the full system:

To test the trained system:

To train/test individual modules:

Owner

Alon Albalak

Python Jupyter kernel using Poetry for reproducible notebooks

The code for our paper Semi-Supervised Learning with Multi-Head Co-Training

PySLM Python Library for Selective Laser Melting and Additive Manufacturing

😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc

Fast Differentiable Matrix Sqrt Root

High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.

Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

[ WSDM '22 ] On Sampling Collaborative Filtering Datasets

Aesara is a Python library that allows one to define, optimize, and efficiently evaluate mathematical expressions involving multi-dimensional arrays.

Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks

Physics-Informed Neural Networks (PINN) and Deep BSDE Solvers of Differential Equations for Scientific Machine Learning (SciML) accelerated simulation

Joint deep network for feature line detection and description

SelfAugment extends MoCo to include automatic unsupervised augmentation selection.

[CVPR 2016] Unsupervised Feature Learning by Image Inpainting using GANs

Tensorflow port of a full NetVLAD network

This repo contains the code for the paper "Efficient hierarchical Bayesian inference for spatio-temporal regression models in neuroimaging" that has been accepted to NeurIPS 2021.

Lab Materials for MIT 6.S191: Introduction to Deep Learning

Custom studies about block sparse attention.

This YoloV5 based model is fit to detect people and different types of land vehicles, and displaying their density on a fitted map, according to their coordinates and detected labels.

minimizer-space de Bruijn graphs (mdBG) for whole genome assembly