Source codes for "Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs"

Last update: Dec 08, 2022

Overview

Structure-Aware-BART

This repo contains codes for the following paper:

Jiaao Chen, Diyi Yang:Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs, NAACL 2021

If you would like to refer to it, please cite the paper mentioned above.

Getting Started

These instructions will get you running the codes of Structure-Aware-Bart Conversation Summarization.

Requirements

Python 3.6 or higher
Pytorch >= 1.3.0
Pandas, Numpy, Pickle
rouge=1.0.0 (https://github.com/pltrdy/rouge)
transformers
allennlp
openie
wandb

Note that different versions of rouge or different rouge packages might result in different rouge scores. For the transformers, we used the version released by Oct. 7 2020. The updated version might also result in different performances.

Install the transformers with S-BART

cd transformers

pip install --editable ./

Downloading the data

Please download the dataset (including pre-processed graphs) and put them in the data folder here

Pre-processing the data

The data folder you download from the above link already contains all the pre-processed files (including the extracted graphs) from SAMSum corpus.

Extract Discourse Graphs

Here we utilize the data and codes from here to pre-train a conversation discourse parser and use that parser to extract discourse graphs in the SAMSum dataset.

Extract Action Graphs

Please go through ./src/data/extract_actions.ipynb to extract action graphs.

Training models

These section contains instructions for training the conversation summarizationmodels.

The generated summaries on test set for baseline BART and the S-BART is in the ./src/baseline and ./src/composit folder. (trained with seed 42)

The training logs from wandb for different seed (0,1,42) for S-BART is shown in ./src/Weights&Biases.pdf

Training baseline BART model

Please run ./train_base.sh to train the BART baseline models.

Training S-BART model

Please run ./train_multi_graph.sh to train the S-BART model.

Evaluating models

Please follow the example jupyter notebook (./src/eval.ipynb) is provided for evaluating the model on test set.

Source codes for "Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs"

Related tags

Overview

Structure-Aware-BART

Getting Started

Requirements

Install the transformers with S-BART

Downloading the data

Pre-processing the data

Extract Discourse Graphs

Extract Action Graphs

Training models

Training baseline BART model

Training S-BART model

Evaluating models

Owner

GT-SALT

[CVPR 2021] Generative Hierarchical Features from Synthesizing Images

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks

[NeurIPS 2021] Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

Cascading Feature Extraction for Fast Point Cloud Registration (BMVC 2021)

Blind visual quality assessment on 360° Video based on progressive learning

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

这是一个yolo3-tf2的源码，可以用于训练自己的模型。

InferPy: Deep Probabilistic Modeling with Tensorflow Made Easy

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation (CVPR 2021)

Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286

This repository contains source code for the Situated Interactive Language Grounding (SILG) benchmark

Official implementation for Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder at NeurIPS 2020

FeTaQA: Free-form Table Question Answering

Implements Stacked-RNN in numpy and torch with manual forward and backward functions

Official PyTorch implementation of Data-free Knowledge Distillation for Object Detection, WACV 2021.

This is a code repository for the paper "Graph Auto-Encoders for Financial Clustering".

Multi-task Learning of Order-Consistent Causal Graphs (NeuRIPs 2021)