Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Last update: Oct 14, 2022

Overview

About this repository

This repo contains an Pytorch implementation for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks. The code framework is based on TextBox.

Environment

python >= 3.8.11
torch >= 1.6.0

Run install.sh to install other requirements.

Dataset

The processed dataset can be downloaded from Google Drive. Once finished, unzip the datafiles (train.src, train.tgt, ...) to ./data.

An overview of dataset: train: 287113 cases, dev: 13368 cases, test: 11490 cases

Paramters

# overall settings
data_path: 'data/'
checkpoint_dir: 'saved/'
generated_text_dir: 'generated/'
# dataset settings
max_vocab_size: 50000
src_len: 400
tgt_len: 100

# model settngs
decoding_strategy: 'beam_search'
beam_size: 4
is_attention: True
is_pgen: True
is_coverage: True
cov_loss_lambda: 1.0

Log file is located in ./log, more details can be found in yamls.

Note: Distributed Data Parallel (DDP) is not supported yet.

Train & Evaluation

From scratch run `fire.py`.

if __name__ == '__main__':
    config = Config(config_dict={'test_only': False,
                                 'load_experiment': None})
    train(config)

If you want to resume from a checkpoint, just set the 'load_experiment': './saved/$model_name$.pth'. Similarly, when 'test_only' is set to True, 'load_experiment' is required.

Results

The best model is trained on a TITAN Xp GPU (8GB usage).

Training loss

Ablation study

Model	Rouge-1	Rouge-2	Rouge-L
Seq2Seq	22.17	7.20	20.97
Seq2Seq+attn	29.35	12.58	27.38
Seq2Seq+attn+pgen	36.04	15.87	32.92
Seq2Seq+attn+pgen+coverage	39.52	17.85	36.40

Note: The architecture of the Seq2Seq model is based on lstm, I hope I can replace it with transformer in the future.

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run `fire.py`.

Results

Training loss

Ablation study

Owner

wxDai

Py-FEAT: Python Facial Expression Analysis Toolbox

Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning

基于深度强化学习的原神自动钓鱼AI

Implementation of Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021)

SOFT: Softmax-free Transformer with Linear Complexity, NeurIPS 2021 Spotlight

Cancer Drug Response Prediction via a Hybrid Graph Convolutional Network

This is an official source code for implementation on Extensive Deep Temporal Point Process

Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective

The Fundamental Clustering Problems Suite (FCPS) summaries 54 state-of-the-art clustering algorithms, common cluster challenges and estimations of the number of clusters as well as the testing for cluster tendency.

Exploring Versatile Prior for Human Motion via Motion Frequency Guidance (3DV2021)

Repo for EMNLP 2021 paper "Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression"

Frigate - NVR With Realtime Object Detection for IP Cameras

This repository provides a basic implementation of our GCPR 2021 paper "Learning Conditional Invariance through Cycle Consistency"

Code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

Training neural models with structured signals.

Code for SALT: Stackelberg Adversarial Regularization, EMNLP 2021.

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

This repository contains the official code of the paper Equivariant Subgraph Aggregation Networks (ICLR 2022)

A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle detector......

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run fire.py.

Results

Training loss

Ablation study

Owner

wxDai

Py-FEAT: Python Facial Expression Analysis Toolbox

Camera Distortion-aware 3D Human Pose Estimation in Video with Optimization-based Meta-Learning

基于深度强化学习的原神自动钓鱼AI

Implementation of Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021)

SOFT: Softmax-free Transformer with Linear Complexity, NeurIPS 2021 Spotlight

Cancer Drug Response Prediction via a Hybrid Graph Convolutional Network

This is an official source code for implementation on Extensive Deep Temporal Point Process

Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective

The Fundamental Clustering Problems Suite (FCPS) summaries 54 state-of-the-art clustering algorithms, common cluster challenges and estimations of the number of clusters as well as the testing for cluster tendency.

Exploring Versatile Prior for Human Motion via Motion Frequency Guidance (3DV2021)

Repo for EMNLP 2021 paper "Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression"

Frigate - NVR With Realtime Object Detection for IP Cameras

This repository provides a basic implementation of our GCPR 2021 paper "Learning Conditional Invariance through Cycle Consistency"

Code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

Training neural models with structured signals.

Code for SALT: Stackelberg Adversarial Regularization, EMNLP 2021.

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

This repository contains the official code of the paper Equivariant Subgraph Aggregation Networks (ICLR 2022)

A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle detector......

From scratch run `fire.py`.