Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Last update: Dec 23, 2022

Authors: Yixuan Su, Lei Shu, Elman Mansimov, Arshit Gupta, Deng Cai, Yi-An Lai, and Yi Zhang

Code for our PPTOD paper: Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Introduction:

Pre-trained language models have recently been shown to benefit task-oriented dialogue (TOD) systems. Despite their success, existing methods often formulate this task as a cascaded generation problem, which can lead to error accumulation across different sub-tasks and greater data annotation overhead. In this study, we present PPTOD, a unified model that seamlessly supports both task-oriented dialogue understanding and response generation in a plug-and-play fashion. In addition, we introduce a new dialogue multi-task pre-training strategy that allows the model to learn the primary TOD task completion skills from heterogeneous dialogue corpora. We extensively test our model on three benchmark TOD tasks, including end-to-end dialogue modelling, dialogue state tracking, and intent classification. Results show that PPTOD achieves new state-of-the-art results on all evaluated tasks in both full training and low-resource scenarios. Furthermore, comparisons against previous SOTA methods show that the responses generated by PPTOD are more factually correct and semantically coherent as judged by human annotators.

1. Citation

If you find our paper and resources useful, please kindly cite our paper:

  @article{su2021multitask,
    author    = {Yixuan Su and
                 Lei Shu and
                 Elman Mansimov and
                 Arshit Gupta and
                 Deng Cai and
                 Yi{-}An Lai and
                 Yi Zhang},
    title     = {Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System},
    journal   = {CoRR},
    volume    = {abs/2109.14739},
    year      = {2021},
    url       = {https://arxiv.org/abs/2109.14739},
    eprinttype = {arXiv},
    eprint    = {2109.14739}
  }

2. Environment Setup:

pip3 install -r requirements.txt
python -m spacy download en_core_web_sm
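
After installation, it can help to confirm that the spaCy model actually loads before running any of the task scripts. The following is a minimal sketch, not part of the repository: the helper name check_spacy_model is hypothetical, and it assumes the interpreter is available as python3 (the commands above use python; adjust as needed).

```shell
# Hypothetical helper (not shipped with PPTOD): report whether a spaCy
# model can be loaded by the python3 interpreter on PATH.
check_spacy_model() {
  # $1 is the model name, e.g. en_core_web_sm.
  if python3 -c "import spacy; spacy.load('$1')" >/dev/null 2>&1; then
    echo "ok"
  else
    # Covers a missing interpreter, a missing spaCy install, or a missing model.
    echo "missing"
  fi
}

# Check the model installed in the setup step above.
check_spacy_model en_core_web_sm
```

If this prints "missing", re-running the two setup commands inside the same Python environment is the first thing to try.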

3. PPTOD Checkpoints:

You can download checkpoints of PPTOD with different configurations here.

PPTOD-small	PPTOD-base	PPTOD-large
here	here	here

To use PPTOD, you should download the checkpoint you want and unzip it in the ./checkpoints directory.

Alternatively, you can run the following commands to download the PPTOD checkpoints.

(1) Downloading Pre-trained PPTOD-small Checkpoint:

cd checkpoints
chmod +x ./download_pptod_small.sh
./download_pptod_small.sh

(2) Downloading Pre-trained PPTOD-base Checkpoint:

cd checkpoints
chmod +x ./download_pptod_base.sh
./download_pptod_base.sh

(3) Downloading Pre-trained PPTOD-large Checkpoint:

cd checkpoints
chmod +x ./download_pptod_large.sh
./download_pptod_large.sh
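
The three download steps above follow the same pattern, so they can be wrapped in a small loop. This is only a sketch under stated assumptions: it is meant to be run from the repository root, the function name download_pptod and the DRY_RUN variable are illustrative and not part of the repository, and the script names are the ones listed above.

```shell
# Hypothetical convenience wrapper (not shipped with PPTOD).
# Usage: download_pptod small base large
# Set DRY_RUN=1 to print the commands instead of running them.
download_pptod() {
  for size in "$@"; do
    # Only the three sizes named in this README are accepted.
    case "$size" in
      small|base|large) ;;
      *) echo "unknown size: $size" >&2; return 1 ;;
    esac
    script="download_pptod_${size}.sh"
    if [ "${DRY_RUN:-0}" = "1" ]; then
      echo "would run: (cd checkpoints && chmod +x ./${script} && ./${script})"
    else
      # Subshell so the caller's working directory is left unchanged.
      (cd checkpoints && chmod +x "./${script}" && "./${script}") || return 1
    fi
  done
}

# Preview the commands without downloading anything.
DRY_RUN=1 download_pptod small base large
```

Running each script in a subshell avoids the repeated cd checkpoints failing on the second and third downloads when the commands are pasted one after another.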

4. Data Preparation:

Detailed instructions for preparing the pre-training corpora and the data for the downstream TOD tasks are provided in the ./data folder.

5. Dialogue Multi-Task Pre-training:

To pre-train a PPTOD model from scratch, please refer to the details provided in the ./Pretraining directory.

6. Benchmark TOD Tasks:

(1) End-to-End Dialogue Modelling:

To perform End-to-End Dialogue Modelling using PPTOD, please refer to the details provided in the ./E2E_TOD directory.

(2) Dialogue State Tracking:

To perform Dialogue State Tracking using PPTOD, please refer to the details provided in the ./DST directory.

(3) Intent Classification:

To perform Intent Classification using PPTOD, please refer to the details provided in the ./IC directory.

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

Owner

Amazon Web Services - Labs
