Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Last update: Dec 20, 2022

Related tags

Overview

Fine-grained Post-training for Multi-turn Response Selection

Implements the model described in the following paper Fine-grained Post-training for Improving Retrieval-based Dialogue Systems in NAACL-2021.

@inproceedings{han-etal-2021-fine,
title = "Fine-grained Post-training for Improving Retrieval-based Dialogue Systems",
author = "Han, Janghoon  and Hong, Taesuk  and Kim, Byoungjae  and Ko, Youngjoong  and Seo, Jungyun",
booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
month = jun, year = "2021", address = "Online", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/2021.naacl-main.122", pages = "1549--1558",
}

This code is reimplemented as a fork of huggingface/transformers.

Setup and Dependencies

This code is implemented using PyTorch v1.8.0, and provides out of the box support with CUDA 11.2 Anaconda is the recommended to set up this codebase.

# https://pytorch.org
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
pip install -r requirements.txt

Preparing Data and Checkpoints

Post-trained and fine-tuned Checkpoints

We provide following post-trained and fine-tuned checkpoints.

Data pkl for Fine-tuning (Response Selection)

We used the following data for post-training and fine-tuning

fine-grained post-training dataset and fine-tuning dataset for 3 benchmarks (ubuntu, douban, e-commerce)

Original version for each dataset is availble in Ubuntu Corpus V1, Douban Corpus, and E-Commerce Corpus, respectively.

Fine-grained Post-Training

Making Data for post-training and fine-tuning

Data_processing.py

Post-training Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

python -u FPT/ubuntu_final.py --num_train_epochs 25
python -u FPT/douban_final.py --num_train_epochs 27
python -u FPT/e_commmerce_final.py --num_train_epochs 34

Fine-tuning Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

Taining

To train the model, set `--is_training`
python -u Fine-Tuning/Response_selection.py --task ubuntu --is_training
python -u Fine-Tuning/Response_selection.py --task douban --is_training
python -u Fine-Tuning/Response_selection.py --task e_commerce --is_training

Testing

python -u Fine-Tuning/Response_selection.py --task ubuntu
python -u Fine-Tuning/Response_selection.py --task douban 
python -u Fine-Tuning/Response_selection.py --task e_commerce

Training Response Selection Models

Model Arguments

Fine-grained post-training

task_name	data_dir	checkpoint_path
ubuntu	ubuntu_data/ubuntu_post_train.pkl	FPT/PT_checkpoint/ubuntu/bert.pt
douban	douban_data/douban_post_train.pkl	FPT/PT_checkpoint/douban/bert.pt
e-commerce	e_commerce_data/e_commerce_post_train.pkl	FPT/PT_checkpoint/e_commerce/bert.pt

Fine-tuning

task_name	data_dir	checkpoint_path
ubuntu	ubuntu_data/ubuntu_dataset_1M.pkl	Fine-Tuning/FT_checkpoint/ubuntu.0.pt
douban	douban_data/douban_dataset_1M.pkl	Fine-Tuning/FT_checkpoint/douban.0.pt
e-commerce	e_commerce_data/e_commerce_dataset_1M.pkl	Fine-Tuning/FT_checkpoint/e_commerce.0.pt

Performance

We provide model checkpoints of BERT_FP, which obtained new state-of-the-art, for each dataset.

Ubuntu	[email protected]	[email protected]	[email protected]
[BERT_FP]	0.911	0.962	0.994

Douban	MAP	MRR	[email protected]	[email protected]	[email protected]	[email protected]
[BERT_FP]	0.644	0.680	0.512	0.324	0.542	0.870

E-Commerce	[email protected]	[email protected]	[email protected]
[BERT_FP]	0.870	0.956	0.993

Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Related tags

Overview

Fine-grained Post-training for Multi-turn Response Selection

Setup and Dependencies

Preparing Data and Checkpoints

Post-trained and fine-tuned Checkpoints

Data pkl for Fine-tuning (Response Selection)

Fine-grained Post-Training

Making Data for post-training and fine-tuning

Post-training Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

Fine-tuning Examples

(Ubuntu Corpus V1, Douban Corpus, E-commerce Corpus)

Taining

Testing

Training Response Selection Models

Model Arguments

Fine-grained post-training

Fine-tuning

Performance

Owner

Janghoon Han

TensorFlow Tutorials with YouTube Videos

Riemannian Convex Potential Maps

code for "Self-supervised edge features for improved Graph Neural Network training",

3rd place solution for the Weather4cast 2021 Stage 1 Challenge

Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal transformer that encodes language inputs and the full episode history of visual observations and actions.

[ICCV 2021 Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Voxel Transformer for 3D object detection

Code for LIGA-Stereo Detector, ICCV'21

Interpretation of T cell states using reference single-cell atlases

Code for MSc Quantitative Finance Dissertation

TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"

Demonstrates iterative FGSM on Apple's NeuralHash model.

[CVPR2021] Domain Consensus Clustering for Universal Domain Adaptation

In this project we predict the forest cover type using the cartographic variables in the training/test datasets.

Keqing Chatbot With Python

PyTorch implementation of CloudWalk's recent work DenseBody

DLL: Direct Lidar Localization

A python implementation of Deep-Image-Analogy based on pytorch.

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

This repo holds codes of the ICCV21 paper: Visual Alignment Constraint for Continuous Sign Language Recognition.