Official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

Last update: Dec 20, 2022

Related tags

Deep Learning KEMP

Overview

Knowledge Bridging for Empathetic Dialogue Generation

This is the official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

Model Architecture

Setup

Check the packages needed or simply run the command:

pip install -r requirements.txt

Download GloVe vectors from here (glove.6B.300d.txt) and put it into /data/.
Download other data sources regarding ConceptNet and NRC_VAD lexicon, please visit Google Drive and place processed dataset kemp_dataset_preproc.json into /data/.
For reproducibility purposes, we place the model checkpoints at Google Drive. You could download and move it under /result/[MODELNAME]/result/, e.g., /result/KEMP/result/KEMP_best.tar.
To skip training, please check folder /result/[MODELNAME]/predicition/.

Data preprocessing

The dataset (EmpatheticDialogue) is preprocessed and stored under data in pickle format

python preprocess.py

Training

KEMP (Our)

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model KEMP \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--attn_loss \
--pointer_gen \
--save_path result/KEMP/ \
--emb_file data/glove.6B.300d.txt

KEMP w/o ECE

This model does not consider the emotional context graph of Emotional Context Encoder (ECE).

In ECE, we enrich the dialogue history with external knowledge into an emotional context graph. Then, the emotional signals of context are distilled based on the embeddings and emotion intensity values from the emotional context graph.

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model wo_ECE \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--pointer_gen \
--save_path result/wo_ECE/ \
--emb_file data/glove.6B.300d.txt

KEMP w/o EDD

This model does not consider the emotional dependency strategies of Emotion-Dependency Decoder (EDD).

In EDD, given emotional signal and emotional context graph, we incorporate an emotional cross-attention mechanism to selectively learn the emotional dependencies.

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model wo_EDD \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--pointer_gen \
--save_path result/wo_EDD/ \
--emb_file data/glove.6B.300d.txt

Testing

Add --test into above commands.

You can directly run /result/cal_metrics.py script to evaluate the model predictions.

Citation

If you find our work useful, please cite our paper as follows:

@article{li-etal-2022-kemp,
  title={Knowledge Bridging for Empathetic Dialogue Generation},
  author={Qintong Li and Piji Li and Zhaochun Ren and Pengjie Ren and Zhumin Chen},
  booktitle={AAAI},
  year={2022},
}

Official implementation for paper Knowledge Bridging for Empathetic Dialogue Generation (AAAI 2021).

Related tags

Overview

Knowledge Bridging for Empathetic Dialogue Generation

Model Architecture

Setup

Data preprocessing

Training

KEMP (Our)

KEMP w/o ECE

KEMP w/o EDD

Testing

Citation

Owner

Qintong Li

[NeurIPS 2021] "G-PATE: Scalable Differentially Private Data Generator via Private Aggregation of Teacher Discriminators"

Code for Emergent Translation in Multi-Agent Communication

Scenic: A Jax Library for Computer Vision and Beyond

Lane follower: Lane-detector (OpenCV) + Object-detector (YOLO5) + CAN-bus

Official Implementation for the "An Empirical Investigation of 3D Anomaly Detection and Segmentation" paper.

PyTorch implementation of "VRT: A Video Restoration Transformer"

CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer

The official implementation for "FQ-ViT: Fully Quantized Vision Transformer without Retraining".

Official code for the CVPR 2021 paper "How Well Do Self-Supervised Models Transfer?"

A annotation of yolov5-5.0

Neural Tangent Generalization Attacks (NTGA)

source code for 'Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge' by A. Shah, K. Shanmugam, K. Ahuja

Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

An AI Assistant More Than a Toolkit

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

Huawei Hackathon 2021 - Sweden (Stockholm)

efficient neural audio synthesis in the waveform domain

SuMa++: Efficient LiDAR-based Semantic SLAM (Chen et al IROS 2019)

Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics