Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

Last update: Oct 21, 2022

Related tags

Overview

Can NLI Models Verify QA Systems' Predictions?

This repository contains the data and code for the following paper:

**Can NLI Models Verify QA Systems' Predictions? **
Jifan Chen, Eunsol Choi, Greg Durrett
EMNLP 2021 Findings

@article{chen2021can,
  title={Can NLI Models Verify QA Systems' Predictions?},
  author={Chen, Jifan and Choi, Eunsol and Durrett, Greg},
  journal={EMNLP Findings},
  year={2021}
}

Datasets

The NLI data converted from QA datasets through our pipeline described in the paper can be found here

Data Format

The data files are formatted as jsonlines; each example is described as the following:

Field	Description
`example_id`	Example ID
`title_text`	Title of the Wikipedia page of the example, could be NONE
`paragraph_text`	Paragraph containing the answer
`question_text`	Question
`answer_text`	Answer of the question
`answer_sent_text`	Sentence containing the answer
`decontext_answer_sent_text`	Decontextualized sentence containing the answer
`question_statement_text`	Declarative version of the question by combining the answer
`answer_scores`	Top 5 Answer score computed by the QA(BERT-joint) model
`is_correct`	Whether the answer is correct
`answer_sent_text`	Sentence containing the answer

Models

Getting started

git clone https://github.com/jifan-chen/QA-Verification-Via-NLI.git

Install the dependencies by running pip install -r requirements.txt

Question Converter & Decontextualizer

See README in seq2seq_converter.

NQ-NLI

coming soon

Contact

Please contact at [email protected] if you have any questions.

Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

Related tags

Overview

Can NLI Models Verify QA Systems' Predictions?

Datasets

Data Format

Models

Getting started

Question Converter & Decontextualizer

NQ-NLI

Contact

Owner

Jifan Chen

🤕 spelling exceptions builder for lazy people

Code for Emergent Translation in Multi-Agent Communication

MMDA - multimodal document analysis

Beta Distribution Guided Aspect-aware Graph for Aspect Category Sentiment Analysis with Affective Knowledge. Proceedings of EMNLP 2021

硕士期间自学的NLP子任务，供学习参考

This is the 25 + 1 year anniversary version of the 1995 Rachford-Rice contest

ConvBERT-Prod

ZUNIT - Toward Zero-Shot Unsupervised Image-to-Image Translation

Converts python code into c++ by using OpenAI CODEX.

CoSENT、STS、SentenceBERT

Faster, modernized fork of the language identification tool langid.py

Biterm Topic Model (BTM): modeling topics in short texts

Python library for parsing resumes using natural language processing and machine learning

Basic yet complete Machine Learning pipeline for NLP tasks

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

text to speech toolkit. 好用的中文语音合成工具箱，包含语音编码器、语音合成器、声码器和可视化模块。

An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

A paper list for aspect based sentiment analysis.