The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".

Last update: Oct 30, 2022

Overview

Code for "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval" (ACL 2021, Long)

This repository contains the baseline models and annotated data for the paper: Akari Asai and Eunsol Choi. Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval. In: Proceedings of ACL. 2021.

In the paper, we carefully analyze unanswerable questions in information-seeking QA datasets (i.e., Natural Questions and TyDi QA) and attempt to identify the remaining headroom. We conduct both a range of controlled experiments and intensive human annotation of around 800 examples across 6 languages.

Annotated data

In human_annotated_data, we provide human-annotated data from TyDi QA and Natural Questions.

Dataset	Language	# of annotated questions	File name
Natural Questions	English	450	NQ.tsv
TyDi QA	Bengali	50	TyDi-Bn.tsv
TyDi QA	Japanese	100	TyDi-Ja.tsv
TyDi QA	Korean	100	TyDi-Ko.tsv
TyDi QA	Russian	50	TyDi-Ru.tsv
TyDi QA	Telugu	50	TyDi-Te.tsv
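
The annotation files are tab-separated, so they can be read with Python's standard csv module. The sketch below is a minimal example, assuming each file starts with a header row. The column names are not documented in this section, so the code only reports whatever the header contains. The load_annotations helper is hypothetical and is not part of this repository.

```python
import csv
from pathlib import Path


def load_annotations(path):
    """Read one annotation TSV into a list of dicts keyed by the header row.

    Assumption: the first line of the file is a header row.
    """
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f, delimiter="\t")
        return list(reader)


if __name__ == "__main__":
    # Directory name taken from the text above; adjust if your checkout differs.
    data_dir = Path("human_annotated_data")
    for tsv_path in sorted(data_dir.glob("*.tsv")):
        rows = load_annotations(tsv_path)
        columns = list(rows[0].keys()) if rows else []
        print(f"{tsv_path.name}: {len(rows)} rows, columns: {columns}")
```

Running the script from the repository root should print one line per file, which makes it easy to compare the row counts against the table above.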

Baselines

In this work, we conduct several baseline experiments to identify the remaining headroom in information-seeking QA. This repository includes the question-only baseline. See the training and evaluation details in README.md. We thank the authors of RikiNet, Retro-Reader, and ETC for providing their models' predictions, which we use to analyze the behavior of those state-of-the-art models.
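
To illustrate what "question-only" means, the sketch below predicts whether a question is answerable from the question text alone, without looking at any passage. It is a toy stand-in, not the baseline model in this repository: it swaps in a multinomial naive Bayes classifier over whitespace tokens, and the class name, label strings, and example questions are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(question):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return question.lower().split()


class QuestionOnlyClassifier:
    """Multinomial naive Bayes over question tokens.

    A stand-in for illustration only; not the model used in the paper.
    """

    def fit(self, questions, labels):
        self.label_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        self.vocab = set()
        for question, label in zip(questions, labels):
            tokens = tokenize(question)
            self.token_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, question):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            # Log prior plus add-one smoothed log likelihood of each known token.
            score = math.log(count / total)
            denom = sum(self.token_counts[label].values()) + len(self.vocab)
            for token in tokenize(question):
                if token not in self.vocab:
                    continue  # ignore tokens never seen in training
                score += math.log((self.token_counts[label][token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


if __name__ == "__main__":
    # Hypothetical toy data; real labels would come from the QA datasets.
    train_questions = [
        "who wrote hamlet",
        "who painted the mona lisa",
        "why is the sky sad",
        "why do rocks dream",
    ]
    train_labels = ["answerable", "answerable", "unanswerable", "unanswerable"]
    clf = QuestionOnlyClassifier().fit(train_questions, train_labels)
    print(clf.predict("who wrote the odyssey"))
```

The point of such a baseline is diagnostic: if a model that never sees the passage can still predict answerability well above chance, part of the signal comes from the question itself.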

Citation and Contact

If you find this codebase useful or use it in your work, please cite our paper.

@inproceedings{
asai2020learning,
title={Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval},
author={Akari Asai and Eunsol Choi},
booktitle={ACL-IJCNLP},
year={2021}
}

Please contact Akari Asai (@AkariAsai, akari[at]cs.washington.edu) for questions and suggestions.
