Answering Open-Domain Questions of Varying Reasoning Steps from Text

Last update: Dec 22, 2022

Related tags

Overview

IRRR

This repository contains the authors' implementation of the Iterative Retriever, Reader, and Reranker (IRRR) model in the EMNLP 2021 paper "Answering Open-Domain Questions of Varying Reasoning Steps from Text".

Run prediction on BeerQA

Setting up

Checkout the code from our repository using

git clone https://github.com/beerqa/IRRR.git

This repo requires Python 3.6. Please check your shell environment's python before proceeding. To use ElasticSearch, make sure you also install Java Development Kit (JDK) version 8.

The setup script will download all required dependencies (python requirements, data, models, etc.) required to run the IRRR pipeline end-to-end. Before running this script, make sure you have the Unix utility wget (which can be installed through anaconda as well as other common package managers). Along the way, it will also start running Elasticsearch and index the Wikipedia corpus locally.

Note: This might take a while to finish and requires a large amount of disk space, so it is strongly recommended that you run this on a machine with at least 100GB of free disk space.

bash setup.sh

Run prediction

Here is a quick example for running prediction using our trained model on the BeerQA dataset. It will take hours depending on the number of retrieved passages at each iteration. It requires up-to 100GB of storage for storing intermediate files when are large number of passages are retrieved at each reasoning step

bash scripts/predict_dynamic_hops.sh PREDICT_OUTPUT_PATH \
                                     ./data/beerqa/beerqa_dev_v1.0.json \
				     MODEL_OUTPUT_PATH \
				     NUM_PASSAGES_AT_EACH_ITERATION \
				     MAX_ITERATION

Evaluate the prediction

Once the prediction has been made, you can use the following command to evaluate the output

python utils/eval_beerqa.py ./data/beerqa/beerqa_dev_v1.0.json \
                            PREDICT_OUTPUT_PATH/answer_predictions.json

Citation

If you use IRRR in your work, please consider citing our paper

@inproceedings{qi2021answering,
  title={Answering Open-Domain Questions of Varying Reasoning Steps from Text},
  author = {Qi, Peng and Lee, Haejun and Sido, Oghenetegiri "TG" and Manning, Christopher D.},
  booktitle = {Empirical Methods for Natural Language Processing ({EMNLP})},
  year = {2021}
}

License

All work contained in this package is licensed under the Apache License, Version 2.0. See the included LICENSE file.

Answering Open-Domain Questions of Varying Reasoning Steps from Text

Related tags

Overview

IRRR

Run prediction on BeerQA

Setting up

Run prediction

Evaluate the prediction

Citation

License

Owner

Implementation of Feedback Transformer in Pytorch

PyTorch implementation of normalizing flow models

Space Invaders For Python

Llvlir - Low Level Variable Length Intermediate Representation

Birthday-problem - The birthday problem asks for the probability that, in a set of n randomly chosen people, at least two will share a birthday

An official implementation of "SFNet: Learning Object-aware Semantic Correspondence" (CVPR 2019, TPAMI 2020) in PyTorch.

PyTorch implementation of paper: AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer, ICCV 2021.

Fbone (Flask bone) is a Flask (Python microframework) starter/template/bootstrap/boilerplate application.

PyTorch implementation for COMPLETER: Incomplete Multi-view Clustering via Contrastive Prediction (CVPR 2021)

A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning

Trajectory Variational Autoencder baseline for Multi-Agent Behavior challenge 2022

Demystifying How Self-Supervised Features Improve Training from Noisy Labels

Code accompanying the paper "How Tight Can PAC-Bayes be in the Small Data Regime?"

Adversarial Self-Defense for Cycle-Consistent GANs

🤗 Paper Style Guide

Lightweight Face Image Quality Assessment

Bayesian Neural Networks in PyTorch

An OpenAI Gym environment for Super Mario Bros

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper.