Open-Ended Commonsense Reasoning

Quick links: [Paper] | [Video] | [Slides] | [Documentation]

This is the repository of the paper, Differentiable Open-Ended Commonsense Reasoning, by Bill Yuchen Lin, Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Xiang Ren, and William W. Cohen, in Proc. of NAACL 2021.

Abstract

Current commonsense reasoning research focuses on developing models that use commonsense knowledge to answer multiple-choice questions. However, systems designed to answer multiple-choice questions may not be useful in applications that do not provide a small list of candidate answers to choose from. As a step towards making commonsense reasoning research more realistic, we propose to study open-ended commonsense reasoning (OpenCSR) — the task of answering a commonsense question without any pre-defined choices — using as a resource only a corpus of commonsense facts. OpenCSR is challenging due to a large decision space, and because many questions require implicit multi-hop reasoning. As an approach to OpenCSR, we propose DrFact, an efficient Differentiable model for multi-hop Reasoning over knowledge Facts. To evaluate OpenCSR methods, we adapt several popular commonsense reasoning benchmarks, and collect multiple new answers for each test question via crowd-sourcing. Experiments show that DrFact outperforms strong baseline methods by a large margin.

Content

Please check the documentation for running the code.

We show the instructions for running four retrieval approaches to the OpenCSR task — BM25 (off-the-shelf), DPR (EMNLP2020), DrKIT (ICLR 2020) and DrFact (ours, NAACL 2021), as well as a concept re-ranker to boost the performance by learning with cross-attention. Note that there is a relative dependency of these four methods:

training the DPR model needs the results from BM25 (to create training data);
DrFact needs to reuse DPR’s fact index and single-hop results (for creating distant supervision);
DrFact and DrKIT share many utility functions (sparse matrix operation and indexing scripts). We detailed the detailed instructions in individual pages.

Outline and Documentation

drfact_data/
- datasets/ (download from here)
- knowledge_corpus/ (download from here)
baseline_methods/
- BM25/ --> https://open-csr.github.io/methods/bm25
- DPR/ --> https://open-csr.github.io/methods/dpr
- MCQA/ (i.e., Concept Re-ranker) --> https://open-csr.github.io/methods/reranker
language-master/language/labs/
- drkit/ (common modules for DrKIT and DrFact)
- drfact/ (for running DrFact)
scripts/
- run_drkit.sh --> https://open-csr.github.io/methods/drkit
- run_drfact.sh --> https://open-csr.github.io/methods/drfact
evaluation/ --> https://open-csr.github.io/evaluation

Citation

@inproceedings{lin-etal-2021-differentiable,
    title = "Differentiable Open-Ended Commonsense Reasoning",
    author = "Lin, Bill Yuchen and Sun, Haitian and Dhingra, Bhuwan and Zaheer, Manzil and Ren, Xiang and Cohen, William",
    booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2021.naacl-main.366",
    pages = "4611--4625"
}

Contact

This repo is now under active development, and there may be issues caused by refactoring code. Please email [email protected] if you have any questions.

Open-Ended Commonsense Reasoning (NAACL 2021)

Related tags

Overview

Open-Ended Commonsense Reasoning

Quick links: [Paper] | [Video] | [Slides] | [Documentation]

Abstract

Content

Please check the documentation for running the code.

Outline and Documentation

Citation

Contact

Owner

(Bill) Yuchen Lin

TraSw for FairMOT - A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID,

Disagreement-Regularized Imitation Learning

Deep learning for spiking neural networks

Convolutional neural network web app trained to track our infant’s sleep schedule using our Google Nest camera.

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

Official PyTorch code for Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution (MANet, ICCV2021)

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers

Import Python modules from dicts and JSON formatted documents.

FlowTorch is a PyTorch library for learning and sampling from complex probability distributions using a class of methods called Normalizing Flows

3D Avatar Lip Syncronization from speech (JALI based face-rigging)

Source code of SIGIR2021 Paper 'One Chatbot Per Person: Creating Personalized Chatbots based on Implicit Profiles'

MOT-Tracking-by-Detection-Pipeline - For Tracking-by-Detection format MOT (Multi Object Tracking), is it a framework that separates Detection and Tracking processes?

Official code for ICCV2021 paper "M3D-VTON: A Monocular-to-3D Virtual Try-on Network"

The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.

This repository contains the exercises and its solution contained in the book "An Introduction to Statistical Learning" in python.

A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN)

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior and Non-local Spatial-Temporal Similarity

Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

[2021][ICCV][FSNet] Full-Duplex Strategy for Video Object Segmentation