NaturalProofs: Mathematical Theorem Proving in Natural Language

Last update: Jan 05, 2023

Related tags

Overview

NaturalProofs: Mathematical Theorem Proving in Natural Language

NaturalProofs: Mathematical Theorem Proving in Natural Language
Sean Welleck, Jiacheng Liu, Ronan Le Bras, Hannaneh Hajishirzi, Yejin Choi, Kyunghyun Cho

This repo contains:

The NaturalProofs Dataset and the mathematical reference retrieval task data.
Preprocessing NaturalProofs and the retrieval task data.
Training and evaluation for mathematical reference retrieval.
Pretrained models for mathematical reference retrieval.

Please cite our work if you found the resources in this repository useful:

@article{welleck2021naturalproofs,
  title={NaturalProofs: Mathematical Theorem Proving in Natural Language},
  author={Welleck, Sean and Liu, Jiacheng and Le Bras, Ronan and Hajishirzi, Hannaneh and Choi, Yejin and Cho, Kyunghyun},
  year={2021}
}

Section	Subsection
NaturalProofs Dataset	Dataset
	Preprocessing
Mathematical Reference Retrieval	Dataset
	Setup
	Preprocessing
	Pretrained models
	Training
	Evaluation

NaturalProofs Dataset

We provide the preprocessed NaturalProofs Dataset (JSON):

NaturalProofs Dataset
dataset.json [zenodo]

Preprocessing

To see the steps used to create the NaturalProofs dataset.json from raw ProofWiki data:

Download the ProofWiki XML.
Preprocess the data using notebooks/parse_proofwiki.ipynb.
Form the data splits using notebooks/dataset_splits.ipynb.

Mathematical Reference Retrieval

Dataset

The Mathematical Reference Retrieval dataset contains (x, r, y) examples with theorem statements x, positive and negative references r, and 0/1 labels y, derived from NaturalProofs.

We provide the version used in the paper (bert-based-cased tokenizer, 200 randomly sampled negatives):

Reference Retrieval Dataset
`bert-base-cased` 200 negatives

Pretrained Models

Pretrained models
`bert-base-cased`
`lstm`

These models were trained with the "bert-base-cased 200 negatives" dataset provided above.

Setup

python setup.py develop

You can see the DockerFile for additional version info, etc.

Generating and tokenizing

To create your own version of the retrieval dataset, use python utils.py.

This step is not needed if you are using the reference retrieval dataset provided above.

Example:

python utils.py --filepath /path/to/dataset.json --output-path /path/to/out/ --model-type bert-base-cased
# => Writing dataset to /path/to/out/dataset_tokenized__bert-base-cased_200.pkl

Evaluation

Using the retrieval dataset and a model provided above, we compute the test evaluation metrics in the paper:

Predict the rankings:

python naturalproofs/predict.py \
--method bert-base-cased \      # | lstm
--model-type bert-base-cased \  # | lstm
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--checkpoint-path /path/to/best.ckpt \
--output-dir /path/to/out/ \
--split test  # use valid during model development

Compute metrics over the rankings:

python naturalproofs/analyze.py \
--method bert-base-cased \      # | lstm
--eval-path /path/to/out/eval.pkl \
--analysis-path /path/to/out/analysis.pkl

Training

python naturalproofs/model.py \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--default-root-dir /path/to/out/

Classical Retrieval Baselines

TF-IDF example:

python naturalproofs/baselines.py \
--method tfidf \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--savedir /path/to/out/

Then use analyze.py as shown above to compute metrics.

NaturalProofs: Mathematical Theorem Proving in Natural Language

Related tags

Overview

NaturalProofs: Mathematical Theorem Proving in Natural Language

NaturalProofs Dataset

Preprocessing

Mathematical Reference Retrieval

Dataset

Pretrained Models

Setup

Generating and tokenizing

Evaluation

Training

Classical Retrieval Baselines

Owner

Sean Welleck

Deep generative modeling for time-stamped heterogeneous data, enabling high-fidelity models for a large variety of spatio-temporal domains.

End-to-end Temporal Action Detection with Transformer. [Under review]

New AidForBlind - Various Libraries used like OpenCV and other mentioned in Requirements.txt

A collection of IPython notebooks covering various topics.

CVAT is free, online, interactive video and image annotation tool for computer vision

Neural network for stock price prediction

Transfer Learning Remote Sensing

This is a library for training and applying sparse fine-tunings with torch and transformers.

Models, datasets and tools for Facial keypoints detection

Offical implementation for "Trash or Treasure? An Interactive Dual-Stream Strategy for Single Image Reflection Separation".

Lightweight mmm - Lightweight (Bayesian) Media Mix Model

The openspoor package is intended to allow easy transformation between different geographical and topological systems commonly used in Dutch Railway

Solving Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge

百度2021年语言与智能技术竞赛机器阅读理解Pytorch版baseline

Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation

Hypersearch weight debugging and losses tutorial

The FIRST GANs-based omics-to-omics translation framework

A Simulation Environment to train Robots in Large Realistic Interactive Scenes

automated systems to assist guarding corona Virus precautions for Closed Rooms (e.g. Halls, offices, etc..)

Implementation of Graph Convolutional Networks in TensorFlow