Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Last update: Dec 23, 2022

Related tags

Overview

Surface Form Competition

This is the official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right" We provide scripts for downloading/processing datasets and for reproducing our results on GPT-2 and GPT-3. We do not guarantee exact reproducibility, as library versions and GPUs may cause small differences, but these should be extremely minor.

Dependencies

We use python3 and pytorch 1.7.0, but we do not use cutting-edge features from either and expect to be largely forward and backward compatible. That is not a guarantee or promise.

You can use pip install -r requirements.txt to install the required libraries.

OpenAI Beta

To use GPT-3 you must use OpenAI Beta, which is limited access. You can apply for access here. Once you have access you will need to point the score.py to your API key with the --key argument or put your key in api.key which is the default path.

Downloading Datasets

DATA_README.md has thorough instructions for downloading and processing datasets. We provide automatic downloaders and processers for datasets where possible in data_downloaders/ but see DATA_README for full instructions.

Running Scorers

Once you have a dataset downloaded, running all the zero-shot scoring strategies at once is as simple as:

python score.py 
   
     --model

where is the abbreviation for a given dataset used for table rows in the paper. If there is any confusion, simply look in score.py to see how dataset selection works. is the name of either a GPT-2 or GPT-3 model e.g. xl, davinci, etc. To speed things up you can use a larger --batch if you have enough GPU memory.

Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Related tags

Overview

Surface Form Competition

Dependencies

OpenAI Beta

Downloading Datasets

Running Scorers

Owner

Peter West

Fbone (Flask bone) is a Flask (Python microframework) starter/template/bootstrap/boilerplate application.

Deep Reinforcement Learning based Trading Agent for Bitcoin

This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, Alan Yuille).

Code for paper "Multi-level Disentanglement Graph Neural Network"

FairMOT - A simple baseline for one-shot multi-object tracking

Official implementation of "Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform", ICCV 2021

Unsupervised Representation Learning by Invariance Propagation

Official implementation of "Open-set Label Noise Can Improve Robustness Against Inherent Label Noise" (NeurIPS 2021)

Contrastive Language-Image Pretraining

Semantic Bottleneck Scene Generation

A simple, unofficial implementation of MAE using pytorch-lightning

PyTorch Implementation of PIXOR: Real-time 3D Object Detection from Point Clouds

Split Variational AutoEncoder

The code repository for "PyCIL: A Python Toolbox for Class-Incremental Learning" in PyTorch.

YOLOV4运行在嵌入式设备上

Simple transformer model for CIFAR10

[NeurIPS 2020] Official Implementation: "SMYRF: Efficient Attention using Asymmetric Clustering".

This is the repo for the paper `SumGNN: Multi-typed Drug Interaction Prediction via Efficient Knowledge Graph Summarization'. (published in Bioinformatics'21)

CLIP + VQGAN / PixelDraw

Self-describing JSON-RPC services made easy