"Investigating the Limitations of Transformers with Simple Arithmetic Tasks", 2021

Last update: Nov 16, 2022

Overview

transformers-arithmetic

This repository contains the code to reproduce the experiments from the paper:

Nogueira, Jiang, Lin. "Investigating the Limitations of Transformers with Simple Arithmetic Tasks", 2021.

First, install the required packages:

pip install -r requirements.txt

The command below trains and evaluates a T5-base model on the task of adding numbers with up to 15 digits:

python main.py \
    --output_dir=. \
    --model_name_or_path=t5-base \
    --operation=addition \
    --orthography=10ebased \
    --balance_train \
    --balance_val \
    --train_size=100000 \
    --val_size=10000 \
    --test_size=10000 \
    --min_digits_train=2 \
    --max_digits_train=15 \
    --min_digits_test=2 \
    --max_digits_test=15 \
    --base_number=10 \
    --seed=1 \
    --train_batch_size=4 \
    --accumulate_grad_batches=32 \
    --val_batch_size=32 \
    --max_seq_length=512 \
    --num_workers=4 \
    --gpus=1 \
    --optimizer=AdamW \
    --lr=3e-4 \
    --weight_decay=5e-5 \
    --scheduler=StepLR \
    --t_0=2 \
    --t_mult=2 \
    --gamma=1.0 \
    --step_size=1000 \
    --max_epochs=20 \
    --check_val_every_n_epoch=2 \
    --amp_level=O0 \
    --precision=32 \
    --gradient_clip_val=1.0
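
The `--orthography=10ebased` flag selects how numbers are written out for the model. As a rough illustration, and assuming the format matches the "10e-based" representation described in the paper (each digit followed by its power of ten, e.g. `8 10e2 3 10e1 2 10e0` for 832), the conversion might look like the sketch below. The function names `to_10ebased` and `from_10ebased` are hypothetical and are not taken from this repository; details such as how zero digits are handled are also assumptions.

```python
def to_10ebased(number: int) -> str:
    """Write a non-negative integer as '<digit> 10e<power>' pairs (assumed format)."""
    digits = str(number)
    highest_power = len(digits) - 1
    # Pair every digit with its power of ten, most significant digit first.
    pairs = [f"{d} 10e{highest_power - i}" for i, d in enumerate(digits)]
    return " ".join(pairs)


def from_10ebased(text: str) -> int:
    """Inverse of to_10ebased: sum digit * 10**power over all pairs."""
    tokens = text.split()
    total = 0
    # Tokens alternate between a digit and its '10e<power>' marker.
    for digit, marker in zip(tokens[0::2], tokens[1::2]):
        power = int(marker[len("10e"):])
        total += int(digit) * 10 ** power
    return total


if __name__ == "__main__":
    print(to_10ebased(832))
    print(from_10ebased("8 10e2 3 10e1 2 10e0"))
```

Making the position of each digit explicit in this way is the motivation given in the paper for this representation; the sketch only shows the string format, not how the repository tokenizes it.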

Training with this command should take about 10 hours on a single V100 GPU.
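
For orientation, the batch-related flags in the command combine into an effective batch size and a number of optimizer steps. The sketch below is plain arithmetic under the assumption that `--accumulate_grad_batches` and `--gpus` follow the usual gradient-accumulation semantics (one optimizer step per 32 mini-batches, with a final partial group still producing a step); the helper `training_schedule` is hypothetical and not part of this repository.

```python
import math


def training_schedule(train_size, batch_size, accumulate, gpus, epochs):
    """Return (effective batch size, optimizer steps per epoch, total steps)."""
    # Examples contributing to one optimizer step.
    effective_batch = batch_size * accumulate * gpus
    # Mini-batches seen in one epoch.
    batches_per_epoch = math.ceil(train_size / (batch_size * gpus))
    # Assumption: a trailing partial accumulation group still triggers a step.
    steps_per_epoch = math.ceil(batches_per_epoch / accumulate)
    return effective_batch, steps_per_epoch, steps_per_epoch * epochs


if __name__ == "__main__":
    # Values copied from the command above.
    print(training_schedule(train_size=100_000, batch_size=4,
                            accumulate=32, gpus=1, epochs=20))
```

With the values from the command, this gives an effective batch of 128 examples and roughly 782 optimizer steps per epoch.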

The exact-match accuracy on the test set should be 1.0:

--------------------------------------------------------------------------------
DATALOADER:0 TEST RESULTS
{'test_exact_match': 1.0000}
--------------------------------------------------------------------------------
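
As a point of reference, exact match is commonly computed as the fraction of predictions whose text equals the target exactly. The sketch below assumes that definition, with surrounding whitespace ignored; the function `exact_match` is hypothetical and may differ from how `main.py` computes `test_exact_match`.

```python
def exact_match(predictions, targets):
    """Fraction of predictions identical to their targets (assumed definition)."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        return 0.0
    # A prediction counts only if the whole string matches, not digit by digit.
    hits = sum(p.strip() == t.strip() for p, t in zip(predictions, targets))
    return hits / len(targets)


if __name__ == "__main__":
    print(exact_match(["12", "34", "56"], ["12", "34", "57"]))
```

Under this definition a score of 1.0 means every sum in the test set was reproduced without a single wrong digit.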

Owner

Castorini
