Code for the paper: Sequence-to-Sequence Learning with Latent Neural Grammars

Last update: Dec 23, 2022

Related tags

Overview

Sequence-to-Sequence Learning with Latent Neural Grammars

Code for the paper:
Sequence-to-Sequence Learning with Latent Neural Grammars
Yoon Kim
arXiv Preprint

Dependencies

The code was tested in python 3.7 and pytorch 1.5. We also use a slightly modified version of the Torch-Struct library, which is included in the repo and can be installed via:

cd pytorch-struct
python setup.py install

Data

For convenience we include the datasets used in the paper in the data/ folder. Please cite the original papers when using the data (i.e. Lake and Baroni 2018 for SCAN/MT, and Lyu et al. 2021 for StylePTB).

Training

SCAN

To train the model on (for example) the length split:

python train_scan.py --train_file data/SCAN/tasks_train_length.txt --save_path scan-length.pt

For prediction and evaluation:

python predict_scan.py --data_file data/SCAN/tasks_test_length.txt --model_path scan-length.pt

Style Transfer

To train on (for example) the active-to-passive task:

python train_styleptb.py --train_file data/StylePTB/ATP/train.tsv --dev_file data/StylePTB/ATP/valid.tsv --save_path styleptb-atp.pt

To predict:

python predict_styleptb.py --data_file data/StylePTB/ATP/test.tsv --model_path styleptb-atp.pt 
--out_file styleptb-atp-pred.txt

We use the nlg-eval package to calculate the various metrics.

Machine Translation

To train on MT:

python train_mt.py --train_file_src data/MT/train.en --train_file_tgt data/MT/train.fr 
--dev_file_src data/MT/dev.en --dev_file_tgt data/MT/dev.fr --save_path mt.pt

To predict on the daxy test set:

python predict_mt.py --data_file data/MT/test-daxy.en --model_path mt.pt --out_file mt-pred-daxy.txt

For the regular test set:

python predict_mt.py --data_file data/MT/test.en --model_path mt.pt --out_file mt-pred.txt

We use the multi-bleu script to calculate BLEU.

Training Stability

We observed training to be unstable and the approach required several runs across different seeds to perform well. For reference we have posted logs of some example runs in the logs/ folder.

License

MIT

Code for the paper: Sequence-to-Sequence Learning with Latent Neural Grammars

Related tags

Overview

Sequence-to-Sequence Learning with Latent Neural Grammars

Dependencies

Data

Training

SCAN

Style Transfer

Machine Translation

Training Stability

License

Owner

Yoon Kim

Python library for processing Chinese text

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

Switch spaces for knowledge graph embeddings

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Skipgram Negative Sampling in PyTorch

🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

CCKS-Title-based-large-scale-commodity-entity-retrieval-top1

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Creating a python chatbot that Starbucks users can text to place an order + help cut wait time of a normal coffee.

REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.

Parrot is a paraphrase based utterance augmentation framework purpose built to accelerate training NLU models

TensorFlow code and pre-trained models for BERT

Gpt2-WebAPI - The objective of this API is to provide the 3 best possible responses to sentences that the user would input via http GET request as a parameter

🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴

Visual Automata is a Python 3 library built as a wrapper for Caleb Evans' Automata library to add more visualization features.

🕹 An esoteric language designed so that the program looks like the transcript of a Pokémon battle

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

skweak: A software toolkit for weak supervision applied to NLP tasks