Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

Last update: May 18, 2022

Related tags

Deep Learning multiDDS

Overview

Balancing Training for Multilingual Neural Machine Translation

Implementation of the paper

Balancing Training for Multilingual Neural Machine Translation

Xinyi Wang, Yulia Tsvetkov, Graham Neubig

Data:

The preprocessed and binarized data for fairseq can be downloaded here

To process data from scrach, see the script

util_scripts/prepare_multilingual_data.sh

Training Scripts:

The training scripts for many-to-one translation of the related language group (Related M2O) is under the directory job_scripts/related_ted8_m2o/.

Our methods:

MultiDDS-S:

job_scripts/related_ted8_m2o/multidds_s.sh

MultiDDS:

job_scripts/related_ted8_m2o/multidds.sh

Baselines:

Proportional:

job_scripts/related_ted8_m2o/proportional.sh

Temperature:

job_scripts/related_ted8_m2o/temperature.sh

The scripts for Related O2M is under the directory job_scripts/related_ted8_o2m/

The scripts for Diverse M2O is under the directory job_scripts/diverse_ted8_m2o/

The scripts for Diverse O2M is under the directory job_scripts/diverse_ted8_o2m/

Inference Scripts:

Each of the experiment script directory contains a trans.sh file to translate the test set. To translate the test set for the Related M2O MultiDDS-S

job_scripts/related_ted8_m2o/trans.sh checkpoints/related_ted8_m2o/multidds_s/

To translate other experiment, simply replace the argument with the experiment checkpoint directory.

Citation

Please cite as:

@inproceedings{wang2020multiDDS,
  title = {Balancing Training for Multilingual Neural Machine Translation},
  author = {Xinyi Wang, Yulia Tsvetkov, Graham Neubig},
  booktitle = {ACL},
  year = {2020},
}

Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

Related tags

Overview

Balancing Training for Multilingual Neural Machine Translation

Data:

Training Scripts:

Inference Scripts:

Citation

Owner

Xinyi Wang

T-LOAM: Truncated Least Squares Lidar-only Odometry and Mapping in Real-Time

This is the repository for the NeurIPS-21 paper [Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labels].

This repo contains the implementation of the algorithm proposed in Off-Belief Learning, ICML 2021.

Siamese-nn-semantic-text-similarity - A repository containing comprehensive Neural Networks based PyTorch implementations for the semantic text similarity task

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

MTA:SA Server Configer.

Easy to use Audio Tagging in PyTorch

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

AntroPy: entropy and complexity of (EEG) time-series in Python

Testing the Facial Emotion Recognition (FER) algorithm on animations

Human4D Dataset tools for processing and visualization

Code and dataset for ACL2018 paper "Exploiting Document Knowledge for Aspect-level Sentiment Classification"

Knowledge Management for Humans using Machine Learning & Tags

Implementation for Homogeneous Unbalanced Regularized Optimal Transport

Python version of the amazing Reaction Mechanism Generator (RMG).

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

这是一个facenet-pytorch的库，可以用于训练自己的人脸识别模型。

A python implementation of Physics-informed Spline Learning for nonlinear dynamics discovery

An Active Automata Learning Library Written in Python

Elastic weight consolidation technique for incremental learning.