The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

Last update: Dec 17, 2022

Overview

The implementation of Parameter Differentiation based Multilingual Neural Machine Translation .

Requirement:

apex
fairseq
scikit-learn
pytorch

Process data following https://github.com/pytorch/fairseq/tree/main/examples/translation#multilingual-translation.
Training:

data_bin=    # data path 
lang_pairs=  # comma separated language pairs

fairseq-train $data_path \
    --task parameter_differentiation_task --lang-pairs $lang_pairs --encoder-langtok tgt \
    --criterion label_smoothed_cross_entropy --label-smoothing 0.1 \
    --optimizer adam --lr 0.0015 --adam-betas '(0.9,0.98)' \
    --lr-scheduler inverse_sqrt --warmup-updates 4000 --warmup-init-lr 1e-07 \
    --arch parameter_differentiation_base_model \
    --max-tokens 8192 \
    --user-dir $PWD

Decoding

source_lang=
target_lang=
model_path=
fairseq-generate $data_path --path $model_path \
    --task parameter_differentiation_task --lang-pairs $lang_pairs --encoder-langtok tgt \
    --beam 4 --lenpen 0.6 --remove-bpe sentencepiece \
    --source-lang $source_lang --target-lang $target_lang > result.$source_lang-$target_lang.txt

The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

Related tags

Overview

Owner

Qian Wang

Kurumi ChatBot

Spooky Skelly For Python

The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques

The RWKV Language Model

Community and sentiment analysis based on tweets

Natural Language Processing

This is a project built for FALLABOUT2021 event under SRMMIC, This project deals with NLP poetry generation.

Sentiment Classification using WSD, Maximum Entropy & Naive Bayes Classifiers

An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences"

Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER adversarial training part

ReCoin - Restoring our environment and businesses in parallel

Every Google, Azure & IBM text to speech voice for free

📝An easy-to-use package to restore punctuation of the text.

中文医疗信息处理基准CBLUE: A Chinese Biomedical LanguageUnderstanding Evaluation Benchmark

ASCEND Chinese-English code-switching dataset

Bu Chatbot, Konya Bilim Merkezi Yen için tasarlanmış olan bir projedir.

Refactored version of FastSpeech2

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021