JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Last update: Oct 26, 2022

Overview

This is the repository for the paper "JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation".

Find extensions of this work and new pre-trained models here: code, paper

Requirements

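Install OpenNMT-py (version 1.0) and subword-nmt. The unpinned commands below fetch the latest releases from PyPI, so pin OpenNMT-py to a 1.0.x release if a newer version gets installed.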

pip install OpenNMT-py
pip install subword-nmt

Pre-trained JASS models

We release JASS models for two language pairs: Japanese-English (ja+en) and Japanese-Russian (ja+ru). The Japanese side is pre-trained with our proposed JASS methods, while MASS is used for English and Russian.

Model	Vocabulary	BPE codes
JASS-jaen	ja-en	ja-en.bpe.codes
JASS-jaru	ja-ru	ja-ru.bpe.codes

Usage

Apply BPE preprocessing to the dataset you want to fine-tune on, using the released BPE codes. When running OpenNMT's preprocess.py, set the downloaded vocabulary as both the source (src) and target (tgt) vocabulary. Then fine-tune the pre-trained model by passing it to OpenNMT's train.py through the train_from argument.
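The steps above can be sketched as a small Python script that assembles the three kinds of commands (BPE segmentation, preprocessing, fine-tuning) for the ja+en pair. This is only an illustration under assumptions: the data file names (train.ja, valid.en, ...), the output paths, and the checkpoint name JASS-jaen.pt are hypothetical, and the script assumes it is run from an OpenNMT-py 1.0 checkout where preprocess.py and train.py are available. Whether extra options (for example a shared vocabulary) are needed depends on how the released model was trained, which is not stated here.

```python
import shlex


def bpe_command(codes, raw_path, bpe_path):
    # subword-nmt CLI: segment one raw text file with the released BPE codes.
    return ["subword-nmt", "apply-bpe", "-c", codes, "-i", raw_path, "-o", bpe_path]


def preprocess_command(vocab, src, tgt, save_data):
    # OpenNMT-py 1.0 preprocess.py: reuse the downloaded vocabulary on both sides.
    # The train.bpe.* / valid.bpe.* file names are hypothetical.
    return [
        "python", "preprocess.py",
        "-train_src", f"train.bpe.{src}", "-train_tgt", f"train.bpe.{tgt}",
        "-valid_src", f"valid.bpe.{src}", "-valid_tgt", f"valid.bpe.{tgt}",
        "-src_vocab", vocab, "-tgt_vocab", vocab,
        "-save_data", save_data,
    ]


def finetune_command(data, checkpoint, save_model):
    # OpenNMT-py 1.0 train.py: -train_from starts from the pre-trained checkpoint.
    return ["python", "train.py", "-data", data,
            "-train_from", checkpoint, "-save_model", save_model]


def pipeline(codes, vocab, checkpoint, src="ja", tgt="en"):
    # Four BPE commands (train/valid x src/tgt), then preprocess, then fine-tune.
    commands = [
        bpe_command(codes, f"{split}.{lang}", f"{split}.bpe.{lang}")
        for split in ("train", "valid")
        for lang in (src, tgt)
    ]
    commands.append(preprocess_command(vocab, src, tgt, "data/finetune"))
    commands.append(finetune_command("data/finetune", checkpoint, "models/finetuned"))
    return commands


if __name__ == "__main__":
    # "JASS-jaen.pt" is an assumed file name for the downloaded JASS-jaen model.
    for cmd in pipeline("ja-en.bpe.codes", "ja-en", "JASS-jaen.pt"):
        print(" ".join(shlex.quote(part) for part in cmd))
```

The script only prints the commands so they can be reviewed and adapted before running them by hand.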

Others

We plan to update the current Japanese-English pre-trained model and to release pre-trained models for Japanese-Chinese and Japanese-Korean. New models have been released here: code

Reference

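[1] Zhuoyuan Mao, Fabien Cromieres, Raj Dabre, Haiyue Song, Sadao Kurohashi. JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation. In Proceedings of The 12th Language Resources and Evaluation Conference, pages 3683-3691, Marseille, France, May 2020.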

@inproceedings{mao-etal-2020-jass,
    title = "{JASS}: {J}apanese-specific Sequence to Sequence Pre-training for Neural Machine Translation",
    author = "Mao, Zhuoyuan  and
      Cromieres, Fabien  and
      Dabre, Raj  and
      Song, Haiyue  and
      Kurohashi, Sadao",
    booktitle = "Proceedings of The 12th Language Resources and Evaluation Conference",
    month = may,
    year = "2020",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://www.aclweb.org/anthology/2020.lrec-1.454",
    pages = "3683--3691",
    language = "English",
    ISBN = "979-10-95546-34-4",
}

Owner

Zhuoyuan Mao
