Meta Language-Specific Layers in Multilingual Language Models

This repo contains the source code for our paper:

On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment

Zirui Wang, Zachary C. Lipton, Yulia Tsvetkov

EMNLP 2020

Introduction

This repo contains code to train multilingual language models (XLM) that (1) contain language-specific layers and (2) meta-learn these layers through second-order gradients (the gradient of a gradient).

Language-specific layers serve as meta-parameters and are optimized with an iterative procedure. The goal is to remedy negative transfer in multilingual models through a meta-training objective. Please see our paper for details.
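As a rough illustration of the gradient-of-gradient idea, the PyTorch sketch below treats one small matrix per language as a meta-parameter and updates it by differentiating a validation loss through a virtual update of the shared weights. Everything in it (the toy linear model, `loss_fn`, `meta_step`, the learning rates, the random batches) is an assumption made for illustration; it is not the XLM architecture, and the exact procedure in the paper may differ.

```python
import torch

torch.manual_seed(0)
dim = 4

# Toy parameters (illustrative assumptions, not the XLM architecture).
shared = torch.randn(dim, dim, requires_grad=True)        # used by all languages
lang_layers = {lang: torch.eye(dim, requires_grad=True)   # one layer per language
               for lang in ("en", "de")}

def loss_fn(shared_w, lang_w, batch):
    """Mean squared error of a linear model: language layer first, shared layer second."""
    x, y = batch
    return ((x @ lang_w @ shared_w - y) ** 2).mean()

def meta_step(shared_w, lang_w, train_batch, val_batch, inner_lr=0.1, meta_lr=0.05):
    # Inner step: compute a virtual update of the shared weights on the training
    # batch, keeping the graph so the update stays differentiable w.r.t. lang_w.
    train_loss = loss_fn(shared_w, lang_w, train_batch)
    (g_shared,) = torch.autograd.grad(train_loss, shared_w, create_graph=True)
    shared_fast = shared_w - inner_lr * g_shared

    # Outer step: evaluate the updated shared weights on held-out data and
    # differentiate through the inner update (the gradient of a gradient).
    val_loss = loss_fn(shared_fast, lang_w, val_batch)
    (g_lang,) = torch.autograd.grad(val_loss, lang_w)

    with torch.no_grad():
        lang_w -= meta_lr * g_lang                 # update the meta-parameters
        shared_w -= inner_lr * g_shared.detach()   # commit the inner update
    return val_loss.item()

# Random data standing in for per-language training and validation batches.
batches = {lang: [(torch.randn(8, dim), torch.randn(8, dim)) for _ in range(2)]
           for lang in lang_layers}

for step in range(3):
    for lang, (train_batch, val_batch) in batches.items():
        last_val = meta_step(shared, lang_layers[lang], train_batch, val_batch)
```

The key call is `torch.autograd.grad(..., create_graph=True)`: without it, the virtual update would be a constant with respect to the language-specific layer and the outer gradient would lose its second-order term.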

Dependencies

Python 3
XLM
NumPy
PyTorch

Usage

The code is based on the official implementation of XLM. This repo contains only the files that we modified from the original codebase. To train a model, merge these files into the XLM source code, then follow the standard preprocessing and training instructions there.
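One possible way to do the merge is to copy each file from this repo onto the same relative path inside an XLM checkout. The helper below is a minimal sketch under that assumption; the function name `overlay` and the directory names in the usage comment are hypothetical, so adjust them to your local layout and back up the XLM checkout first.

```python
import shutil
from pathlib import Path

def overlay(modified_root, xlm_root):
    """Copy every file under modified_root onto the same relative path in xlm_root.

    Existing files are overwritten; files that exist only in xlm_root are untouched.
    Returns the list of destination paths that were written.
    """
    modified_root, xlm_root = Path(modified_root), Path(xlm_root)
    copied = []
    for src in sorted(modified_root.rglob("*")):
        rel = src.relative_to(modified_root)
        # Skip directories and version-control metadata.
        if not src.is_file() or ".git" in rel.parts:
            continue
        dst = xlm_root / rel
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dst)
        copied.append(dst)
    return copied

# Hypothetical usage (both paths are placeholders):
# overlay("path/to/this-repo", "path/to/XLM")
```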
