MetaNLI

Meta-learning algorithms for training cross-lingual NLI (multi-task) models.

Train (source task)

Reptile

To train the model using the Reptile algorithm, run the command below:

python reptile.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --queue_len 4 \
    --temp 5.0 \
    --epochs 1 \
    --meta_lr 1e-5 \
    --scheduler \
    --gamma 0.5 \
    --step_size 4000 \
    --shot 4 \
    --meta_iteration 8000 \
    --log_interval 300
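
For orientation, the sketch below shows the general shape of a Reptile meta-update on a toy one-parameter problem. It is not taken from reptile.py: the function names (`inner_sgd`, `reptile_step`, `step_decay`), the quadratic toy loss, and the task targets are all hypothetical. It also assumes that `--scheduler`, `--gamma`, and `--step_size` correspond to a StepLR-style decay of the meta learning rate, which the README does not confirm.

```python
def inner_sgd(theta, target, steps=5, lr=0.1):
    """Adapt to one task: a few SGD steps on the toy loss (phi - target)^2."""
    phi = theta
    for _ in range(steps):
        grad = 2.0 * (phi - target)
        phi -= lr * grad
    return phi


def reptile_step(theta, task_targets, meta_lr):
    """One Reptile meta-update: move theta toward the mean of the adapted weights."""
    adapted = [inner_sgd(theta, t) for t in task_targets]
    mean_delta = sum(phi - theta for phi in adapted) / len(adapted)
    return theta + meta_lr * mean_delta


def step_decay(base_lr, iteration, step_size, gamma):
    """Assumed StepLR-style schedule: multiply the rate by gamma every step_size iterations."""
    return base_lr * gamma ** (iteration // step_size)


# Toy stand-ins for four source tasks (hypothetical optima, not real data).
task_targets = [1.0, 2.0, 3.0, 4.0]
theta = 0.0
for iteration in range(200):
    meta_lr = step_decay(base_lr=0.5, iteration=iteration, step_size=100, gamma=0.5)
    theta = reptile_step(theta, task_targets, meta_lr)
```

In this toy setting `theta` settles near the average of the task optima, which is the initialization from which every task is reachable in a few inner steps. Under the same StepLR assumption, the command above would halve the meta learning rate once, at iteration 4000 of 8000.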

Prototypical

To train the model using the Prototypical Networks algorithm, run the command below:

python prototype.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --target_task sc_fa \
    --epochs 1 \
    --meta_lr 1e-5 \
    --lambda_1 1 \
    --lambda_2 1 \
    --scheduler \
    --gamma 0.5 \
    --step_size 1000 \
    --shot 8 \
    --query_num 0 \
    --target_shot 8 \
    --meta_iteration 2500 \
    --log_interval 50
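
As a rough illustration of the idea behind Prototypical Networks, the sketch below builds one prototype per class as the mean of its support embeddings and assigns a query to the nearest prototype by squared Euclidean distance. It is not taken from prototype.py: the function names, the two-dimensional embeddings, and the label names are hypothetical, and a real model would produce the embeddings with an encoder.

```python
def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def squared_distance(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_prototypes(support):
    """Map each label to the mean of its support embeddings."""
    return {label: mean_vector(vectors) for label, vectors in support.items()}


def classify(query, prototypes):
    """Return the label whose prototype is closest to the query embedding."""
    return min(prototypes, key=lambda label: squared_distance(query, prototypes[label]))


# Hypothetical support set: two toy embeddings per class.
support = {
    "entailment": [[1.0, 0.0], [0.8, 0.2]],
    "contradiction": [[0.0, 1.0], [0.2, 0.8]],
    "neutral": [[0.5, 0.5], [0.6, 0.6]],
}
prototypes = build_prototypes(support)
prediction = classify([0.9, 0.1], prototypes)
```

The number of support examples per class in this sketch plays the role that `--shot` presumably plays in the command above, though the README does not define that flag.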

Zero-shot Test (on target task)

To perform a zero-shot test of the trained model on the target task, run the command below:

python zeroshot.py \
    --load saved/model_sc.pt \
    --task sc_fa

Fine-tune (target task)

To fine-tune the trained model on the target task, run the command below:

python finetune.py \
    --save saved \
    --model_filename fine.pt \
    --load saved/model_sc.pt \
    --task sc_fa \
    --epochs 5 \
    --lr 1e-5

Owner

M.Hassan Mojab
