Code for Emergent Translation in Multi-Agent Communication

Last update: Jul 15, 2022

Emergent Translation in Multi-Agent Communication

PyTorch implementation of the models described in the paper Emergent Translation in Multi-Agent Communication.

We present code for training and decoding both word- and sentence-level models and baselines, as well as preprocessed datasets.

Dependencies

Python

Python 2.7
PyTorch 0.2
Numpy

GPU

CUDA (we recommend using the latest version; version 8.0 was used in all our experiments)

Related code

For preprocessing, we used scripts from Moses and Subword-NMT.

Downloading Datasets

The original corpora (Bergsma500, Multi30k, MS COCO) can be downloaded from their respective sources. The preprocessed corpora are listed below.

Corpus	Dataset
Bergsma500	Data
Multi30k	Data
MS COCO	Data

Before you run the code

Download the datasets and place them in /data/word (Bergsma500) and /data/sentence (Multi30k and MS COCO).
Set the correct paths in scr_path() in /src/word/util.py, and in scr_path(), multi30k_reorg_path() and coco_path() in /src/sentence/util.py.
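
As a rough illustration of the second step, the path helpers in /src/sentence/util.py might end up looking something like the sketch below. Only the function names come from the instructions above. The function bodies, the `REPO_ROOT` constant, and what each function is assumed to return are guesses made for illustration and may not match the actual file.

```python
import os

# Hypothetical location of the checked-out repository; adjust to your machine.
REPO_ROOT = os.path.expanduser("~/emergent-translation")


def scr_path():
    # Assumption: returns the root directory of the sentence-level data.
    return os.path.join(REPO_ROOT, "data", "sentence")


def multi30k_reorg_path():
    # Assumption: Multi30k lives in its own subdirectory of the data root.
    return os.path.join(scr_path(), "multi30k")


def coco_path():
    # Assumption: MS COCO lives in its own subdirectory of the data root.
    return os.path.join(scr_path(), "coco")


if __name__ == "__main__":
    # Report which of the configured directories exist on this machine.
    for fn in (scr_path, multi30k_reorg_path, coco_path):
        path = fn()
        print("{}() -> {} (exists: {})".format(fn.__name__, path, os.path.isdir(path)))
```

Running the file directly prints each configured path and whether it exists, which can help catch a misplaced dataset before starting a long training run.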

Word-level Models

Running nearest neighbour baselines

$ python word/bergsma_bli.py

Running our models

$ python word/train_word_joint.py --l1 <L1> --l2 <L2>

where <L1> and <L2> are any of {en, de, es, fr, it, nl}
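
To train on several language pairs in one go, the command above can be generated programmatically. The following is a minimal sketch, assuming each unordered pair is trained once and that `--l1`/`--l2` take the language codes listed above. The helper name `word_training_commands` is hypothetical and not part of the repository.

```python
from itertools import combinations

# Language codes accepted by train_word_joint.py, as listed in this README.
LANGUAGES = ["en", "de", "es", "fr", "it", "nl"]


def word_training_commands(languages=LANGUAGES):
    """Return one training command per unordered language pair (hypothetical helper)."""
    return [
        "python word/train_word_joint.py --l1 {} --l2 {}".format(l1, l2)
        for l1, l2 in combinations(languages, 2)
    ]


if __name__ == "__main__":
    # Print the commands so they can be inspected or piped into a shell.
    for command in word_training_commands():
        print(command)
```

If the order of `<L1>` and `<L2>` matters for your experiment, `itertools.permutations` can be substituted for `combinations`.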

Sentence-level Models

Baseline 1: Nearest neighbour

$ python sentence/baseline_nn.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG>

Baseline 2: NMT with neighbouring sentence pairs

$ python sentence/nmt.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG> --nn_baseline

Baseline 3: Nakayama and Nishida, 2017

$ python sentence/train_naka_encdec.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG> --train_enc_how <ENC_HOW> --train_dec_how <DEC_HOW>

where <ENC_HOW> is either two or three, and <DEC_HOW> is one of img, des, or both.

Our models

$ python sentence/train_seq_joint.py --dataset <DATASET> --task <TASK>

Aligned NMT

$ python sentence/nmt.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG>

In all sentence-level commands, <DATASET> is either multi30k or coco, and <TASK> is either 1 or 2 (<TASK> applies only to Multi30k).
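
The argument constraints above can be checked before launching a run. The sketch below builds a sentence-level command line and rejects invalid combinations. The helper `sentence_command` is hypothetical, the `en`/`de` values in the usage example are illustrative only, and omitting `--task` for coco is an assumption based on the note that <TASK> applies only to Multi30k.

```python
def sentence_command(script, dataset, task=None, src=None, trg=None, extra=()):
    """Build a command line for a sentence-level script (hypothetical helper)."""
    if dataset not in ("multi30k", "coco"):
        raise ValueError("dataset must be 'multi30k' or 'coco'")
    if task is not None:
        # <TASK> is only meaningful for Multi30k and must be 1 or 2.
        if dataset != "multi30k":
            raise ValueError("--task applies only to multi30k")
        if task not in (1, 2):
            raise ValueError("task must be 1 or 2")

    parts = ["python", "sentence/" + script, "--dataset", dataset]
    if task is not None:
        parts += ["--task", str(task)]
    if src is not None:
        parts += ["--src", src]
    if trg is not None:
        parts += ["--trg", trg]
    parts += list(extra)  # e.g. ["--nn_baseline"] for Baseline 2
    return " ".join(parts)


if __name__ == "__main__":
    # Baseline 2 on Multi30k task 1; the language codes are illustrative values.
    print(sentence_command("nmt.py", "multi30k", task=1, src="en", trg="de",
                           extra=["--nn_baseline"]))
```

Validating the arguments up front avoids discovering a mistyped dataset name only after the script has started loading data.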

Dataset & Related Code Attribution

Moses is licensed under the LGPL, and Subword-NMT is licensed under the MIT License.
MS COCO and Multi30k are licensed under Creative Commons.

Citation

If you find the resources in this repository useful, please consider citing:

@inproceedings{Lee:18,
  author    = {Jason Lee and Kyunghyun Cho and Jason Weston and Douwe Kiela},
  title     = {Emergent Translation in Multi-Agent Communication},
  year      = {2018},
  booktitle = {Proceedings of the International Conference on Learning Representations},
}

Owner

Facebook Research
