Machine translation models released by the Gourmet project

Last update: Dec 08, 2021

Related tags

Overview

Gourmet Models

Overview

The Gourmet project has released several machine translation models to translate low-resource languages. This repository contains information about the models, as well as sample code showing how the models can be used. The models themselves are available as dockers, and can be downloaded from a separate site (see below).

Some of the models are described in our public deliverables. See the integration report for details of model training, and the evaluation report for details of evaluation.

Using the Models

The model download links are here. Once downloaded, the model can be deployed using:

docker load <

to launch the translation server, use:

docker run -p 4000:4000 -i --rm

This exposes the model on port 4000. Change the second number above if you want to use a different port.

To test the model, use the sample client like this:

$ ./gourmet-client.py 
2021-11-15 14:54:10 DEBUG: __main__:  Connecting to translation server at localhost4000
2021-11-15 14:54:10 DEBUG: urllib3.connectionpool:  Starting new HTTP connection (1): localhost:4000
2021-11-15 14:54:14 DEBUG: urllib3.connectionpool:  http://localhost:4000 "POST /translation HTTP/1.1" 200 88
Translation: An gudanar da taron sauyin yanayi a Glasgow
Time: 2242
Error: None

This is using the English to Hausa model. You can use the -n and -p arguments to change the host and port. The source text is hard-coded but easily changed.

Some of the models support additional arguments which can be passed to the docker at launch time, using the -e argument to docker (for environemnent) variables. See the models page for more details.

Computational Requirements

All models are configured to use the CPU for translation. A GPU is not required, and the models will not use it.

Licence

CC-BY

Acknowledgements

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825299.

Machine translation models released by the Gourmet project

Related tags

Overview

Gourmet Models

Overview

Using the Models

Computational Requirements

Licence

Acknowledgements

Owner

Edinburgh NLP

Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃

Korean extractive summarization. 2021 AI 텍스트 요약 온라인 해커톤 화성갈끄니까팀 코드

This code extends the neural style transfer image processing technique to video by generating smooth transitions between several reference style images

Fastseq 基于ONNXRUNTIME的文本生成加速框架

Script to download some free japanese lessons in portuguse from NHK

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"

Code Implementation of "Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction".

Text to speech for Vietnamese, ez to use, ez to update

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

A multi-voice TTS system trained with an emphasis on quality

⚖️ A Statutory Article Retrieval Dataset in French.

Huggingface Transformers + Adapters = ❤️

Shared, streaming Python dict

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

Code to reproduce the results of the paper 'Towards Realistic Few-Shot Relation Extraction' (EMNLP 2021)

topic modeling on unstructured data in Space news articles retrieved from the Guardian (UK) newspaper using API

A BERT-based reverse dictionary of Korean proverbs