Translate

Command-line interface to translation pipelines, powered by Huggingface transformers. This tool can download translation models, and then using them to translate sentences offline. By default, tries using models from Helsinki-NLP (each model is about 300MB large).

Install

$ git clone https://github.com/Teuze/translate
$ cd translate
$ pip3 install --user -r requirements.py

If you want to be able to use this script from anywhere in your system, you can symlink or copy the translate script file into one of your path folders, like for example $HOME/.local/bin.

Usage

Listing available and installed translation models :

$ # Also available on https://huggingface.co/models
$ ./translate model list online | less
$ ./translate model list local | less

Downloading models :

$ ./translate download model "Helsinki-NLP/opus-mt-en-es"
$ ./translate download model "Helsinki-NLP/opus-mt-fr-en"

Using models to translate from CLI arguments or from standard input :

$ ./translate text -e "Helsinki-NLP/opus-mt-en-es" "Hello World!"
¡Hola Mundo!
$ echo "Ceci est une phrase d'exemple simple" | ./translate text -s fr -t en
This is a simple example sentence

Partially offline multi-language translator built upon Huggingface transformers.

Related tags

Overview

Translate

Install

Usage

Owner

Richard Jarry

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

Syntax-aware Multi-spans Generation for Reading Comprehension (TASLP 2022)

Beautiful visualizations of how language differs among document types.

Codes to pre-train Japanese T5 models

🗣️ NALP is a library that covers Natural Adversarial Language Processing.

A toolkit for document-level event extraction, containing some SOTA model implementations

RoNER is a Named Entity Recognition model based on a pre-trained BERT transformer model trained on RONECv2

Topic Modelling for Humans

Graphical user interface for Argos Translate

Write Alphabet, Words and Sentences with your eyes.

scikit-learn wrappers for Python fastText.

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

A curated list of efficient attention modules

Code for ACL 2020 paper "Rigid Formats Controlled Text Generation"

A highly sophisticated sequence-to-sequence model for code generation

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

A minimal code for fairseq vq-wav2vec model inference.

2021海华AI挑战赛·中文阅读理解·技术组·第三名

Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).

Extract rooms type, door, neibour rooms, rooms corners nad bounding boxes, and generate graph from rplan dataset