Auto translate textbox from Japanese to English or Indonesia

Last update: Aug 25, 2022

Related tags

Text Data & NLP priconne-auto-translate

Overview

priconne-auto-translate

Auto translate textbox from Japanese to English or Indonesia

How to use

Install python first, Anaconda is recommended
Install python depedency with command: pip install -r requirement.txt
Install tesseract for windows, Download here, select tesseract-ocr-w64-setup-v5.0.0-alpha.20210811.exe
Download japanese datapack at this repository
Copy trainneddata file to C:\Program Files\Tesseract-OCR\tessdata
Run Princess Connect Re:Dive
Run run.py with command : python run.py --data fast --translate googleDict or just python run.py

Available command

--data

select japanese detection datapack.\

fast is lightweight but not accurate.
medium is not light but not heavy either, fairly accurate.
best is heavy but it is very accurate. default is best

--translate

Select translator endpoint.\

azure using bing translate, need API key, free is limited 2 millions of character
ibm using IBM Translate, not really accurate.
googleModule using googletrans module, not accurate, IP maybe blocked if too many request
googleDict using google dictionary endpoint, somewhat accurate. Default is googleDict

PR is wellcome!

Yes, I need your help to improve this program. My code is messy but at least it work.
If there any improvement, PR is always open!

Issue

Please open issue if there is any bug or have question!

ToDo

Overlay to the game.
Improve performance with multithreading / multiprocessing

Auto translate textbox from Japanese to English or Indonesia

Related tags

Overview

priconne-auto-translate

How to use

Available command

PR is wellcome!

Issue

ToDo

Owner

Aji Priyo Wibowo

Auto_code_complete is a auto word-completetion program which allows you to customize it on your needs

Experiments in converting wikidata to ftm

iBOT: Image BERT Pre-Training with Online Tokenizer

Wind Speed Prediction using LSTMs in PyTorch

NLP: SLU tagging

Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

Python port of Google's libphonenumber

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

NLP techniques such as named entity recognition, sentiment analysis, topic modeling, text classification with Python to predict sentiment and rating of drug from user reviews.

Huggingface Transformers + Adapters = ❤️

Officile code repository for "A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning"

A Transformer Implementation that is easy to understand and customizable.

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

This github repo is for Neurips 2021 paper, NORESQA A Framework for Speech Quality Assessment using Non-Matching References.

English loanwords in the world's languages

Code for the paper: Sequence-to-Sequence Learning with Latent Neural Grammars

A text augmentation tool for named entity recognition.

Auto translate textbox from Japanese to English or Indonesia

Related tags

Overview

priconne-auto-translate

How to use

Available command

PR is wellcome!

Issue

ToDo

Owner

Aji Priyo Wibowo

Auto_code_complete is a auto word-completetion program which allows you to customize it on your needs

Experiments in converting wikidata to ftm

iBOT: Image BERT Pre-Training with Online Tokenizer

Wind Speed Prediction using LSTMs in PyTorch

NLP: SLU tagging

Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

Python port of Google's libphonenumber

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

NLP techniques such as named entity recognition, sentiment analysis, topic modeling, text classification with Python to predict sentiment and rating of drug from user reviews.

Huggingface Transformers + Adapters = ❤️

Officile code repository for "A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning"

A Transformer Implementation that is easy to understand and customizable.

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

This github repo is for Neurips 2021 paper, NORESQA A Framework for Speech Quality Assessment using Non-Matching References.

English loanwords in the world's languages

Code for the paper: Sequence-to-Sequence Learning with Latent Neural Grammars

A text augmentation tool for named entity recognition.

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。