Code for Text Prior Guided Scene Text Image Super-Resolution

Last update: Dec 26, 2022

Related tags

Text Data & NLP TPGSR

Overview

Text Prior Guided Scene Text Image Super-Resolution

https://arxiv.org/abs/2106.15368

Jianqi Ma, Shi Guo, Lei Zhang
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China

Recovering TextZoom samples

Environment:

Other possible python packages like pyyaml, cv2, Pillow and imgaug

Main idea

Single stage with loss

Multi-stage version

Configure your training

Download the pretrained recognizer from:

Aster: https://github.com/ayumiymk/aster.pytorch  
MORAN:  https://github.com/Canjie-Luo/MORAN_v2  
CRNN: https://github.com/meijieru/crnn.pytorch

Unzip the codes and walk into the '$TPGSR_ROOT$/', place the pretrained weights from recognizer in '$TPGSR_ROOT$/'.

Download the TextZoom dataset:

https://github.com/JasonBoy1/TextZoom

Train the corresponding model (e.g. TPGSR-TSRN):

chmod a+x train_TPGSR-TSRN.sh
./train_TPGSR-TSRN.sh
or
python3 main.py --arch="tsrn_tl_cascade" \       # The architecture
                --batch_size=48 \                # The batch size
                --STN \                          # Using STN net for alignment
		--mask \                         # Using the contour mask
		--use_distill \                  # Using the TP loss
		--gradient \                     # Using the Gradient Prior Loss
		--sr_share \                     # Sharing weights for SR Module
		--stu_iter=1 \                   # The number of interations in multi-stage version
		--vis_dir='vis_TPGSR-TSRN' \     # The checkpoint directory

Run the test-prefixed shell to test the corresponding model.

Adding '--go_test' in the shell file

Cite this paper:

@article{ma2021text,
title={Text Prior Guided Scene Text Image Super-resolution},
author={Ma, Jianqi and Guo, Shi and Zhang, Lei},
journal={arXiv preprint arXiv:2106.15368},
year={2021}
}

Code for Text Prior Guided Scene Text Image Super-Resolution

Related tags

Overview

Text Prior Guided Scene Text Image Super-Resolution

Recovering TextZoom samples

Environment:

Main idea

Single stage with loss

Multi-stage version

Configure your training

Download the pretrained recognizer from:

Download the TextZoom dataset:

Train the corresponding model (e.g. TPGSR-TSRN):

Run the test-prefixed shell to test the corresponding model.

Cite this paper:

Owner

This repo contains simple to use, pretrained/training-less models for speaker diarization.

Under the hood working of transformers, fine-tuning GPT-3 models, DeBERTa, vision models, and the start of Metaverse, using a variety of NLP platforms: Hugging Face, OpenAI API, Trax, and AllenNLP

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

A collection of models for image - text generation in ACM MM 2021.

Experiments in converting wikidata to ftm

A Structured Self-attentive Sentence Embedding

An attempt to map the areas with active conflict in Ukraine using open source twitter data.

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

Korean Simple Contrastive Learning of Sentence Embeddings using SKT KoBERT and kakaobrain KorNLU dataset

Source code for AAAI20 "Generating Persona Consistent Dialogues by Exploiting Natural Language Inference".

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

Application for shadowing Chinese.

Topic Inference with Zeroshot models

Maix Speech AI lib, including ASR, chat, TTS etc.

Turkish Stop Words Türkçe Dolgu Sözcükleri

Natural Language Processing Specialization

Sinkhorn Transformer - Practical implementation of Sparse Sinkhorn Attention

NLP made easy

Collection of useful (to me) python scripts for interacting with napari