A curated list of papers and resources for scene text detection and recognition

Overview

Awesome

Awesome Scene Text

A curated list of papers and resources for scene text detection and recognition

The year when a paper was first published, including ArXiv publications, is used. As a result, there may be cases when a paper was accepted for example to CVPR 2019, but it is listed in year 2018 because it was published in 2018 on ArXiv.

Table of contents
1. Scene Text Detection
2. Weakly Supervised Scene Text Detection
3. Scene Text Recognition
4. Other scene text papers
5. Scene Text Survey papers

Scene Text Detection (including methods for end-to-end detection and recognition)

2010

  • Detecting text in natural scenes with stroke width transform [CVPR 2010] [paper]
  • A Method for Text Localization and Recognition in Real-World Images [ACCV 2010] [paper]

2011

2012

  • Real-time scene text localization and recognition [CVPR 2012] [paper]

2013

2014

  • Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees [ECCV 2014] [paper]

2015

  • Symmetry-based text line detection in natural scenes [CVPR 2015] [paper]
  • Object proposals for text extraction in the wild [ICDAR 2015] [paper]
  • Text-Attentional Convolutional Neural Network for Scene Text Detection [TIP 2016] [paper]
  • Text Flow : A Unified Text Detection System in Natural Scene Images [ICCV 2015] [paper]

2016

2017

2018

2019

  • MSR: Multi-Scale Shape Regression for Scene Text Detection [IJCAI 2019] [paper]
  • Scene Text Detection with Inception Text Proposal Generation Module [ICMLC 2019] [paper]
  • Towards Robust Curve Text Detection with Conditional Spatial Expansion [CVPR 2019] [paper]
  • Curve Text Detection with Local Segmentation Network and Curve Connection [ArXiv] [paper]
  • Pyramid Mask Text Detector [ArXiv] [paper]
  • Tightness-aware Evaluation Protocol for Scene Text Detection [CVPR 2019] [paper]
  • Character Region Awareness for Text Detection [CVPR 2019] [paper]
  • Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes [CVPR 2019] [paper]
  • TextCohesion: Detecting Text for Arbitrary Shapes [ArXiv] [paper]
  • Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation [CVPR 2019] [paper]
  • Learning Shape-Aware Embedding for Scene Text Detection [CVPR 2019] [paper]
  • A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning [ACMMM 2019] [paper]
  • Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network [ICCV 2019] [paper]
  • Towards Unconstrained End-to-End Text Spotting [ICCV 2019] [paper]
  • TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting [paper]
  • Convolutional Character Networks [ICCV 2019] [paper]

Weakly supervised Scene Text Detection & Recognition

2017

  • Attention-Based Extraction of Structured Information from Street View Imagery [ICDAR 2017] [paper]
  • WeText: Scene Text Detection under Weak Supervision [ICCV 2017] [paper]
  • SEE: Towards Semi-Supervised End-to-End Scene Text Recognition [AAAI 2018] [paper]

Scene Text Recognition

2014

2015

2016

2017

2018

  • Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition [AAAI 2018] [paper]
  • SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network [AAAI 2018] [paper]
  • Edit Probability for Scene Text Recognition [CVPR 2018] [paper]
  • ASTER: An Attentional Scene Text Recognizer with Flexible Rectification [TPAMI 2018] [paper]
  • Synthetically Supervised Feature Learning for Scene Text Recognition [ECCV 2018] [paper]
  • Scene Text Recognition from Two-Dimensional Perspective [AAAI 2019] [paper]
  • ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification [CVPR 2019] [paper]

2019

Script Identification

Other scene text related papers

2016

2019

  • Scene Text Synthesis for Efficient and Effective Deep Network Training [ArXiv] [paper]

Scene text survey

2018

  • Scene Text Detection and Recognition: The Deep Learning Era [ArXiv] [paper]

2019

  • Scene text detection and recognition with advances in deep learning: a survey [IJDAR 2019] [paper]
Owner
Jan Zdenek
Jan Zdenek
Isearch (OSINT) 🔎 Face recognition reverse image search on Instagram profile feed photos.

isearch is an OSINT tool on Instagram. Offers a face recognition reverse image search on Instagram profile feed photos.

Malek salem 20 Oct 25, 2022
This is a real life mario project using python and mediapipe

real-life-mario This is a real life mario project using python and mediapipe How to run to run this just run - realMario.py file requirements This req

Programminghut 42 Dec 22, 2022
Face Recognizer using Opencv Python

Face Recognizer using Opencv Python The first step create your own dataset with file open-cv-create_dataset second step You can put the photo accordin

Han Izza 2 Nov 16, 2021
Text page dewarping using a "cubic sheet" model

page_dewarp Page dewarping and thresholding using a "cubic sheet" model - see full writeup at https://mzucker.github.io/2016/08/15/page-dewarping.html

Matt Zucker 1.2k Dec 29, 2022
Programa que viabiliza a OCR (Optical Character Reading - leitura óptica de caracteres) de um PDF.

Este programa tem o intuito de ser um modificador de arquivos PDF. Os arquivos PDFs podem ser 3: PDFs verdadeiros - em que podem ser selecionados o ti

Daniel Soares Saldanha 2 Oct 11, 2021
Rubik's Cube in pygame with OpenGL

Rubik Rubik's Cube in pygame with OpenGL The script show on the screen a Rubik Cube buit with OpenGL. Then I have also implemented all the possible mo

Gabro 2 Apr 15, 2022
Image processing using OpenCv

Image processing using OpenCv Write a program that opens the webcam, and the user selects one of the following on the video: ✅ If the user presses the

M.Najafi 4 Feb 18, 2022
It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

OCR-Tool It is a image ocr tool made in Python using the Tesseract-OCR engine with the pytesseract package and has a GUI. This is my second ever pytho

Khant Htet Aung 4 Jul 11, 2022
Natural language detection

Detect the language of text. What’s so cool about franc? franc can support more languages(†) than any other library franc is packaged with support for

Titus 3.8k Jan 02, 2023
Simple SDF mesh generation in Python

Generate 3D meshes based on SDFs (signed distance functions) with a dirt simple Python API.

Michael Fogleman 1.1k Jan 08, 2023
A real-time dolly zoom camera effect

Dolly-Zoom I've always been amazed by the gradual perspective change of dolly zoom, and I have some experience in python and OpenCV, so I decided to c

Dylan Kai Lau 52 Dec 08, 2022
轻量级公式 OCR 小工具:一键识别各类公式图片,并转换为 LaTeX 格式

QC-Formula | 青尘公式 OCR 介绍 轻量级开源公式 OCR 小工具:一键识别公式图片,并转换为 LaTeX 格式。 支持从 电脑本地 导入公式图片;(后续版本将支持直接从网页导入图片) 公式图片支持 .png / .jpg / .bmp,大小为 4M 以内均可; 支持印刷体及手写体,前

青尘工作室 26 Jan 07, 2023
Textboxes implementation with Tensorflow (python)

tb_tensorflow A python implementation of TextBoxes Dependencies TensorFlow r1.0 OpenCV2 Code from Chaoyue Wang 03/09/2017 Update: 1.Debugging optimize

Jayne Shin (신재인) 20 May 31, 2019
Scene text recognition

AttentionOCR for Arbitrary-Shaped Scene Text Recognition Introduction This is the ranked No.1 tensorflow based scene text spotting algorithm on ICDAR2

777 Jan 09, 2023
A python program to block out your face

Readme This is a small program I threw together in about 6 hours to block out your face. It probably doesn't work very well, so be warned. By default,

1 Oct 17, 2021
Brief idea about our project is mentioned in project presentation file.

Brief idea about our project is mentioned in project presentation file. You just have to run attendance.py file in your suitable IDE but we prefer jupyter lab.

Dhruv ;-) 3 Mar 20, 2022
A Python script to capture images from multiple webcams at once and save them into your local machine

Capturing multiple images at once from Webcam Using OpenCV Capture multiple image by accessing the webcam of your system and save it to your machine.

Fazal ur Rehman 2 Apr 16, 2022
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

gosseract OCR Golang OCR package, by using Tesseract C++ library. OCR Server Do you just want OCR server, or see the working example of this package?

Hiromu OCHIAI 1.9k Dec 28, 2022
[BMVC'21] Official PyTorch Implementation of Grounded Situation Recognition with Transformers

Grounded Situation Recognition with Transformers Paper | Model Checkpoint This is the official PyTorch implementation of Grounded Situation Recognitio

Junhyeong Cho 18 Jul 19, 2022
A Python wrapper for the tesseract-ocr API

tesserocr A simple, Pillow-friendly, wrapper around the tesseract-ocr API for Optical Character Recognition (OCR). tesserocr integrates directly with

Fayez 1.7k Dec 31, 2022