End-to-end pipeline for real-time scene text detection and recognition.

Last update: Aug 04, 2022

Overview

Real-time-Scene-Text-Detection-and-Recognition-System

The detection model is EAST, and the recognition model is CRNN.
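As a rough illustration of how a detect-then-recognize pipeline fits together, the sketch below runs a detector over a frame, crops each detected region, and passes the crop to a recognizer. It is not this repository's code: `read_text`, `crop`, `fake_detect`, and `fake_recognize` are hypothetical names, the stand-in functions only take the place of the EAST and CRNN models, and the axis-aligned box format is a simplifying assumption.

```python
from typing import Callable, List, Tuple

# Hypothetical formats chosen for this sketch only.
Box = Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max), axis-aligned
Image = List[List[int]]           # grayscale frame as rows of pixel values


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of `image`."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def read_text(
    image: Image,
    detect: Callable[[Image], List[Box]],
    recognize: Callable[[Image], str],
) -> List[Tuple[Box, str]]:
    """Stage 1 finds text regions, stage 2 reads each cropped region."""
    results = []
    for box in detect(image):
        patch = crop(image, box)
        results.append((box, recognize(patch)))
    return results


# Stand-ins for the real models (assumptions, not the EAST or CRNN APIs).
def fake_detect(image: Image) -> List[Box]:
    return [(0, 0, 2, 2), (2, 0, 4, 2)]


def fake_recognize(patch: Image) -> str:
    return "A" if patch[0][0] > 0 else "B"


frame = [[1, 1, 0, 0],
         [1, 1, 0, 0]]
results = read_text(frame, fake_detect, fake_recognize)
for box, text in results:
    print(box, text)
```

In a real-time setting the same loop would run once per captured frame, with the two callables wrapping the loaded models.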

Download the trained EAST model from https://drive.google.com/file/d/0B3APw5BZJ67ETHNPaU9xUkVoV0U/view and put it in EAST/result.

Download the trained CRNN model from https://www.dropbox.com/s/dboqjk20qjkpta3/crnn.pth?dl=0 and put it in crnn/samples.
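A small check like the following can confirm the downloaded weights are where the instructions above say to put them. It is a sketch under assumptions: `missing_weights` is a hypothetical helper, the file name `crnn.pth` is inferred from the download URL, and because the EAST file name is not stated, the check only verifies that EAST/result is not empty.

```python
from pathlib import Path
from typing import List


def missing_weights(repo_root: str) -> List[str]:
    """Return a message for each expected model location that is missing."""
    root = Path(repo_root)
    problems = []

    # The EAST file name is not given, so only check the directory has content.
    east_dir = root / "EAST" / "result"
    if not east_dir.is_dir() or not any(east_dir.iterdir()):
        problems.append(f"no EAST model found in {east_dir}")

    # File name assumed from the Dropbox link (crnn.pth).
    crnn_file = root / "crnn" / "samples" / "crnn.pth"
    if not crnn_file.is_file():
        problems.append(f"CRNN model not found at {crnn_file}")

    return problems


# Run from the repository root; prints nothing when both models are in place.
for problem in missing_weights("."):
    print(problem)
```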

Video

Here is the full video of the real time system: https://youtu.be/1zvteqaU4h4

Owner

Fangneng Zhan

Computer Vision, Deep Learning.

GitHub Repository

Page to PAGE Layout Analysis Tool

P2PaLA Page to PAGE Layout Analysis (P2PaLA) is a toolkit for Document Layout Analysis based on Neural Networks. 💥 Try our new DEMO for online baseli

180 Nov 24, 2022

Scene text recognition

AttentionOCR for Arbitrary-Shaped Scene Text Recognition Introduction This is the ranked No.1 tensorflow based scene text spotting algorithm on ICDAR2

777 Jan 09, 2023

✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.

Hand Gesture Volume Controller ✋ Hand recognition 👆 Finger recognition 🔊 you can decrease and increase volume Demo Code Firstly I have created a Mod

19 Nov 17, 2022

list all open dataset about ocr.

ocr-open-dataset list all open dataset about ocr. printed dataset year Born-Digital Images (Web and Email) 2011-2015 COCO-Text 2017 Text Extraction fr

95 Nov 24, 2022

Deskewing images with slanted content

skew_correction De-skewing images with slanted content by finding the deviation using Canny Edge Detection. To Run: In python 3.6, from deskew import

13 Aug 27, 2022

Vietnamese Language Detection and Recognition

Table of Contents Introduction (written by Khôi) Dataset (just change the link to our 3.5k images) Getting Started (written by An) Requirements Usage Example Training & E

6 May 27, 2022

OCR system for Arabic language that converts images of typed text to machine-encoded text.

Arabic OCR OCR system for Arabic language that converts images of typed text to machine-encoded text. The system currently supports only letters (29 l

144 Jan 05, 2023

Assignment work with webcam

work with webcam : Press key 1 to use emojy on your face Press key 2 to use lip and eye on your face Press key 3 to checkered your face Press key 4 to

2 May 31, 2022

Creating of virtual elements of the graphical interface using opencv and mediapipe.

Virtual GUI Creating of virtual elements of the graphical interface using opencv and mediapipe. Element GUI Output Description Button By default the b

4 Jun 16, 2022

Corner-based Region Proposal Network

Corner-based Region Proposal Network CRPN is a two-stage detection framework for multi-oriented scene text. It employs corners to estimate the possibl

140 Nov 04, 2022

Automatically resolve RidderMaster based on TensorFlow & OpenCV

AutoRiddleMaster Automatically resolve RidderMaster based on TensorFlow & OpenCV (a fully automated solver for the RiddleMaster pony puzzle, implemented with TensorFlow and OpenCV) Demo How to use Deploy the ser

5 Nov 19, 2021

Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts

LayoutAnalysisEvaluator Layout Analysis Evaluator for: ICDAR 2019 Historical Document Reading Challenge on Large Structured Chinese Family Records ICD

17 Dec 08, 2022

A Python wrapper for Google Tesseract

Python Tesseract Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded i

4.6k Jan 06, 2023

Camelot: PDF Table Extraction for Humans

Camelot: PDF Table Extraction for Humans Camelot is a Python library that makes it easy for anyone to extract tables from PDF files! Note: You can als

3.3k Dec 31, 2022

Detect and fix skew in images containing text

Alyn Skew detection and correction in images containing text Image with skew Image after deskew Install and use via pip! Recommended way(using virtual

230 Dec 21, 2022

A tool for extracting text from scanned documents (via OCR), with user-defined post-processing.

The project is based on older versions of tesseract and other tools, and is now superseded by another project which allows for more granular control o

32 Jul 24, 2022

Virtual Zoom Gesture using OpenCV

Virtual_Zoom_Gesture I have created a virtual zoom gesture where we can Zoom in and Zoom out any image and even we can move that image anywhere on the

2 Dec 26, 2021

computer vision, image processing and machine learning on the web browser or node.

Image processing and Machine learning labs computer vision, image processing and machine learning on the web browser or node note Fast Fourier Trans

487 Nov 11, 2022

Rubik's Cube in pygame with OpenGL

Rubik Rubik's Cube in pygame with OpenGL The script shows on the screen a Rubik's Cube built with OpenGL. Then I have also implemented all the possible mo

2 Apr 15, 2022

Basic functions manipulating images using the OpenCV library

OpenCV Basic functions manipulating images using the OpenCV library. Reading Ima

3 Feb 17, 2022
