Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Last update: Dec 06, 2022

Related tags

Overview

This is the official implementation of "Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation".

For more details, please refer to our paper.

Citing the paper

Please cite the paper in your publications if it helps your research:

@inproceedings{lyu2018multi,
      title={Multi-oriented scene text detection via corner localization and region segmentation},
      author={Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang},
      booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      pages={7553--7563},
      year={2018}
}

Requirements
Installation
Models
Test
Train
License

Requirements

NVIDIA GPU, Ubuntu 14.04, Python2.7, CUDA8/9
PyTorch 0.2.0_3

Installation

git clone https://github.com/lvpengyuan/corner.git
sh ./make.sh   or  cd rpsroi_pooling && python build.py

Models

Download the model and place it in weights/

Our trained model: Google Drive;

Test

You can test a model in a single scale:

python eval_all.py

or in multi-scale:

python eval_multiscale.py

Note that, you should modify the model path and the test dataset before testing.

Train

python train.py

To train a new model, you should modify the training settings before training.

License

This code is only for academic purpose.

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Related tags

Overview

Citing the paper

Contents

Requirements

Installation

Models

Test

Train

License

Owner

Pengyuan Lyu

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

A real-time dolly zoom camera effect

Table recognition inside douments using neural networks

A set of workflows for corpus building through OCR, post-correction and normalisation

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Fusion 360 Add-in that creates a pair of toothed curves that can be used to split a body and create two pieces that slide and lock together.

Document Layout Analysis Projects

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約

Generate a list of papers with publicly available source code in the daily arxiv

Histogram specification using openCV in python .

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

Slice a single image into multiple pieces and create a dataset from them

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

Color Picker and Color Detection tool for METR4202

7th place solution

Corner-based Region Proposal Network

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

Lightning Fast Language Prediction 🚀

This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet deep learning frameworks on the IAM Dataset.

A Python wrapper for the tesseract-ocr API

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Related tags

Overview

Citing the paper

Contents

Requirements

Installation

Models

Test

Train

License

Owner

Pengyuan Lyu

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

A real-time dolly zoom camera effect

Table recognition inside douments using neural networks

A set of workflows for corpus building through OCR, post-correction and normalisation

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Fusion 360 Add-in that creates a pair of toothed curves that can be used to split a body and create two pieces that slide and lock together.

Document Layout Analysis Projects

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約

Generate a list of papers with publicly available source code in the daily arxiv

Histogram specification using openCV in python .

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

Slice a single image into multiple pieces and create a dataset from them

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

Color Picker and Color Detection tool for METR4202

7th place solution

Corner-based Region Proposal Network

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

Lightning Fast Language Prediction 🚀

This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet deep learning frameworks on the IAM Dataset.

A Python wrapper for the tesseract-ocr API

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約