The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes"

Last update: Nov 21, 2022

Related tags

Computer Vision masktextspotter.caffe2

Overview

Mask TextSpotter

A Pytorch implementation of Mask TextSpotter along with its extension can be find here

Introduction

This is the official implementation of Mask TextSpotter.

Mask TextSpotter is an End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes

For more details, please refer to our paper.

Citing the paper

Please cite the paper in your publications if it helps your research:

@inproceedings{LyuLYWB18,
  author    = {Pengyuan Lyu and
               Minghui Liao and
               Cong Yao and
               Wenhao Wu and
               Xiang Bai},
  title     = {Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes},
  booktitle = {Proc. ECCV},
  pages     = {71--88},
  year      = {2018}
}

Requirements
Installation
Models
Datasets
Test
Train

Requirements

NVIDIA GPU, Linux, Python2
Caffe2, various standard Python packages

Installation

Caffe2

To install Caffe2 with CUDA support, follow the installation instructions from the Caffe2 website. If you already have Caffe2 installed, make sure to update your Caffe2 to a version that includes the Detectron module.

Please ensure that your Caffe2 installation was successful before proceeding by running the following commands and checking their output as directed in the comments.

# To check if Caffe2 build was successful
python2 -c 'from caffe2.python import core' 2>/dev/null && echo "Success" || echo "Failure"

# To check if Caffe2 GPU build was successful
# This must print a number > 0 in order to use Detectron
python2 -c 'from caffe2.python import workspace; print(workspace.NumCudaDevices())'

If the caffe2 Python package is not found, you likely need to adjust your PYTHONPATH environment variable to include its location (/path/to/caffe2/build, where build is the Caffe2 CMake build directory).

Install Python dependencies:

pip install numpy pyyaml matplotlib opencv-python>=3.0 setuptools Cython mock

Set up Python modules:

cd $ROOT_DIR/lib && make

Note: Caffe2 is difficult to install sometimes.

Models

Download the model and place it as models/model_iter79999.pkl Our trained model: Google Drive; BaiduYun (key of BaiduYun: gnpc)

Datasets

Download the ICDAR2013(Google Drive, BaiduYun) and ICDAR2015(Google Drive, BaiduYun) as examples. Datasets should be placed in lib/datasets/data/ as below

synth
icdar2013
icdar2015
scut-eng-char
totaltext

If you do not train the model, you can just download the ICDAR2013 or ICDAR2015 datasets for testing.

Test

python tools/test_net.py --cfg configs/text/mask_textspotter.yaml

You can modify the model path or the test dataset in configs/text/mask_textspotter.yaml.

Train

You should format all the datasets you used for training as above. Then modify configs/text/mask_textspotter.yaml to fit the gpus, model path, and datasets.

python tools/train_net.py --cfg configs/text/mask_textspotter.yaml

The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes"

Related tags

Overview

Mask TextSpotter

A Pytorch implementation of Mask TextSpotter along with its extension can be find here

Introduction

Citing the paper

Contents

Requirements

Installation

Caffe2

Models

Datasets

Test

Train

Owner

Pengyuan Lyu

基于图像识别的开源RPA工具，理论上可以支持所有windows软件和网页的自动化

Isearch (OSINT) 🔎 Face recognition reverse image search on Instagram profile feed photos.

Usando o Amazon Textract como OCR para Extração de Dados no DynamoDB

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

QuanTaichi: A Compiler for Quantized Simulations (SIGGRAPH 2021)

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

Motion Detection Squid Game with OpenCV Python

This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

Automatically resolve RidderMaster based on TensorFlow & OpenCV

Using computer vision method to recognize and calcutate the features of the architecture.

Sort By Face

"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

This can be use to convert text in a file to handwritten text.

An official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

A tool to enhance your old/damaged pictures built using python & opencv.