TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Last update: Dec 12, 2022

Overview

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

Introduction

The code and trained models of:

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection, TIP 2019 [Paper]

Citation

Please cite the related works in your publications if it helps your research:


@article{xu2018textfield,
  title={TextField: Learning A Deep Direction Field for Irregular Scene Text Detection},
  author={Xu, Yongchao and Wang, Yukang and Zhou, Wei and Wang, Yongpan and Yang, Zhibo and Bai, Xiang},
  journal={arXiv preprint arXiv:1812.01393},
  year={2018}
}

Prerequisite

Caffe and SynthText pretrained model [Link]
Datasets: [Total-Text], [ICDAR2015]
OpenCV 3.4.3
MATLAB

Usage

1. Install Caffe

cp Makefile.config.example Makefile.config
# adjust Makefile.config (for example, enable python layer)
make all -j16
# make sure to include $CAFFE_ROOT/python to your PYTHONPATH.
make pycaffe

Please refer to Caffe Installation to ensure other dependencies.

2. Data and model preparation

# download datasets and pretrained model then
mkdir data && mv [your_dataset_folder] data/
mkdir models && mv [your_pretrained_model] models/

3. Training scripts

# an example on Total-Text dataset
cd examples/TextField/
python train.py --gpu [your_gpu_id] --dataset total --initmodel ../../models/synth_iter_800000.caffemodel

4. Evaluation scripts

# an example on Total-Text dataset
cd evaluation/total/
./eval.sh

Results and Trained Models

Total-Text

Recall	Precision	F-measure	Link
0.816	0.824	0.820	[Google drive]

*lambda=0.50 for post-processing

ICDAR2015

Recall	Precision	F-measure	Link
0.811	0.846	0.828	[Google drive]

*lambda=0.75 for post-processing

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Related tags

Overview

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

Introduction

Citation

Prerequisite

Usage

1. Install Caffe

2. Data and model preparation

3. Training scripts

4. Evaluation scripts

Results and Trained Models

Total-Text

ICDAR2015

Owner

Yukang Wang

Fatigue Driving Detection Based on Dlib

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

Train custom VR face tracking parameters

POT : Python Optimal Transport

This repository contains codes on how to handle mouse event using OpenCV

Table Extraction Tool

Automatically download multiple papers by keywords in CVPR

📷 Face Recognition using Haar-Cascade Classifier, OpenCV, and Python

OCR, Object Detection, Number Plate, Real Time

An easy to use an (hopefully useful) captcha solution for pyTelegramBotAPI

a Deep Learning Framework for Text

FOTS Pytorch Implementation

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

chineseocr/table_line 表格线检测模型pytorch版

Deep learning based page layout analysis

Indonesian ID Card OCR using tesseract OCR

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約

([email protected]) Boosting Co-teaching with Compression Regularization for Label Noise

OCR-D-compliant page segmentation

A machine learning software for extracting information from scholarly documents

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Related tags

Overview

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

Introduction

Citation

Prerequisite

Usage

1. Install Caffe

2. Data and model preparation

3. Training scripts

4. Evaluation scripts

Results and Trained Models

Total-Text

ICDAR2015

Owner

Yukang Wang

Fatigue Driving Detection Based on Dlib

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

Train custom VR face tracking parameters

POT : Python Optimal Transport

This repository contains codes on how to handle mouse event using OpenCV

Table Extraction Tool

Automatically download multiple papers by keywords in CVPR

📷 Face Recognition using Haar-Cascade Classifier, OpenCV, and Python

OCR, Object Detection, Number Plate, Real Time

An easy to use an (hopefully useful) captcha solution for pyTelegramBotAPI

a Deep Learning Framework for Text

FOTS Pytorch Implementation

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

chineseocr/table_line 表格线检测模型pytorch版

Deep learning based page layout analysis

Indonesian ID Card OCR using tesseract OCR

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約

([email protected]) Boosting Co-teaching with Compression Regularization for Label Noise

OCR-D-compliant page segmentation

A machine learning software for extracting information from scholarly documents

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約