Detect handwritten words in a text-line (classic image processing method).

Last update: Jan 03, 2023

Overview

Word segmentation

Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is from 1999, the method still achieves good results, is fast, and is easy to implement. The algorithm takes an image of a line as input and outputs the segmented words.

Run demo

Go to the src/ directory and run the script python main.py. The images from the data/ directory (taken from IAM dataset) are segmented into words and the results are saved to the out/ directory.

Documentation

An anisotropic filter kernel is applied to the input image to create blobs corresponding to words. After thresholding the blob-image, connected components are extracted which correspond to words.

Parameters

Most of the parameters of the function wordSegmentation deal with the shape of the filter kernel:

img: grayscale uint8 image of the text-line to be segmented.
kernelSize: size of filter kernel, must be an odd integer.
sigma: standard deviation of Gaussian function used for filter kernel.
theta: approximated width/height ratio of words, filter function is distorted by this factor.
minArea: ignore word candidates smaller than specified area.

The function prepareImg can be used to convert the input image to grayscale and to resize it to a fixed height:

img: input image.
height: image will be resized to fit specified height.

Algorithm

The illustration below shows how the algorithm works:

top left: input image.
top right: filter kernel is applied.
bottom left: blob image after thresholding.
bottom right: bounding boxes around words in original image.

Results

This algorithm gives good results on datasets with large inter-word-distances and small intra-word-distances like IAM. However, for historical datasets like Bentham or Ratsprotokolle results are not very good and more complex approaches should be used instead (e.g., a neural network based approach as implemented in the WordDetectorNN repository).

Detect handwritten words in a text-line (classic image processing method).

Related tags

Overview

Word segmentation

Run demo

Documentation

Parameters

Algorithm

Results

Owner

Harald Scheidl

This is a passport scanning web service to help you scan, identify and validate your passport created with a simple and flexible design and ready to be integrated right into your system!

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Apply different text recognition services to images of handwritten documents.

Fatigue Driving Detection Based on Dlib

OpenCV-Erlang/Elixir bindings

A selectional auto-encoder approach for document image binarization

Handwritten Text Recognition (HTR) system implemented with TensorFlow (TF) and trained on the IAM off-line HTR dataset. This Neural Network (NN) model recognizes the text contained in the images of segmented words.

document image degradation

Image augmentation for machine learning experiments.

A simple component to display annotated text in Streamlit apps.

This is a GUI program which consist of 4 OpenCV projects

Provides OCR (Optical Character Recognition) services through web applications

Volume Control using OpenCV

One Metrics Library to Rule Them All!

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

Geometric Augmentation for Text Image

Run tesseract with the tesserocr bindings with @OCR-D's interfaces

LEARN OPENCV IN 3 HOURS USING PYTHON - INCLUDING EXAMPLE PROJECTS

Connect Aseprite to Blender for painting pixelart textures in real time