Document Layout Analysis Projects

Last update: Dec 08, 2022

Overview

Layout_Analysis

Introduction

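This is an implementation of the Run Length Smoothing Algorithm (RLSA) and the recursive X-Y Cut, two classic document layout analysis methods, built with OpenCV.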
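
To make the first method concrete, here is a minimal sketch of the horizontal pass of run-length smoothing. It is not taken from RLSA.cpp: it works on a plain std::vector instead of a cv::Mat so that it compiles without OpenCV, and the names Row, smoothRow and rlsaHorizontal are hypothetical. It assumes 1 means ink and 0 means background, and it treats background runs that touch the row border like any other run.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// One row of a binary image: 1 = ink (foreground), 0 = background.
using Row = std::vector<int>;

// Horizontal run-length smoothing of a single row.
// Every run of 0s whose length is <= limit is turned into 1s; 1s are kept.
// Assumption: runs touching the row border are handled like interior runs.
Row smoothRow(const Row& in, std::size_t limit) {
    Row out = in;
    std::size_t i = 0;
    while (i < in.size()) {
        assert(in[i] == 0 || in[i] == 1);  // binary input expected
        if (in[i] == 1) { ++i; continue; }
        const std::size_t start = i;
        while (i < in.size() && in[i] == 0) ++i;  // i ends one past the run
        if (i - start <= limit) {
            for (std::size_t k = start; k < i; ++k) out[k] = 1;
        }
    }
    return out;
}

// Apply smoothRow to every row of an image (horizontal pass only).
std::vector<Row> rlsaHorizontal(const std::vector<Row>& img, std::size_t limit) {
    std::vector<Row> out;
    out.reserve(img.size());
    for (const Row& r : img) out.push_back(smoothRow(r, limit));
    return out;
}
```

A full RLSA normally runs the same smoothing along columns as well and combines the two results with a logical AND. The limit is a tuning parameter that depends on scan resolution and font size.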

Dependencies

OpenCV 3.0+

How to use

Compile with g++:

g++ -std=c++11 RLSA.cpp -o RLSA `pkg-config --libs --cflags opencv` -ldl

g++ -std=c++11 X_YCut.cpp -o X_YCut `pkg-config --libs --cflags opencv` -ldl

References:

Wong K Y, Casey R G, Wahl F M. Document analysis system[J]. IBM Journal of Research and Development, 1982, 26(6):647-656.
Ha J, Haralick R M, Phillips I T. Recursive X-Y cut using bounding boxes of connected components[C]// International Conference on Document Analysis and Recognition. IEEE, 1995:952-955 vol.2.
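
The recursive X-Y cut idea can be sketched as follows. This is an illustration under stated assumptions, not the code in X_YCut.cpp: it cuts on pixel projection profiles rather than on bounding boxes of connected components as in Ha et al., it always tries a horizontal cut before a vertical one, and it uses a plain std::vector image so that it compiles without OpenCV. The names Image, Box, profile, trim, widestGap and xyCut are hypothetical, as is the minGap parameter.

```cpp
#include <cassert>
#include <vector>

using Image = std::vector<std::vector<int>>;  // 1 = ink, 0 = background

struct Box { int x0, y0, x1, y1; };  // half-open: [x0, x1) x [y0, y1)

// Projection profile of a box: ink count per row (byRow) or per column.
static std::vector<int> profile(const Image& img, const Box& b, bool byRow) {
    std::vector<int> p(byRow ? b.y1 - b.y0 : b.x1 - b.x0, 0);
    for (int y = b.y0; y < b.y1; ++y)
        for (int x = b.x0; x < b.x1; ++x)
            if (img[y][x]) ++p[byRow ? y - b.y0 : x - b.x0];
    return p;
}

// Find the non-empty span [lo, hi) of a profile; false if it is all zero.
static bool trim(const std::vector<int>& p, int& lo, int& hi) {
    lo = 0;
    hi = static_cast<int>(p.size());
    while (lo < hi && p[lo] == 0) ++lo;
    while (hi > lo && p[hi - 1] == 0) --hi;
    return lo < hi;
}

// Widest run of zeros with length >= minGap; reports it as [s, e).
static bool widestGap(const std::vector<int>& p, int minGap, int& s, int& e) {
    int best = 0;
    const int n = static_cast<int>(p.size());
    for (int i = 0; i < n; ) {
        if (p[i] != 0) { ++i; continue; }
        const int start = i;
        while (i < n && p[i] == 0) ++i;
        if (i - start >= minGap && i - start > best) {
            best = i - start; s = start; e = i;
        }
    }
    return best > 0;
}

// Recursively split box b at empty valleys; uncuttable boxes become leaves.
void xyCut(const Image& img, Box b, int minGap, std::vector<Box>& leaves) {
    assert(minGap >= 1);
    int lo = 0, hi = 0;
    if (!trim(profile(img, b, true), lo, hi)) return;  // no ink in this box
    b.y1 = b.y0 + hi;  b.y0 += lo;                     // drop empty rows
    trim(profile(img, b, false), lo, hi);
    b.x1 = b.x0 + hi;  b.x0 += lo;                     // drop empty columns

    int s = 0, e = 0;
    if (widestGap(profile(img, b, true), minGap, s, e)) {          // horizontal cut
        xyCut(img, {b.x0, b.y0, b.x1, b.y0 + s}, minGap, leaves);
        xyCut(img, {b.x0, b.y0 + e, b.x1, b.y1}, minGap, leaves);
    } else if (widestGap(profile(img, b, false), minGap, s, e)) {  // vertical cut
        xyCut(img, {b.x0, b.y0, b.x0 + s, b.y1}, minGap, leaves);
        xyCut(img, {b.x0 + e, b.y0, b.x1, b.y1}, minGap, leaves);
    } else {
        leaves.push_back(b);
    }
}
```

Calling xyCut with the full page as the starting box fills leaves with tight rectangles around blocks separated by at least minGap empty rows or columns. Recomputing the profiles at every level keeps the sketch short; a real implementation would more likely reuse them or work on component bounding boxes.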

Convolution with a Sobel filter, implemented from scratch in Python with OpenCV.

2 Nov 11, 2021

~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.

cosc428-structor I had an open-ended Computer Vision assignment to complete, and an out-of-copyright book that I wanted to turn into an ebook. Convent

45 Dec 06, 2022

This is a GUI for scraping PDFs with the help of optical character recognition, making it easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that don't imp

75 Oct 21, 2022

GDB Python tool to pretty-print and debug C++ xtensor containers

gdb_xt2np GDB Python tool to pretty-print, examine, and debug C++ xtensor containers. xtensor is a C++ library for scientific computing using multidim

4 Oct 29, 2021

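A camera calibration sample using OpenCV. As of 2021/06/21, it covers the three calibration types that have a Python implementation: standard cameras, fisheye lenses (fisheye module), and omnidirectional cameras (omnidir module).

OpenCV-CameraCalibration-Example FishEyeCameraCalibration.mp4 A camera calibration sample using OpenCV. As of 2021/06/21, it covers the following three types that have a Python implementation: standard cameras, fisheye lenses (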

34 Nov 17, 2022

An implementation of the SegLink algorithm from the paper Detecting Oriented Text in Natural Images by Linking Segments

Tips: A more recent scene text detection algorithm: PixelLink, has been implemented here: https://github.com/ZJULearning/pixel_link Contents: Introduc

484 Dec 07, 2022

Detect textlines in document images

Textline Detection Detect textlines in document images Introduction This tool performs border, region and textline detection from document image data

70 Jun 30, 2022

Uses a Convolutional Recurrent Neural Network (CRNN) to recognize handwritten text-line images without pre-segmentation into words or characters. Trained with the CTC loss function.

Handwritten Line Text Recognition using Deep Learning with Tensorflow Description Use Convolutional Recurrent Neural Network to recognize the Handwrit

224 Jan 07, 2023

A selectional auto-encoder approach for document image binarization

The code of this repository was used for the following publication. If you find this code useful please cite our paper: @article{Gallego2019, title =

89 Nov 18, 2022

Links to awesome OCR projects

Awesome OCR This list contains links to great software tools and libraries and literature related to Optical Character Recognition (OCR). Contribution

2.2k Jan 02, 2023

An implementation of the algorithm from the paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

InceptText-Tensorflow An implementation of the algorithm from the paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Orien

115 Dec 12, 2022

FastOCR is a desktop application for OCR API.

OCR system for Arabic language that converts images of typed text to machine-encoded text.

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

An interactive interface for using OpenCV's GrabCut algorithm for image segmentation.

Pre-Recognize Library - library with algorithms for improving OCR quality.

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

Image Recognition Model Generator

Morphological edge detection (object boundary detection) using erosion and dilation in OpenCV Python

A webcam-based 3x3x3 Rubik's Cube solver written in Python 3 and OpenCV.