Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

Last update: Dec 30, 2022

Related tags

Computer Vision TableNet

Overview

TableNet

Unofficial implementation of the ICDAR 2019 paper: TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images.

Overview

Paper: TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

TableNet is a deep learning architecture proposed by a team from TCS Research in 2019. The main motivation was to extract information from tables in documents captured with mobile phones or cameras.

Their solution first detects the tabular region within an image, then detects and extracts information from the rows and columns of the detected table.

Architecture: The architecture is based on the work of Long et al., an encoder-decoder model for semantic segmentation. The same encoder/decoder network is used as the FCN architecture for table extraction. The images are preprocessed and modified using Tesseract OCR.

Source: Nanonets
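
Once a segmentation model has produced table and column masks, the detected regions still have to be turned into coordinates before any text can be extracted. The following is a minimal sketch of that step, not the code used in this repository. It assumes the masks are plain 2D lists of per-pixel probabilities; the function names `mask_bbox` and `column_spans` and the 0.5 threshold are hypothetical choices made for illustration.

```python
def mask_bbox(mask, threshold=0.5):
    """Return (xmin, ymin, xmax, ymax) around all pixels above threshold.

    xmax and ymax are exclusive. Returns None when no pixel qualifies.
    Assumes mask is a non-empty 2D list of floats (an illustrative format).
    """
    ys = [y for y, row in enumerate(mask) if any(v > threshold for v in row)]
    if not ys:
        return None
    width = len(mask[0])
    xs = [x for x in range(width) if any(row[x] > threshold for row in mask)]
    return xs[0], ys[0], xs[-1] + 1, ys[-1] + 1


def column_spans(column_mask, threshold=0.5):
    """Return [(x_start, x_end), ...] for each horizontal run of column pixels."""
    width = len(column_mask[0])
    # An x position is occupied if any row has a column pixel there.
    occupied = [any(row[x] > threshold for row in column_mask) for x in range(width)]
    spans, start = [], None
    for x, on in enumerate(occupied):
        if on and start is None:
            start = x                    # a new run begins
        elif not on and start is not None:
            spans.append((start, x))     # the run ended just before x
            start = None
    if start is not None:
        spans.append((start, width))     # run touches the right edge
    return spans


# Toy 4x6 predictions standing in for real model output.
table_probs = [
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.9, 0.9, 0.9, 0.9, 0.0],
    [0.0, 0.9, 0.9, 0.9, 0.9, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
]
column_probs = [
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.8, 0.8, 0.0, 0.7, 0.0],
    [0.0, 0.8, 0.8, 0.0, 0.7, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
]

table_box = mask_bbox(table_probs)
columns = column_spans(column_probs)
print(table_box, columns)
```

In a real pipeline the resulting box and column spans would be used to crop the page image, and each crop could then be passed to an OCR engine such as Tesseract.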

How to run

pip install -r requirements.txt

Download the Marmot Dataset from the link in the Dataset section.
Run data_preprocess/generate_mask.py to generate the table and column masks for the corresponding images.
Follow the TableNet.ipynb notebook to train and test the model.
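
The mask-generation step can be pictured with a small sketch. It does not reproduce data_preprocess/generate_mask.py, whose internals are not described here. It assumes, for illustration only, that each page has a Pascal VOC style XML annotation listing column bounding boxes, that the page holds a single table, and that the table region is the rectangle enclosing all columns. The sample XML and the helper names `read_boxes`, `fill_boxes` and `make_masks` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation for an 8x6 page with two column boxes.
SAMPLE_XML = """
<annotation>
  <size><width>8</width><height>6</height></size>
  <object><name>column</name>
    <bndbox><xmin>1</xmin><ymin>1</ymin><xmax>3</xmax><ymax>5</ymax></bndbox>
  </object>
  <object><name>column</name>
    <bndbox><xmin>4</xmin><ymin>1</ymin><xmax>7</xmax><ymax>5</ymax></bndbox>
  </object>
</annotation>
"""


def read_boxes(xml_text):
    """Parse image size and (xmin, ymin, xmax, ymax) boxes from the XML text."""
    root = ET.fromstring(xml_text.strip())
    size = root.find("size")
    width = int(size.find("width").text)
    height = int(size.find("height").text)
    boxes = []
    for obj in root.iter("object"):
        bb = obj.find("bndbox")
        boxes.append(tuple(int(bb.find(k).text)
                           for k in ("xmin", "ymin", "xmax", "ymax")))
    return width, height, boxes


def fill_boxes(width, height, boxes):
    """Return a height x width mask with 255 inside every box and 0 elsewhere."""
    mask = [[0] * width for _ in range(height)]
    for xmin, ymin, xmax, ymax in boxes:
        # Clamp to the image so out-of-range boxes cannot raise IndexError.
        for y in range(max(ymin, 0), min(ymax, height)):
            for x in range(max(xmin, 0), min(xmax, width)):
                mask[y][x] = 255
    return mask


def make_masks(xml_text):
    """Build (table_mask, column_mask) from column boxes.

    Assumption: one table per page, spanning the rectangle around all columns.
    """
    width, height, cols = read_boxes(xml_text)
    column_mask = fill_boxes(width, height, cols)
    table_boxes = []
    if cols:
        table_boxes.append((min(b[0] for b in cols), min(b[1] for b in cols),
                            max(b[2] for b in cols), max(b[3] for b in cols)))
    table_mask = fill_boxes(width, height, table_boxes)
    return table_mask, column_mask


table_mask, column_mask = make_masks(SAMPLE_XML)
for row in column_mask:
    print("".join("#" if v else "." for v in row))
```

The gap between the two columns stays empty in the column mask but is filled in the table mask, which is what lets a model learn the two targets separately. With real page images the masks would normally be stored as image arrays rather than nested lists.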

Challenges

Requires a capable system with a good GPU for accurate results on high-resolution images.

Dataset

Download the dataset provided in the paper: Marmot Dataset.

Owner

Jainam Shah

~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.

Ocular is a state-of-the-art historical OCR system.

Go package for OCR (Optical Character Recognition), using the Tesseract C++ library

Automatically remove the mosaics in images and videos, or add mosaics to them.

An implementation of the algorithm in the paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Corner-based Region Proposal Network

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

Handwritten Text Recognition (HTR) using TensorFlow 2.x

Python Computer Vision Aim Bot for Roblox's Phantom Forces

🔎 Like Chardet. 🚀 Package for encoding & language detection. Charset detection.

Framework for the Complete Gaze Tracking Pipeline

Smart computer vision application

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)

SemTorch

Train custom VR face tracking parameters

Document Image Dewarping

Slice a single image into multiple pieces and create a dataset from them

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Line based ATR Engine based on OCRopy

Creating virtual elements of a graphical interface using OpenCV and MediaPipe.