Textboxes : Image Text Detection Model : python package (tensorflow)

Last update: Dec 15, 2022

Overview

shinTB

Abstract

A python package for use Textboxes : Image Text Detection Model

implemented by tensorflow, cv2

Textboxes Paper Review in Korean (My Blog) : shinjayne.github.io/textboxes

shintb : useable textboxes python package (Source codes are in here)

svt1 : Street view Text dataset. can use with shintb.svt_data_loader.SVTDataLoader when training Textboxes model

config.py : (NECESSARY) configuration of model building and training with shinTB

main.py : simple example useage of shinTB package

Dependancies

python Version: 3.5.3
numpy Version: 1.13.0
tensorflow Version: 1.2.1
cv2

How to use

Clone this repository to your local.
You will use shintb python package and config.py for building and training your own Textboxes model.
svt1 gives us training / test data.
Open new python file.
Import config.config and shintb.

from config import config
from shintb import graph_drawer, default_box_control, svt_data_loader, runner

Initialize GraphDrawer,DefaultBoxControl,SVTDataLoader instance.

graphdrawer = graph_drawer.GraphDrawer(config)

dataloader = svt_data_loader.SVTDataLoader('./svt1/train.xml', './svt1/test.xml')

dbcontrol = default_box_control.DefaultBoxControl(config, graphdrawer)

GraphDrawer instance contains a tensorflow graph of Textboxes.
DefaultboxControl instance contains methods and attributes which is related to default box.
SVTDataLoader instance loads data from svt1.
Initialize Runner instance.

runner = runner.Runner(config, graphdrawer, dataloader, dbcontrol)

Runner uses GraphDrawer,DefaultBoxControl,SVTDataLoader instance.
If you want to train your Textboxes model, use Runner.train(). Every 1000 step, shintb will save ckpt file in the directory you set in config.py.

runner.train()

If you want to validate/test your model, use Runner.test()

runner.test()

After training, if you want to detect texts from one image use Runner.image().

runner.image(<your_image_directory>)

Textboxes : Image Text Detection Model : python package (tensorflow)

Related tags

Overview

shinTB

Abstract

Dependancies

How to use

Owner

Jayne Shin (신재인)

🖺 OCR using tensorflow with attention

QED-C: The Quantum Economic Development Consortium provides these computer programs and software for use in the fields of quantum science and engineering.

第一届西安交通大学人工智能实践大赛（2018AI实践大赛--图片文字识别）第一名；仅采用densenet识别图中文字

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

computer vision, image processing and machine learning on the web browser or node.

Using python libraries to track hands

Bu uygulamada Python ve Opencv kullanarak bilgisayar kamerasından yüz tespiti yapıyoruz.

Code for paper "Role-based network embedding via structural features reconstruction with degree-regularized constraint"

Handwritten Text Recognition (HTR) using TensorFlow 2.x

a micro OCR network with 0.07mb params.

TedEval: A Fair Evaluation Metric for Scene Text Detectors

Framework for the Complete Gaze Tracking Pipeline

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

A webcam-based 3x3x3 rubik's cube solver written in Python 3 and OpenCV.

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

FastOCR is a desktop application for OCR API.

This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, Alan Yuille).

Pure Javascript OCR for more than 100 Languages 📖🎉🖥