Contextual Attention Localization for Offline Handwritten Text Recognition

Last update: Feb 17, 2022

Related tags

Overview

CALText

This repository contains the source code for CALText model introduced in "CALText: Contextual Attention Localization for Offline Handwritten Text" paper. The details of this model are presented in: (Add paper link)

Samples of the datasets that were used to train and test the model can be found at: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

The code in this model was based on the work of:

https://github.com/JianshuZhang/WAP.

https://github.com/wwjwhen/Watch-Attend-and-Parse-tensorflow-version.

Requirements

Python 3 Tensorflow v1.6

Usage

Upload data files into your Colab account, create pickle files (train, valid, and test images and labels) from the dataset. You can place the pickle dataset files at any folder of your preference but change the path settings in the code where this data is being loaded.

Run "makepickle.ipynb" to create pickle files for train and test data. Further distribute the train pickle file into train and valid pickle files by using last 907 images and labels of train as valid.

For training, set mode="train", and run "CALText.ipynb".

For testing, set mode="test", and run "CALText.ipynb".

For Contextual Attention, set alpha_reg=0, while training and testing.

For Contextual Attention Localization, set alpha_reg=1, while training and testing.

Run on Python Compiler

To run the code on python compiler, copy the code and make file as "makepickle.py" and "CALText.py". Use following commands to run code files.

python makepickle.py

python CALText.py

Run on Google Colab

Open "makepickle.ipynb" and "CALText.ipynb" notebook in Google Colab Notebook, and run.

Run "%tensorflow_version 1.x" command at colab notebook before running of "CALText.ipynb".

Change runtime to GPU or TPU for better performance.

Add these lines in notebook for accessing data from google derive:

from google.colab import drive

drive.mount("/gdrive", force_remount=True)

References

PUCIT Offline Handwritten Urdu Lines (PUCIT-OHUL) Dataset: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

Previous Work:

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/index.html

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/ICFHR2020_manuscript.pdf

Contextual Attention Localization for Offline Handwritten Text Recognition

Related tags

Overview

CALText

Requirements

Usage

Run on Python Compiler

Run on Google Colab

References

Owner

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

AFLNet: A Greybox Fuzzer for Network Protocols

BMN: Boundary-Matching Network

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

📚 A collection of Jupyter notebooks for learning and experimenting with OpenVINO 👓

A Joint Video and Image Encoder for End-to-End Retrieval

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

Radar-to-Lidar: Heterogeneous Place Recognition via Joint Learning

iNAS: Integral NAS for Device-Aware Salient Object Detection

Using this you can control your PC/Laptop volume by Hand Gestures (pinch-in, pinch-out) created with Python.

Systematic generalisation with group invariant predictions

Composing methods for ML training efficiency

Code for ACL 2019 Paper: "COMET: Commonsense Transformers for Automatic Knowledge Graph Construction"

[IROS'21] SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning

Model Zoo for MindSpore

An Api for Emotion recognition.

Codebase for INVASE: Instance-wise Variable Selection - 2019 ICLR

A curated list of neural network pruning resources.

Contextual Attention Localization for Offline Handwritten Text Recognition

Related tags

Overview

CALText

Requirements

Usage

Run on Python Compiler

Run on Google Colab

References

Owner

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

AFLNet: A Greybox Fuzzer for Network Protocols

BMN: Boundary-Matching Network

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

📚 A collection of Jupyter notebooks for learning and experimenting with OpenVINO 👓

A Joint Video and Image Encoder for End-to-End Retrieval

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

Radar-to-Lidar: Heterogeneous Place Recognition via Joint Learning

iNAS: Integral NAS for Device-Aware Salient Object Detection

Using this you can control your PC/Laptop volume by Hand Gestures (pinch-in, pinch-out) created with Python.

Systematic generalisation with group invariant predictions

Composing methods for ML training efficiency

Code for ACL 2019 Paper: "COMET: Commonsense Transformers for Automatic Knowledge Graph Construction"

[IROS'21] SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning

Model Zoo for MindSpore

An Api for Emotion recognition.

Codebase for INVASE: Instance-wise Variable Selection - 2019 ICLR

A curated list of neural network pruning resources.

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,