A Number Recognition algorithm

Last update: Nov 12, 2021

Paddle-VisualAttention

Results_Compared

Methods	Steps	GPU	Batch Size	Learning Rate	Patience	Decay Step	Decay Rate	Training Speed (FPS)	Accuracy
PaddlePaddle_SVHNClassifier	54000	GTX 1080 Ti	1024	0.01	100	625	0.9	~1700	95.65%
Pytorch_SVHNClassifier	54000	GTX 1080 Ti	512	0.16	100	625	0.9	~1700	95.65%
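
The Decay Step and Decay Rate columns suggest a step-wise exponential learning-rate schedule. The sketch below assumes a staircase schedule, where the rate is multiplied by the decay rate once every decay-step training steps. The repository's actual scheduler is not shown in this document, so the form of the schedule is an assumption and the function name `decayed_learning_rate` is hypothetical.

```python
def decayed_learning_rate(initial_lr, step, decay_steps, decay_rate):
    """Staircase exponential decay (assumed form, not confirmed by the source)."""
    # Number of completed decay intervals at this training step.
    num_decays = step // decay_steps
    return initial_lr * (decay_rate ** num_decays)


if __name__ == "__main__":
    # Hyperparameters taken from the PaddlePaddle row of the table above.
    for step in (0, 625, 6250, 54000):
        lr = decayed_learning_rate(0.01, step, decay_steps=625, decay_rate=0.9)
        print(f"step {step:>6}: lr = {lr:.6g}")
```

Under this assumption, 54000 steps with a decay step of 625 correspond to 86 completed decays, so the final rate would be the initial rate times 0.9 to the power 86.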

Introduction

The main idea of this exercise is to study how the state of the art and the main lines of work on visual attention models have evolved. Two datasets are studied: augmented MNIST and SVHN. The former focuses on a canonical problem, handwritten digit recognition, but with added clutter and translation; the latter focuses on a real-world problem, street view house number (SVHN) transcription. In this exercise, the relevant papers are studied to build good intuition for choosing a suitable model for each of these challenges.

For more detail, please refer to this blog

Recommended environment

Python 3.6+
paddlepaddle-gpu 2.0.2
nccl 2.0+
editdistance
visdom
h5py
protobuf
lmdb
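
Before training, it can help to confirm that the packages listed above are importable. The following is a minimal sketch using only the standard library. The import names are assumptions about how each package is exposed (`paddle` for paddlepaddle-gpu, `google.protobuf` for protobuf), and `find_missing` is a hypothetical helper, not part of this repository.

```python
import importlib.util

# Assumed import names for the packages listed above.
REQUIRED_MODULES = [
    "paddle",           # paddlepaddle-gpu
    "editdistance",
    "visdom",
    "h5py",
    "google.protobuf",  # protobuf
    "lmdb",
]


def find_missing(modules):
    """Return the module names from `modules` that cannot be located."""
    missing = []
    for name in modules:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised, for example, when a parent package such as "google" is absent.
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

This only checks that a module can be located. It does not verify versions, CUDA support, or NCCL.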

Install

Install env

Install paddle following the official tutorial.

pip install visdom
pip install h5py
pip install protobuf
pip install lmdb

Dataset

Download SVHN Dataset format 1

Extract it into the data folder. Your folder structure should now look like this:

SVHNClassifier
    - data
        - extra
            - 1.png 
            - 2.png
            - ...
            - digitStruct.mat
        - test
            - 1.png 
            - 2.png
            - ...
            - digitStruct.mat
        - train
            - 1.png 
            - 2.png
            - ...
            - digitStruct.mat
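
To catch extraction mistakes early, the layout above can be checked programmatically. This is a minimal sketch using only the standard library. `check_svhn_layout` is a hypothetical helper, not a script shipped with this repository, and it only verifies that each split folder exists, contains `digitStruct.mat`, and holds at least one `.png` file.

```python
from pathlib import Path

# Split folders expected under the data directory, as shown in the layout above.
SPLITS = ("extra", "test", "train")


def check_svhn_layout(data_dir):
    """Return a list of problems found under data_dir (empty if none)."""
    problems = []
    root = Path(data_dir)
    for split in SPLITS:
        split_dir = root / split
        if not split_dir.is_dir():
            problems.append(f"missing folder: {split_dir}")
            continue
        annotation = split_dir / "digitStruct.mat"
        if not annotation.is_file():
            problems.append(f"missing annotation file: {annotation}")
        if not any(split_dir.glob("*.png")):
            problems.append(f"no .png images in: {split_dir}")
    return problems


if __name__ == "__main__":
    issues = check_svhn_layout("./data")
    print("Layout looks fine." if not issues else "\n".join(issues))
```

Running it from the `SVHNClassifier` folder before the LMDB conversion step reports any missing folder or annotation file.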

Usage

(Optional) Take a glance at original images with bounding boxes
Open `draw_bbox.ipynb` in Jupyter

Convert to LMDB format

$ python convert_to_lmdb.py --data_dir ./data

(Optional) Test for reading LMDBs

Open `read_lmdb_sample.ipynb` in Jupyter

Train

$ python train.py --data_dir ./data --logdir ./logs

Retrain if needed

$ python train.py --data_dir ./data --logdir ./logs_retrain --restore_checkpoint ./logs/model-100.pth

Evaluate

$ python eval.py --data_dir ./data ./logs/model-100.pth

Visualize

$ python -m visdom.server
$ python visualize.py --logdir ./logs

Infer

$ python infer.py --checkpoint=./logs/model-100.pth ./images/test1.png

Clean

$ rm -rf ./logs
or
$ rm -rf ./logs_retrain
