a micro OCR network with 0.07mb params.

Last update: Aug 06, 2022

Related tags

Overview

MicroOCR

a micro OCR network with 0.07mb params.

    Layer (type)               Output Shape         Param #

        Conv2d-1            [-1, 64, 8, 32]           3,136
   BatchNorm2d-2            [-1, 64, 8, 32]             128
          GELU-3            [-1, 64, 8, 32]               0
     ConvBNACT-4            [-1, 64, 8, 32]               0
        Conv2d-5            [-1, 64, 8, 32]             640
   BatchNorm2d-6            [-1, 64, 8, 32]             128
          GELU-7            [-1, 64, 8, 32]               0
     ConvBNACT-8            [-1, 64, 8, 32]               0
        Conv2d-9            [-1, 64, 8, 32]           4,160
  BatchNorm2d-10            [-1, 64, 8, 32]             128
         GELU-11            [-1, 64, 8, 32]               0
    ConvBNACT-12            [-1, 64, 8, 32]               0
   MicroBlock-13            [-1, 64, 8, 32]               0
       Conv2d-14            [-1, 64, 8, 32]             640
  BatchNorm2d-15            [-1, 64, 8, 32]             128
         GELU-16            [-1, 64, 8, 32]               0
    ConvBNACT-17            [-1, 64, 8, 32]               0
       Conv2d-18            [-1, 64, 8, 32]           4,160
  BatchNorm2d-19            [-1, 64, 8, 32]             128
         GELU-20            [-1, 64, 8, 32]               0
    ConvBNACT-21            [-1, 64, 8, 32]               0
   MicroBlock-22            [-1, 64, 8, 32]               0
      Flatten-23              [-1, 64, 256]               0
AdaptiveAvgPool1d-24           [-1, 64, 30]               0
       Linear-25               [-1, 30, 60]           3,900

Total params: 17,276
Trainable params: 17,276
Non-trainable params: 0
Input size (MB): 0.05
Forward/backward pass size (MB): 2.90
Params size (MB): 0.07
Estimated Total Size (MB): 3.02

Script Description

MicroOCR
├── README.md                                   # Descriptions about MicroNet
├── collatefn.py                                # collatefn
├── ctc_label_converter.py                      # accuracy metric for MicroNet
├── dataset.py                                  # Data preprocessing for training and evaluation
├── demo.py                                     # demo
├── gen_image.py                                # generate image for train and eval
├── infer_tool.py                               # inference tool
├── keys.py                                     # character
├── loss.py                                     # Ctcloss definition
├── metric.py                                   # accuracy metric for MicroNet
├── model.py                                    # MicroNet
├── train.py                                    # train the model

Generate data for train and eval

python gen_image.py

Training

python train.py

Inference

python demo.py

a micro OCR network with 0.07mb params.

Related tags

Overview

MicroOCR

Script Description

Generate data for train and eval

Training

Inference

Owner

william

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Balabobapy - Using artificial intelligence algorithms to continue the text

Fun program to overlay a mask to yourself using a webcam

A webcam-based 3x3x3 rubik's cube solver written in Python 3 and OpenCV.

Handwritten Character Recognition using CNN

LEARN OPENCV IN 3 HOURS USING PYTHON - INCLUDING EXAMPLE PROJECTS

Detect handwritten words in a text-line (classic image processing method).

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

deployment of a hybrid model for automatic weapon detection/ anomaly detection for surveillance applications

Comparison-of-OCR (KerasOCR, PyTesseract,EasyOCR)

A synthetic data generator for text recognition

✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.

document image degradation

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

pulse2percept: A Python-based simulation framework for bionic vision

Program created with opencv that allows you to automatically count your repetitions on several fitness exercises.

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

Brief idea about our project is mentioned in project presentation file.

color detection using python