A three-stage detection and recognition pipeline of complex meters in wild

This is the first released system towards detection and recognition of complex meters in wild. The system can be divided into three moduels. Fisrtly, a yolo-based detector is applied to get pure meter region. Secondly, a spatial transformer module is eatablished to rectify the position of meter. Lastly, an end-to-end network is to read meter values, which is implemented by pointer/dail predcition and key number learning.

Visulization results

Left row is the original image, middle row is the process of meter rectification, right row is the result of meter value reading.

ToDo List

Installation

Requirements:

Python3 (Python3.7 is recommended)
PyTorch >= 1.0
torchvision from master
numpy
skimage
OpenCV==3.0.x
CUDA >= 9.0 (10.0 is recommended)

Models

Download Trained model

Please put distro_net.pt into meter_distro/weight.
put textgraph_vgg_450.pth into model/meter_data.

Demo

You can run a demo script for a single image inference by two steps.

python get_meter_area.py. and the detected meter will be stored in scene_image_data/deteced_meter

python predict.py to get distored meter and final result.

This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.

Related tags

Overview

A three-stage detection and recognition pipeline of complex meters in wild

Visulization results

ToDo List

Installation

Requirements:

Models

Demo

Owner

Yan Shu

Adversarial Framework for (non-) Parametric Image Stylisation Mosaics

Video Autoencoder: self-supervised disentanglement of 3D structure and motion

Deep Q Learning with OpenAI Gym and Pokemon Showdown

Classification of EEG data using Deep Learning

ML-Ensemble – high performance ensemble learning

1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking

Pytorch implementation of CoCon: A Self-Supervised Approach for Controlled Text Generation

Rax is a Learning-to-Rank library written in JAX

An evaluation toolkit for voice conversion models.

Model Zoo for MindSpore

Flower classification model that classifies flowers in 10 classes made using transfer learning (~85% accuracy).

Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition (AGRA, ACM 2020, Oral)

Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.

An easy way to build PyTorch datasets. Modularly build datasets and automatically cache processed results

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

PyTorch code for SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised DA

CMT: Convolutional Neural Networks Meet Vision Transformers

The 1st place solution of track2 (Vehicle Re-Identification) in the NVIDIA AI City Challenge at CVPR 2021 Workshop.