An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

Overview

Optical_Character_Recognition

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

As an IOT/Computer Visions Intern at the Graduate Rotational Internship program (GRIP) by The Sparks Foundation (TSF), the first task is to implement a character detector which extracts printed or handwritten text from an image/video.

For more learning purposes, I've utilized this feature in cleaning/extracting valuable information from Blood Pressure Reports as images.

download

Dependencies

  • tesseract-ocr package
  • pytesseract 0.3.8
  • Open-cv
  • Pandas

    Using the pytesseract open source library to detect text on image/video.

    Open-cv for Image Processing

    Pandas for data manipulation

  • Owner
    Ramsis Hammadi
    Ramsis Hammadi
    GDB python tool to pretty print and debug c++ xtensor containers

    gdb_xt2np GDB python tool to pretty print, examine, and debug c++ Xtensor containers. Xtensor is a c++ library for scientific computing using multidim

    Christopher Burke 4 Oct 29, 2021
    Let's explore how we can extract text from forms

    Form Segmentation Let's explore how we can extract text from any forms / scanned pages. Objectives The goal is to find an algorithm that can extract t

    Philip Doxakis 42 Jun 05, 2022
    LEARN OPENCV IN 3 HOURS USING PYTHON - INCLUDING EXAMPLE PROJECTS

    LEARN OPENCV IN 3 HOURS USING PYTHON - INCLUDING EXAMPLE PROJECTS

    Murtaza Hassan 815 Dec 29, 2022
    A set of workflows for corpus building through OCR, post-correction and normalisation

    PICCL: Philosophical Integrator of Computational and Corpus Libraries PICCL offers a workflow for corpus building and builds on a variety of tools. Th

    Language Machines 41 Dec 27, 2022
    Characterizing possible failure modes in physics-informed neural networks.

    Characterizing possible failure modes in physics-informed neural networks This repository contains the PyTorch source code for the experiments in the

    Aditi Krishnapriyan 55 Jan 02, 2023
    Implementation of EAST scene text detector in Keras

    EAST: An Efficient and Accurate Scene Text Detector This is a Keras implementation of EAST based on a Tensorflow implementation made by argman. The or

    Jan Zdenek 208 Nov 15, 2022
    An expandable and scalable OCR pipeline

    Overview Nidaba is the central controller for the entire OGL OCR pipeline. It oversees and automates the process of converting raw images into citable

    81 Jan 04, 2023
    轻量级公式 OCR 小工具:一键识别各类公式图片,并转换为 LaTeX 格式

    QC-Formula | 青尘公式 OCR 介绍 轻量级开源公式 OCR 小工具:一键识别公式图片,并转换为 LaTeX 格式。 支持从 电脑本地 导入公式图片;(后续版本将支持直接从网页导入图片) 公式图片支持 .png / .jpg / .bmp,大小为 4M 以内均可; 支持印刷体及手写体,前

    青尘工作室 26 Jan 07, 2023
    Text recognition (optical character recognition) with deep learning methods.

    What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis | paper | training and evaluation data | failure cases and cle

    Clova AI Research 3.2k Jan 04, 2023
    learn how to use Gesture Control to change the volume of a computer

    Volume-Control-using-gesture In this project we are going to learn how to use Gesture Control to change the volume of a computer. We first look into h

    Diwas Pandey 49 Sep 22, 2022
    Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper

    DataTuner You have just found the DataTuner. This repository provides tools for fine-tuning language models for a task. See LICENSE.txt for license de

    81 Jan 01, 2023
    PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

    News Python3 implementations of PSENet [1], PAN [2] and PAN++ [3] are released at https://github.com/whai362/pan_pp.pytorch. [1] W. Wang, E. Xie, X. L

    1.1k Dec 24, 2022
    Text Detection from images using OpenCV

    EAST Detector for Text Detection OpenCV’s EAST(Efficient and Accurate Scene Text Detection ) text detector is a deep learning model, based on a novel

    Abhishek Singh 88 Oct 20, 2022
    Detect handwritten words in a text-line (classic image processing method).

    Word segmentation Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is fro

    Harald Scheidl 190 Jan 03, 2023
    Framework for the Complete Gaze Tracking Pipeline

    Framework for the Complete Gaze Tracking Pipeline The figure below shows a general representation of the camera-to-screen gaze tracking pipeline [1].

    Pascal 20 Jan 06, 2023
    Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

    ocr-fileformat Validate and transform between OCR file formats (hOCR, ALTO, PAGE, FineReader) Installation Docker System-wide Usage CLI GUI API Transf

    Universitätsbibliothek Mannheim 152 Dec 20, 2022
    A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

    TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes A PyTorch implement of TextSnake: A Flexible Representation for Detecting

    Prince Wang 417 Dec 12, 2022
    Generates a message from the infamous Jerma Impostor image

    Generate your very own jerma sus imposter message. Modes: Default Mode: Only supports the characters " ", !, a, b, c, d, e, h, i, m, n, o, p, q, r, s,

    Giorno420 1 Oct 27, 2022
    Repositório para registro de estudo da biblioteca opencv (Python)

    OpenCV (Python) Objetivo do Repositório: Registrar avanços no estudo da biblioteca opencv. O repositório estará aberto a qualquer pessoa e há tambem u

    1 Jun 14, 2022
    Perspective recovery of text using transformed ellipses

    unproject_text Perspective recovery of text using transformed ellipses. See full writeup at https://mzucker.github.io/2016/10/11/unprojecting-text-wit

    Matt Zucker 111 Nov 13, 2022