FastOCR is a desktop application for OCR APIs.

Overview

FastOCR


Installation

Arch Linux

fastocr-git @ AUR

Build from AUR or install with your favorite AUR helper.

yay -S fastocr-git  # Using yay
pikaur -S fastocr-git  # Using pikaur
# ...

PyPI

fastocr @ PyPI

pip install --user fastocr
# Then copy desktop file to ~/.local/share/applications

Manually

Proudly uses DepHell to generate the setup.py file.

python setup.py install
# Then copy desktop file to ~/.local/share/applications
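
The "copy desktop file" step mentioned in the comments above could be scripted roughly as follows. This is only a sketch: the helper name `install_desktop_file` is hypothetical, and the name and location of the desktop file shipped with FastOCR are not stated here, so the caller has to supply the path.

```python
import shutil
from pathlib import Path


def install_desktop_file(source, applications_dir=None):
    """Copy a .desktop file into the per-user applications directory.

    `source` is the path to the desktop file; its real name is not given
    in this README, so it must be passed in by the caller.
    """
    if applications_dir is None:
        # Target directory named in the install notes above.
        applications_dir = Path.home() / ".local" / "share" / "applications"
    applications_dir = Path(applications_dir)
    applications_dir.mkdir(parents=True, exist_ok=True)
    # shutil.copy2 returns the path of the newly created file.
    return Path(shutil.copy2(source, applications_dir))
```

Passing `applications_dir` explicitly makes it possible to try the helper against a temporary directory before touching the real one.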

Development

Proudly uses Poetry for development.

poetry install

Usage

fastocr

DBus

Service name: io.github.brucezhang1993.FastOCR

Object: /io/github/brucezhang1993/FastOCR

Methods

Action              Arguments            Description
captureToClipboard  {seconds} {no_copy}  Capture and OCR to system clipboard or dbus signal
                                         seconds: Delay capture in seconds
                                         no_copy: If true, the result will not be saved in clipboard
quitApp             --                   Quit app
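
As a rough sketch, `captureToClipboard` could be invoked from a script by building a `gdbus call` command line. The D-Bus interface name is not stated in this README, so `INTERFACE` below is an assumption (it simply reuses the service name). The helper name `build_capture_command` and the argument types (an integer delay and a boolean flag) are assumptions as well.

```python
import shlex

SERVICE = "io.github.brucezhang1993.FastOCR"
OBJECT_PATH = "/io/github/brucezhang1993/FastOCR"
# Assumption: the interface name is not documented here; adjust if it differs.
INTERFACE = SERVICE


def build_capture_command(seconds=0, no_copy=False, interface=INTERFACE):
    """Return the argv list for a gdbus call to captureToClipboard."""
    return [
        "gdbus", "call", "--session",
        "--dest", SERVICE,
        "--object-path", OBJECT_PATH,
        "--method", f"{interface}.captureToClipboard",
        str(int(seconds)),               # assumed integer delay in seconds
        "true" if no_copy else "false",  # assumed boolean, GVariant text form
    ]


# Print the command instead of running it; FastOCR must be running for it to work.
print(shlex.join(build_capture_command(seconds=2, no_copy=False)))
```

The returned list can be passed to `subprocess.run` once the interface name has been confirmed, for example with a D-Bus introspection tool.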

Signal

Signal    Arguments  Description
captured  {text}     OCR result will be sent to this signal
                     text: OCR result in plain text
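
One way to observe the `captured` signal without writing a full D-Bus client is to watch the session bus with `dbus-monitor` and a match rule. The sketch below only builds that rule and command line. The helper names are hypothetical, and the rule filters on the object path and member name because the interface name is not documented here.

```python
import shlex

OBJECT_PATH = "/io/github/brucezhang1993/FastOCR"


def captured_match_rule(path=OBJECT_PATH, member="captured"):
    """Build a D-Bus match rule that selects the `captured` signal."""
    return f"type='signal',path='{path}',member='{member}'"


def monitor_command():
    """Return the argv list for dbus-monitor watching the session bus."""
    return ["dbus-monitor", "--session", captured_match_rule()]


# Print a shell-quoted command that can be pasted into a terminal.
print(shlex.join(monitor_command()))
```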

Contributing

Pull requests are welcome.

For major changes, please open an issue first to discuss what you would like to change.

License

LGPL3

Comments
  • invalid token doesn't need cache, fix potential TypeError

    For Baidu OCR, token and expires_in in .cache/fastocr/baidu_token_data.json will be null if a wrong (or empty) API setting is given, in which case there is no need to save the token in the cache file. (Otherwise a TypeError can be raised when evaluating timestamp + expires_in.)
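
    The failure mode described here can be illustrated with a small guard. This is a sketch and not FastOCR's actual code: the function names are hypothetical, and using `timestamp` as a key in the cached data is an assumption based on the expression quoted above.

```python
import time


def should_cache(token_data):
    """Only cache a response that carries a real token and lifetime."""
    return (
        token_data.get("token") is not None
        and token_data.get("expires_in") is not None
    )


def is_expired(token_data, now=None):
    """Treat incomplete entries as expired instead of adding None to a number."""
    if not should_cache(token_data) or token_data.get("timestamp") is None:
        return True
    now = time.time() if now is None else now
    # Safe: both operands are known to be non-null at this point.
    return now >= token_data["timestamp"] + token_data["expires_in"]
```

    Without such a check, `timestamp + expires_in` with a null `expires_in` raises a TypeError in Python, which matches the error described in this report.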

    opened by wsyxbcl 1
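  • [BUG] Content in the operation area may not be updated after taking a screenshot

    Describe the bug After taking a screenshot, the content in the operation area may not be updated.

    To Reproduce Reported by a user; not yet reproduced.

    Expected behavior TODO

    Environment information Reported by a user; not yet reproduced. [Strongly recommend running fastocr diagnose in the console and pasting the output here]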

    Additional context Add any other context about the problem here.

    unconfirmed bug 
    opened by BruceZhang1993 0
  • [BUG] The token cache for BaiduOCR may block the API update in setting?

    Because the property method of BaiduOcr uses the cache file directly whenever one exists, changing the API settings has no effect.

    Removing the cache file after Setting.save, or redefining the cache file behavior, would probably be a good fix.
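
    The first suggestion could look roughly like the sketch below. The helper name `invalidate_token_cache` is hypothetical, the cache file name comes from the related token cache report, and resolving it relative to the home directory is an assumption. How the helper would be hooked into Setting.save is not shown.

```python
from pathlib import Path

# Assumption: the cache file lives under the user's home directory.
DEFAULT_CACHE = Path.home() / ".cache" / "fastocr" / "baidu_token_data.json"


def invalidate_token_cache(cache_file=DEFAULT_CACHE):
    """Delete the cached token so the next request uses the new settings.

    Returns True if a file was removed, False if there was nothing to remove.
    """
    cache_file = Path(cache_file)
    existed = cache_file.is_file()
    cache_file.unlink(missing_ok=True)  # requires Python 3.8 or newer
    return existed
```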

    confirmed bug 
    opened by wsyxbcl 0
  • [Feature Request] Please support Mathpix API

    Describe the feature Could you please add support for the Mathpix API so that users can copy and paste math formulas? Reference: https://mathpix.com/ocr

    feature request 
    opened by sukanka 2
  • Distribution

    Linux

    • [x] Arch Linux derivative AUR ~~Thanks to myself~~
    • [x] NixOS/Nix fastocr Thanks to Potato Hatsue
    • [ ] Debian/Ubuntu derivative
    • [ ] Gentoo derivative
    • [x] AppImage

    ...

    Windows

    • [x] PyInstaller (Exe)
    • [x] PyInstaller (Zip)

    macOS

    • [x] PyInstaller (App)

    Common

    help wanted 
    opened by BruceZhang1993 2
Owner
Bruce Zhang