This repository is a place to learn and strengthen computer vision skills, build robotics projects that use computer vision as a perception tool, and develop advanced controllers for the robots of the future.

Last update: Nov 03, 2022

Overview

THE COMPUTER VISION DOJO

This repository was created to learn and gain new knowledge about computer vision and all its possible applications in the field of robotics and smart systems.

SOFTWARE DEPENDENCIES 💻

PYTHON DEPENDENCIES

Python
Python is a programming language that lets you work quickly and integrate systems more effectively.
OpenCV
OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision.
NumPy
NumPy is a general-purpose array-processing package. It provides a high-performance multidimensional array object and tools for working with these arrays. It is the fundamental package for scientific computing with Python.
Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.
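
As a quick sanity check on the Python dependencies listed above, the following minimal sketch reports which of them can be found in the current environment. It uses only the standard library. The import names `cv2`, `numpy`, and `matplotlib` are the usual ones for these packages, while the helper name `check_dependencies` and the `PYTHON_DEPENDENCIES` mapping are hypothetical and not part of this repository.

```python
import importlib.util

# Display name -> import name for each Python dependency.
# This mapping is an illustrative assumption, not a file from the repository.
PYTHON_DEPENDENCIES = {
    "OpenCV": "cv2",
    "NumPy": "numpy",
    "Matplotlib": "matplotlib",
}


def check_dependencies(modules=PYTHON_DEPENDENCIES):
    """Return {display name: bool} telling whether each module can be located."""
    return {
        name: importlib.util.find_spec(module) is not None
        for name, module in modules.items()
    }


if __name__ == "__main__":
    # Print one status line per dependency without importing the packages.
    for name, available in check_dependencies().items():
        print(f"{name}: {'found' if available else 'missing'}")
```

Any dependency reported as missing would still need to be installed before running the Python examples; the check only looks modules up and does not import them.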

C++ DEPENDENCIES

Microsoft C++ Build Tools
The Microsoft C++ Build Tools provide MSVC toolsets via a scriptable, standalone installer without Visual Studio. They are recommended if you build C++ libraries and applications targeting Windows from the command line (e.g. as part of your continuous integration workflow). They include tools shipped in Visual Studio 2015 Update 3, Visual Studio 2017 version 15.9, and all major updates to Visual Studio 2019 (v16.x).
OpenCV
OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision.
CMake
CMake is an open-source, cross-platform family of tools designed to build, test and package software.

AUTHOR

Elkin Javier Guerra Galeano

Mechatronics Engineering student at EIA University, excited about integrating software and hardware systems.
He is curious about control theory and about implementing robotics solutions with different mathematical designs.
He is skilled at problem-solving for real-life applications and passionate about building knowledge by combining theory and practice.
