Python TensorFlow 2 scripts for detecting objects of any class in an image without knowing their labels.

Last update: Nov 15, 2022

Overview

Tensorflow-Mobile-Generic-Object-Localizer

Original image taken from the OpenCV AI Kit - Lite Kickstarter page; make sure to check it out: https://www.kickstarter.com/projects/opencv/opencv-ai-kit-oak-depth-camera-4k-cv-edge-object-detection

⚠️ The object detector works best on images that contain few objects and starts to fail in more complex scenes. The model is suitable for automatically labelling objects when building datasets for custom object detection models.
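
To make the auto-labelling use case more concrete, here is a minimal sketch that turns one detected box into a label line in the YOLO text format (class, x center, y center, width, height, all normalized). It assumes the box arrives as normalized (ymin, xmin, ymax, xmax); that layout and the helper name to_yolo_line are assumptions for illustration, not code from this repository. Because the localizer does not predict labels, class_id is a placeholder that you would assign yourself.

```python
def to_yolo_line(box, class_id=0):
    """Format one normalized (ymin, xmin, ymax, xmax) box as a YOLO label line.

    The input layout is an assumption; class_id is a placeholder because the
    localizer itself does not tell you what the object is.
    """
    ymin, xmin, ymax, xmax = box
    x_center = (xmin + xmax) / 2
    y_center = (ymin + ymax) / 2
    width = xmax - xmin
    height = ymax - ymin
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"


# Made-up box for illustration, not real model output.
print(to_yolo_line((0.25, 0.25, 0.75, 0.75)))
```

Writing one such line per detection to a text file next to each image gives a starting point that still needs manual review, especially for cluttered scenes.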

Requirements

OpenCV, imread-from-url and TensorFlow. In addition, pafy and youtube-dl are required for YouTube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl
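
After installing, a quick way to confirm that the packages are importable is a small check like the sketch below. The import names are assumptions, since pip package names and import names can differ (OpenCV, for instance, is imported as cv2), and missing_modules is a hypothetical helper rather than part of this repository.

```python
import importlib.util

# Assumed import names for the packages listed under Requirements.
REQUIRED = ["cv2", "tensorflow", "imread_from_url"]
OPTIONAL_YOUTUBE = ["pafy", "youtube_dl"]


def missing_modules(names):
    """Return the module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Missing required modules:", missing_modules(REQUIRED))
    print("Missing YouTube modules:", missing_modules(OPTIONAL_YOUTUBE))
```

An empty list for the required modules means the image and webcam scripts should at least be able to import their dependencies.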

Tensorflow model

The original model was taken from TensorFlow Hub. Download it and place it in the models folder.

Use the following script to download the model:

python download_model.py

Examples

Image inference:

python imageObjectDetection.py

Webcam inference:

python webcamObjectDetection.py

Video inference:

python videoObjectDetection.py
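
The scripts above draw boxes around whatever the localizer finds. As a rough sketch of the kind of post-processing this involves, the function below keeps detections above a score threshold and converts normalized boxes to pixel coordinates. It assumes boxes in (ymin, xmin, ymax, xmax) order with values in [0, 1]; the function name select_boxes and the default threshold are assumptions for illustration and may not match what the repository scripts do.

```python
def select_boxes(boxes, scores, image_width, image_height, threshold=0.25):
    """Keep detections scoring at least `threshold` and convert their
    normalized boxes to integer pixel corners (x1, y1, x2, y2)."""
    selected = []
    for (ymin, xmin, ymax, xmax), score in zip(boxes, scores):
        if score < threshold:
            continue
        # Clamp to [0, 1] so slightly out-of-range boxes stay inside the image.
        ymin, xmin, ymax, xmax = (
            min(max(value, 0.0), 1.0) for value in (ymin, xmin, ymax, xmax)
        )
        selected.append((
            int(round(xmin * image_width)),
            int(round(ymin * image_height)),
            int(round(xmax * image_width)),
            int(round(ymax * image_height)),
        ))
    return selected


# Made-up values for illustration, not real model output.
example_boxes = [[0.1, 0.2, 0.5, 0.6], [0.0, 0.0, 1.0, 1.0]]
example_scores = [0.9, 0.05]
print(select_boxes(example_boxes, example_scores, image_width=640, image_height=480))
```

The resulting corner tuples can be passed directly to a drawing call such as OpenCV's cv2.rectangle. Raising the threshold trades missed objects for fewer spurious boxes, which matters most in the complex scenes where the detector struggles.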

Inference Examples

Original video by Animist: https://youtu.be/uKyoV0uG9rQ

Astronaut detection

Original image: https://commons.wikimedia.org/wiki/File:Astronaut_Standing_On_The_Moon.png

Excavator detection

Original image: https://en.wikipedia.org/wiki/Hitachi_Construction_Machinery_(Europe)#/media/File:ZX350LCN-3-Photo28-lo.jpg

Map island detection

Original image: https://ja.m.wikipedia.org/wiki/%E3%83%95%E3%82%A1%E3%82%A4%E3%83%AB:Map_of_Hawaii_highlighting_Hawaii_(island).svg

Phone accessories detection

Original image: https://upload.wikimedia.org/wikipedia/commons/thumb/1/1b/OnePlus_3_phone%2C_charger_and_package.jpg/1024px-OnePlus_3_phone%2C_charger_and_package.jpg

And many more

References:

Original model: https://tfhub.dev/google/object_detection/mobile_object_localizer_v1/1

Owner

Ibai Gorordo
