Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

Last update: Nov 14, 2022

Overview

ONNX-ImageNet-1K-Object-Detector

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX. The repository combines a class agnostic object localizer to first detect the objects in the image, and next a ResNet50 model trained on ImageNet is used to label each box.

Original image: https://commons.wikimedia.org/wiki/File:Il_cuore_di_Como.jpg

Why

There are a lot of object detection models, but since most of them are trained in the COCO dataset, most of them can only detect a maximum of 80 classes. This repository proposes a "quick and dirty" solution to be able to detect the 1000 objects available in the ImageNet dataset.

❗ Important ❗

This model uses a lightweight class agnostic object localizer to first detect the objects. Therefore, this repository is not going to behave as well as other object detection models in complex scenes. In those cases, the object localizer will fail quickly and therefore no objects will be detected.
The ResNet50 clasifier is fast in a desktop GPU, however, since it needs to run for each of the detected boxes, the performance might be affected for images with many objects.

Requirements

Check the requirements.txt file.

Installation

pip install -r requirements.txt

ONNX model

Class Agnostic Object Localizer: The original model from TensorflowHub (link at the bottom) was converted to different formats (including .onnx) by PINTO0309, the models can be found in his repository. This repository will automatically download the model if the model is not found in the models folder.
ResNet50 Classifier: The original model from PaddleClas (link at the bottom) was converted to ONNX format using a similar procedure as the one described in this article by PINTO0309. This repository will automatically download the model.

How to use

Image inference:

python image_object_detection.py

Video inference:

python video_object_detection.py

Webcam inference:

python video_object_detection.py

Examples

Macaque Detection

Original image: https://commons.wikimedia.org/wiki/File:Onsen_Monkey.JPG

Christmas Stocking Detection

Original image: https://unsplash.com/photos/paSqTlm3DsA

Burrito Detection

Original image: https://commons.wikimedia.org/wiki/File:Breakfast_burrito_(cropped).jpg

Bridge Detection

Original image: https://commons.wikimedia.org/wiki/File:Bayonne_Bridge_Collins_Pk_jeh-2.JPG

[Inference video Example]

1k.detector.output_Trim.mp4

Original video: https://www.pexels.com/video/a-medusa-jellyfish-swimming-gracefully-underwater-2731905/ (by Vova Krasilnikov)

References

Original Class Agnostic Object Localizer: https://tfhub.dev/google/object_detection/mobile_object_localizer_v1/1
Original Resnet50 (ResNet50_vd_ssld) Classifier from PaddleClass: https://github.com/PaddlePaddle/PaddleClas/blob/release/2.3/docs/zh_CN/algorithm_introduction/ImageNet_models.md
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
PaddlePaddle to ONNX conversion article: https://zenn.dev/pinto0309/scraps/cf319db8fea4c3

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

Related tags

Overview

ONNX-ImageNet-1K-Object-Detector

Why

❗ Important ❗

Requirements

Installation

ONNX model

How to use

Examples

Macaque Detection

Christmas Stocking Detection

Burrito Detection

Bridge Detection

[Inference video Example]

References

Owner

Ibai Gorordo

Pytorch implementation of OCNet series and SegFix.

Offical code for the paper: "Growing 3D Artefacts and Functional Machines with Neural Cellular Automata" https://arxiv.org/abs/2103.08737

TFOD-MASKRCNN - Tensorflow MaskRCNN With Python

Implement Decoupled Neural Interfaces using Synthetic Gradients in Pytorch

This repository provides some of the code implemented and the data used for the work proposed in "A Cluster-Based Trip Prediction Graph Neural Network Model for Bike Sharing Systems".

Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields.

The source code of CVPR17 'Generative Face Completion'.

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementation (CVPR 2022)

iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

Tree Nested PyTorch Tensor Lib

Explainer for black box models that predict molecule properties

PyTorch implementation for our paper "Deep Facial Synthesis: A New Challenge"

Corruption Invariant Learning for Re-identification

Pydantic models for pywttr and aiopywttr.

[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Official PyTorch implementation of "Synthesis of Screentone Patterns of Manga Characters"

Code for Learning Manifold Patch-Based Representations of Man-Made Shapes, in ICLR 2021.