Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

Last update: Oct 14, 2022

Overview

ONNX Object Localization Network

Original image: https://en.wikipedia.org/wiki/File:Interior_design_865875.jpg

Important

I added a bit of logic to the box color selection to make the results look nicer. Because it runs K-Means for every box, it can be slow. If you only care about speed, either set all the boxes to the same color or use random colors.
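
The two cheaper options mentioned above could look roughly like the sketch below. It uses only the standard library. The function names `fixed_colors` and `random_colors` are hypothetical and not part of this repository, and the colors are assumed to be (B, G, R) tuples, which is the order OpenCV drawing functions expect.

```python
import random


def fixed_colors(num_boxes, color=(0, 255, 0)):
    """Return the same BGR color for every box (cheapest option)."""
    return [color] * num_boxes


def random_colors(num_boxes, seed=None):
    """Return one random BGR color per box without inspecting the image."""
    rng = random.Random(seed)  # a seed keeps the colors stable between runs
    return [tuple(rng.randint(0, 255) for _ in range(3)) for _ in range(num_boxes)]


if __name__ == "__main__":
    print(fixed_colors(2))
    print(random_colors(2, seed=0))
```

Both functions run in time proportional to the number of boxes and never touch the pixels, so they avoid the per-box K-Means cost entirely.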

Requirements

Check the requirements.txt file.
For ONNX Runtime, install onnxruntime-gpu if you have an NVIDIA GPU; otherwise install onnxruntime.
Additionally, pafy and youtube-dl are required for YouTube video inference.

Installation

git clone https://github.com/ibaiGorordo/ONNX-Object-Localization-Network.git
cd ONNX-Object-Localization-Network
pip install -r requirements.txt

ONNX Runtime

For NVIDIA GPU computers: pip install onnxruntime-gpu

Otherwise: pip install onnxruntime
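
Whichever package is installed, ONNX Runtime reports the execution providers it can use, and a session can be asked to prefer the GPU with a CPU fallback. The following is a minimal sketch of that idea, not necessarily how the scripts in this repository do it; `choose_providers` and `create_session` are hypothetical helper names.

```python
def choose_providers(available):
    """Prefer CUDA when it is reported as available; always keep CPU as a fallback."""
    providers = []
    if "CUDAExecutionProvider" in available:
        providers.append("CUDAExecutionProvider")
    providers.append("CPUExecutionProvider")
    return providers


def create_session(model_path):
    """Create an inference session (needs onnxruntime or onnxruntime-gpu installed)."""
    # Imported here so choose_providers can be used without ONNX Runtime installed.
    import onnxruntime as ort

    providers = choose_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

With only onnxruntime installed, `choose_providers` returns just the CPU provider, so the same code should work on machines without an NVIDIA GPU.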

For YouTube video inference

pip install youtube_dl
pip install git+https://github.com/zizo-pro/[email protected]

ONNX model

The original model was converted to ONNX by PINTO0309. Download the models using the download script in his repository and save them into the models folder.
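
To confirm that the download landed in the right place, a quick check like the sketch below lists the .onnx files in the models folder. The helper name `find_onnx_models` is hypothetical, and the sketch makes no assumption about the actual model file names.

```python
from pathlib import Path


def find_onnx_models(models_dir="models"):
    """Return the .onnx files found directly inside models_dir, sorted by name."""
    folder = Path(models_dir)
    if not folder.is_dir():
        return []
    return sorted(p for p in folder.iterdir() if p.is_file() and p.suffix == ".onnx")


if __name__ == "__main__":
    models = find_onnx_models()
    if not models:
        print("No .onnx files found in ./models; run the download script first.")
    for path in models:
        print(path.name)
```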

The models are licensed under the Apache-2.0 License: https://github.com/mcahny/object_localization_network/blob/main/LICENSE

PyTorch model

The original PyTorch model can be found in this repository: https://github.com/mcahny/object_localization_network

Examples

Image inference:

python image_object_localization.py

Webcam inference:

python webcam_object_localization.py

Video inference: https://youtu.be/n9qhQJXYUWo

python video_object_localization.py

Original video: https://youtu.be/vgJUXvkdS78

References:

Object-Localization-Network model: https://github.com/mcahny/object_localization_network
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
Original paper: https://arxiv.org/abs/2108.06753

Owner

Ibai Gorordo
