Detectorch - detectron for PyTorch

Last update: Dec 23, 2022

Overview

Detectorch - detectron for PyTorch

(Disclaimer: this is work in progress and does not feature all the functionalities of detectron. Currently only inference and evaluation are supported -- no training) (News: Now supporting FPN and ResNet-101!)

This code allows to use some of the Detectron models for object detection from Facebook AI Research with PyTorch.

It currently supports:

Fast R-CNN
Faster R-CNN
Mask R-CNN

It supports ResNet-50/101 models with or without FPN. The pre-trained models from caffe2 can be imported and used on PyTorch.

Example Mask R-CNN with ResNet-101 and FPN.

Evaluation

Both bounding box evaluation and instance segmentation evaluation where tested, yielding the same results as in the Detectron caffe2 models. These results below have been computed using the PyTorch code:

Model	box AP	mask AP	model id
fast_rcnn_R-50-C4_2x	35.6		36224046
fast_rcnn_R-50-FPN_2x	36.8		36225249
e2e_faster_rcnn_R-50-C4_2x	36.5		35857281
e2e_faster_rcnn_R-50-FPN_2x	37.9		35857389
e2e_mask_rcnn_R-50-C4_2x	37.8	32.8	35858828
e2e_mask_rcnn_R-50-FPN_2x	38.6	34.5	35859007
e2e_mask_rcnn_R-101-FPN_2x	40.9	36.4	35861858

Training

Training code is experimental. See train_fast.py for training Fast R-CNN. It seems to work, but slow.

Installation

First, clone the repo with git clone --recursive https://github.com/ignacio-rocco/detectorch so that you also clone the Coco API.

The code can be used with PyTorch 0.3.1 or PyTorch 0.4 (master) under Python 3. Anaconda is recommended. Other required packages

torchvision (conda install torchvision -c soumith)
opencv (conda install -c conda-forge opencv )
cython (conda install cython)
matplotlib (conda install matplotlib)
scikit-image (conda install scikit-image)
ninja (conda install ninja) (required for Pytorch 0.4 only)

Additionally, you need to build the Coco API and RoIAlign layer. See below.

Compiling the Coco API

If you cloned this repo with git clone --recursive you should have also cloned the cocoapi in lib/cocoapi. Compile this with:

cd lib/cocoapi/PythonAPI
make install

Compiling RoIAlign

The RoIAlign layer was converted from the caffe2 version. There are two different implementations for each PyTorch version:

Pytorch 0.4: RoIAlign using ATen library (lib/cppcuda). Compiled JIT when loaded.
PyTorch 0.3.1: RoIAlign using TH/THC and cffi (lib/cppcuda_cffi). Needs to be compiled with:

cd lib/cppcuda_cffi
./make.sh

Quick Start

Check the demo notebook.

Detectorch - detectron for PyTorch

Related tags

Overview

Detectorch - detectron for PyTorch

Evaluation

Training

Installation

Compiling the Coco API

Compiling RoIAlign

Quick Start

Owner

Ignacio Rocco

A modular application for performing anomaly detection in networks

Building a real-time environment using webcam frame division in OpenCV and classify cropped images using a fine-tuned vision transformers on hybryd datasets samples for facial emotion recognition.

Voice Conversion by CycleGAN (语音克隆/语音转换)：CycleGAN-VC3

Paddle-Adversarial-Toolbox (PAT) is a Python library for Deep Learning Security based on PaddlePaddle.

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Eth brownie struct encoding example

A fast model to compute optical flow between two input images.

Neural Scene Graphs for Dynamic Scene (CVPR 2021)

Facial Action Unit Intensity Estimation via Semantic Correspondence Learning with Dynamic Graph Convolution

Paddle implementation for "Highly Efficient Knowledge Graph Embedding Learning with Closed-Form Orthogonal Procrustes Analysis" (NAACL 2021)

Implement slightly different caffe-segnet in tensorflow

Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet.

IDA file loader for UF2, created for the DEFCON 29 hardware badge

On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Learning Multiresolution Matrix Factorization and its Wavelet Networks on Graphs

Using BERT+Bi-LSTM+CRF

Attentive Implicit Representation Networks (AIR-Nets)