Object Detection and Multi-Object Tracking

Last update: Jan 04, 2023

Overview

Object Detection and Tracking

Object detection is a computer technology related to computer vision and image processing that deals with detecting instances of semantic objects of a certain class (such as humans, buildings, or cars) in digital images and videos.

Environment

I have tested on Ubuntu 16.04/18.04. The code may work on other systems.

[Ubuntu-Deep-Learning-Environment-Setup]

Ubuntu 16.04 / 18.04
ROS Kinetic / Melodic
GTX 1080Ti / RTX 2080Ti
python 2.7 / 3.6

Installation

Clone the repository

git clone https://github.com/yehengchen/Object-Detection-and-Tracking.git

[OneStage]

YOLO: Real-Time Object Detection and Tracking

How to train a YOLO model on custom images: YOLOv3 - [Link] / YOLOv4 - [Link]

YOLOv4 + Deep_SORT - Pedestrian Counting & Social Distance - [Link]
YOLOv3 + Deep_SORT - Pedestrian&Car Counting - [Link]

YOLOv3 + SORT - Pedestrian Counting - [Link]

Darknet_ROS: Real-Time Object Detection and Grasp Detection With ROS

YOLOv3 + ROS Kinetic - For small Custom Data - [Link]

YOLOv3 + ROS Melodic - Robot Grasp Detection - [Link]
Parts-Arrangement-Robot - [Link]

YOLOv3 + OpenCV + ROS Melodic - Object Detection (Rotated) - [Link]

SSD: Single Shot MultiBox Detector

How to train a SSD model on own images - [Link]

[TwoStage]

R-CNN: Region-based methods

Fast R-CNN / Faster R-CNN / Mask R-CNN

How to train a Mask R-CNN model on own images - [Link]

Mask R-CNN + ROS Kinetic - [Link]

This project is ROS package of Mask R-CNN algorithm for object detection and segmentation.

COCO & VOC Datasets

COCO dataset and Pascal VOC dataset - [Link]
How to get it working on the COCO dataset coco2voc - [Link]
Convert Dataset2Yolo - COCO / VOC - [Link]

Object Detection and Multi-Object Tracking

Related tags

Overview

Object Detection and Tracking

Environment

Ubuntu 16.04 / 18.04

ROS Kinetic / Melodic

GTX 1080Ti / RTX 2080Ti

python 2.7 / 3.6

Installation

[OneStage]

YOLO: Real-Time Object Detection and Tracking

YOLOv4 + Deep_SORT - Pedestrian Counting & Social Distance - [Link]

YOLOv3 + Deep_SORT - Pedestrian&Car Counting - [Link]

YOLOv3 + SORT - Pedestrian Counting - [Link]

Darknet_ROS: Real-Time Object Detection and Grasp Detection With ROS

YOLOv3 + ROS Kinetic - For small Custom Data - [Link]

YOLOv3 + ROS Melodic - Robot Grasp Detection - [Link]

Parts-Arrangement-Robot - [Link]

YOLOv3 + OpenCV + ROS Melodic - Object Detection (Rotated) - [Link]

SSD: Single Shot MultiBox Detector

How to train a SSD model on own images - [Link]

[TwoStage]

R-CNN: Region-based methods

Mask R-CNN + ROS Kinetic - [Link]

COCO & VOC Datasets

COCO dataset and Pascal VOC dataset - [Link]

How to get it working on the COCO dataset coco2voc - [Link]

Convert Dataset2Yolo - COCO / VOC - [Link]

CV & Robotics Paper List (3D object detection & 6D pose estimation) - [Link]

PapersWithCode: Browse > Computer Vision > Object Detection - [Link]

ObjectDetection Two-stage vs One-stage Detectors - [Link]

ObjectDetection mAP & IoU - [Link]

Owner

Bobby Chen

Using image super resolution models with vapoursynth and speeding them up with TensorRT

Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

🥈78th place in Riiid Answer Correctness Prediction competition

Happywhale - Whale and Dolphin Identification Silver🥈 Solution (26/1588)

Official repository for the paper F, B, Alpha Matting

GPU-Accelerated Deep Learning Library in Python

Ranking Models in Unlabeled New Environments （iccv21）

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Train/evaluate a Keras model, get metrics streamed to a dashboard in your browser.

Interactive Image Generation via Generative Adversarial Networks

Analysis of rationale selection in neural rationale models

Code for our SIGCOMM'21 paper "Network Planning with Deep Reinforcement Learning".

ML-based medical imaging using Azure

Hso-groupie - A pwnable challenge in Real World CTF 4th

A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (IJCV submission)

《K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters》(2020)

FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection

Efficient Deep Learning Systems course

Code for the paper "Reinforced Active Learning for Image Segmentation"