Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Last update: Jun 05, 2022

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

For Educational Purposes Only This is a recursive neural net trained to classify specific crime classes based on the UCF-Crime dataset UCF-CRIME or to perform general anomaly detection. The model uses images that have been encoded into the CLIP image embedding space.

Introducing CLIP

The model we are utilizing in our application, CLIP (developed by OpenAI), is a generalized image classification model which can take any image and produce word embeddings for the purpose of matching raw text strings to the contents of the image. The design and training of the model allows for high zero-shot performance in classifying images (i.e. image classification problems outside of the training set). The following image provides a summary of the model (taken from A. Radford et al.):

While typical image classification models train an image feature extractor and a linear classifier to predict a label, CLIP trains an image encoder and text encoder to predict the correct pairings of a batch of (image, text) training examples. At test time the learned text encoder synthesizes a zero-shot linear classifier by embedding the names or descriptions of the target dataset’s classes.

Installation

Clone the repo and the required packages can be found in the required.txt file. Running classifier.py will start an interactive application that will attempt to perform anomaly detection or multi-class classification on videos found in the 'Videos' directory.

The scripts that were used to create the image sequence database from the video files of the UCF-Crime dataset as well as the training scripts and models can be found in the src directory.

Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Related tags

Overview

👁️ Hindsight AI: Crime Classification With Clip

About

Introducing CLIP

Installation

Owner

Miles Tweed

68 keypoint annotations for COFW test data

Parametric Contrastive Learning (ICCV2021)

CenterPoint 3D Object Detection and Tracking using center points in the bird-eye view.

Implements the training, testing and editing tools for "Pluralistic Image Completion"

Versatile Generative Language Model

✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.

🤗 Push your spaCy pipelines to the Hugging Face Hub

Self-attentive task GAN for space domain awareness data augmentation.

AquaTimer - Programmable Timer for Aquariums based on ATtiny414/814/1614

Keras like implementation of Deep Learning architectures from scratch using numpy.

Leaderboard, taxonomy, and curated list of few-shot object detection papers.

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Relative Positional Encoding for Transformers with Linear Complexity

This is an official source code for implementation on Extensive Deep Temporal Point Process

Online-compatible Unsupervised Non-resonant Anomaly Detection Repository

Introduction to CPM

Combining Latent Space and Structured Kernels for Bayesian Optimization over Combinatorial Spaces

Implementation for Learning to Track with Object Permanence

Estimating Example Difficulty using Variance of Gradients

Laplace Redux -- Effortless Bayesian Deep Learning