Codes_APN

Official codes of CVPR21 paper: Normal Learning in Videos with Attention Prototype Network (https://arxiv.org/abs/2108.11055)

Overview of our approach based on APU and CAU model:

Introduction

Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection. With models trained on the normal data, the reconstruction errors of anomalous scenes are usually much larger than those of normal ones. Previous methods introduced the memory bank into AE, for encoding diverse normal patterns across the training videos. However, they are memory consuming and cannot cope with unseen new scenarios in the testing data. In this work, we propose a self-attention prototype unit (APU) to encode the normal latent space as prototypes in real time, free from extra memory cost. In addition, we introduce circulative attention mechanism to our backbone to form a novel feature extracting learner, namely Circulative Attention Unit(CAU). It enables the fast adaption capability on new scenes by only consuming a few iterations of update. Extensive experiments are conducted on various benchmarks. The superior performance over the state-of-the-art demonstrates the effectiveness of our method.

Performance

We achieved SOTA on many video anomaly detection datasets.

Unsupervised Anomaly Detection Model Training

bash train.sh

Unsupervised Anomaly Detection Model Testing

bash test.sh

If you find this work helpful, please cite:

@inproceedings{Nv2021APN,
  author    = {Chao Hu and
	       Fan Wu and
               Weijie Wu and
               Weibin Qiu and
               Shengxin Lai},
  title     = {Normal Learning in Videos with Attention Prototype Network},
  booktitle = {Computer Vision and Pattern Recognition},
  year      = {2021}
}

Normal Learning in Videos with Attention Prototype Network

Related tags

Overview

Codes_APN

Introduction

Performance

Unsupervised Anomaly Detection Model Training

Unsupervised Anomaly Detection Model Testing

Owner

Approaches to modeling terrain and maps in python

Unsupervised Representation Learning via Neural Activation Coding

ChebLieNet, a spectral graph neural network turned equivariant by Riemannian geometry on Lie groups.

[CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos

Official code for paper Exemplar Based 3D Portrait Stylization.

The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".

Unofficial implementation of Alias-Free Generative Adversarial Networks. (https://arxiv.org/abs/2106.12423) in PyTorch

MDMM - Learning multi-domain multi-modality I2I translation

DivNoising is an unsupervised denoising method to generate diverse denoised samples for any noisy input image. This repository contains the code to reproduce the results reported in the paper https://openreview.net/pdf?id=agHLCOBM5jP

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Solving reinforcement learning tasks which require language and vision

Repository for publicly available deep learning models developed in Rosetta community

The official implementation of Variable-Length Piano Infilling (VLI).

Data cleaning, missing value handle, EDA use in this project

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Plug and play transformer you can find network structure and official complete code by clicking List

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

Scalable Graph Neural Networks for Heterogeneous Graphs

Dialect classification