Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Last update: Dec 07, 2022

Related tags

Deep Learning SELFY

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

This is the official implementation of the paper "Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition" by H.Kwon, M.Kim, S.Kwak, and M.Cho. For more information, checkout the project website and the paper on arXiv.

Environment:

Cuda: 9.0
gcc: 7.3.0
Python 3.6.8
PyTorch 1.0.1
TorchVison: 0.2.2
Spatial Correlation Sampler
Others: environment.yml

Anaconda environment setting

git clone https://github.com/arunos728/SELFY.git
cd selfy
conda env create -f environment.yml
conda activate selfy

Installing Correlation sampler

cd Pytorch-Correlation-extension
python setup.py install

# check whether SpatialCorrelationSampler is installed correctly.
python check.py forward
python check.py backward
python checkCorrelationSampler.py

Please check this repo for the detailed instructions.

Dataset preparation

Please refer to TSM repo for the detailed data preparation instructions.

File lists (.txt files in ./data) specify configurations of each video clips (path, #frames, class). We upload our Something-Something-V1 & V2 video file lists in ./data. The path of the file lists should be added into the scripts for training (or testing).

Training & Testing

For training SELFYNet on Something-Something, use the following command:

    ./scripts/train_SELFY_Something.sh

For testing your trained model on Something-Something, use the following command:

    ./scripts/test_SELFY_Something.sh

Citation

If you use this code or ideas from the paper for your research, please cite our paper:

@inproceedings{kwon2021learning,
  title={Learning self-similarity in space and time as generalized motion for video action recognition},
  author={Kwon, Heeseung and Kim, Manjin and Kwak, Suha and Cho, Minsu},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={13065--13075},
  year={2021}
}

Contact

Heeseung Kwon([email protected]), Manjin Kim([email protected])

Questions can also be left as issues in the repository. We will be happy to answer them.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Related tags

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

Environment:

Anaconda environment setting

Installing Correlation sampler

Dataset preparation

Training & Testing

Citation

Contact

Owner

Nerf pl - NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)

CVPR2021 Workshop - HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization.

It's A ML based Web Site build with python and Django to find the breed of the dog

Example-custom-ml-block-keras - Custom Keras ML block example for Edge Impulse

A lane detection integrated Real-time Instance Segmentation based on YOLACT (You Only Look At CoefficienTs)

Synthetic Humans for Action Recognition, IJCV 2021

Face-Recognition-Attendence-System - This face recognition Attendence system using Python

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

Official codebase for running the small, filtered-data GLIDE model from GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models.

Stacked Hourglass Network with a Multi-level Attention Mechanism: Where to Look for Intervertebral Disc Labeling

Sematic-Segmantation - Semantic Segmentation on MIT ADE20K dataset in PyTorch

Python script to download the celebA-HQ dataset from google drive

Implementation of Research Paper "Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation"

Hardware accelerated, batchable and differentiable optimizers in JAX.

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Federated_learning codes used for the the paper "Evaluation of Federated Learning Aggregation Algorithms" and "A Federated Learning Aggregation Algorithm for Pervasive Computing: Evaluation and Comparison"

Deeper DCGAN with AE stabilization

Keepsake is a Python library that uploads files and metadata (like hyperparameters) to Amazon S3 or Google Cloud Storage