code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Last update: Dec 14, 2022

Overview

Video_Pace

This repository contains the code for the following paper:

Jiangliu Wang, Jianbo Jiao and Yunhui Liu, "Self-Supervised Video Representation Learning by Pace Prediction", In: ECCV (2020).

Main idea:

Framework:

Requirements

pytroch >= 1.3.0
tensorboardX
cv2
scipy

Usage

Data preparation

UCF101 dataset

Download the original UCF101 dataset from the official website. And then extarct RGB images from videos.
Or direclty download the pre-processed RGB data of UCF101 here provided by feichtenhofer.

Pre-train

Train with pace prediction task on S3D-G, the default clip length is 64 and input video size is 224 x 224.

python train.py --rgb_prefix RGB_DIR --gpu 0,1,2,3 --bs 32 --lr 0.001 --height 256 --width 256 --crop_sz 224 --clip_len 64

Train with pace prediction task on c3d/r3d/r21d, the default clip length is 16 and input video size is 112 x 112.

python train.py --rgb_prefix RGB_DIR --gpu 0 --bs 30 --lr 0.001 --model c3d/r3d/r21d --height 128 --width 171 --crop_sz 112 --clip_len 16

Evaluation

To be updated...

Citation

If you find this work useful or use our code, please consider citing:

@InProceedings{Wang20,
  author       = "Jiangliu Wang and Jianbo Jiao and Yunhui Liu",
  title        = "Self-Supervised Video Representation Learning by Pace Prediction",
  booktitle    = "European Conference on Computer Vision",
  year         = "2020",
}

Acknowlegement

Part of our codes are adapted from S3D-G HowTO100M, we thank the authors for their contributions.

code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Related tags

Overview

Video_Pace

Main idea:

Framework:

Requirements

Usage

Data preparation

Pre-train

Evaluation

Citation

Acknowlegement

Owner

Jiangliu Wang

Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper

A package to predict protein inter-residue geometries from sequence data

Pytorch cuda extension of grid_sample1d

Planar Prior Assisted PatchMatch Multi-View Stereo

3rd Place Solution of the Traffic4Cast Core Challenge @ NeurIPS 2021

Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

Python package for visualizing the loss landscape of parameterized quantum algorithms.

PCGNN - Procedural Content Generation with NEAT and Novelty

quantize aware training package for NCNN on pytorch

Advancing mathematics by guiding human intuition with AI

Semiconductor Machine learning project

Code for this paper The Lottery Ticket Hypothesis for Pre-trained BERT Networks.

The Generic Manipulation Driver Package - Implements a ROS Interface over the robotics toolbox for Python

[ICLR 2021] Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Deep Learning Interviews book: Hundreds of fully solved job interview questions from a wide range of key topics in AI.

The Face Mask recognition system uses AI technology to detect the person with or without a mask.

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

1st place solution in CCF BDCI 2021 ULSEG challenge

An open source Jetson Nano baseboard and tools to design your own.

Continuous Time LiDAR odometry