Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)

Last update: Nov 29, 2022

Related tags

Overview

TimeCycle

Code for Learning Correspondence from the Cycle-consistency of Time (CVPR 2019, Oral). The code is developed based on the PyTorch framework, in version PyTorch 0.4 with Python 2. It also runs smoothly with PyTorch 1.0. This repo includes the training code for learning semi-dense correspondence from unlabeled videos, and testing code for applying this correspondence on segmentation mask tracking in videos.

Citation

If you use our code in your research or wish to refer to the baseline results, please use the following BibTeX entry.

@inproceedings{CVPR2019_CycleTime,
    Author = {Xiaolong Wang and Allan Jabri and Alexei A. Efros},
    Title = {Learning Correspondence from the Cycle-Consistency of Time},
    Booktitle = {CVPR},
    Year = {2019},
}

Model and Result

Our trained model can be downloaded from here. The tracking performance on DAVIS-2017 for this model (without training on DAVIS-2017) is:

cropSize	J_mean	J_recall	J_decay	F_mean	F_recall	F_decay
320 x 320	0.419	0.409	0.272	0.394	0.336	0.328
400 x 400	0.430	0.437	0.296	0.426	0.413	0.356
480 x 480	0.464	0.500	0.332	0.500	0.480	0.379

Note that one can easily improve the results in test time by increasing the input image size "cropSize" in the script. The training and testing procedures for this model are described as follows.

Converting Our Model to Standard Pytorch ResNet-50

Please see convert_model.ipynb for converting our model here to standard Pytorch ResNet-50 model format.

Dataset Preparation

Please read DATASET.md for downloading and preparing the VLOG dataset for training and DAVIS dataset for testing.

Training

Replace the input list in train_video_cycle_simple.py in the home folder as:

    params['filelist'] = 'YOUR_DATASET_FOLDER/vlog_frames_12fps.txt'

Then run the following code:

    python train_video_cycle_simple.py --checkpoint pytorch_checkpoints/release_model_simple

Testing

Replace the input list in test_davis.py in the home folder as:

    params['filelist'] = 'YOUR_DATASET_FOLDER/davis/DAVIS/vallist.txt'

Set up the dataset path YOUR_DATASET_FOLDER in run_test.sh . Then run the testing and evaluation code together:

    sh run_test.sh

Acknowledgements

weakalign by Ignacio Rocco, Relja Arandjelović and Josef Sivic.

inflated_convnets_pytorch by Yana Hasson.

pytorch-classification by Wei Yang.

Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)

Related tags

Overview

TimeCycle

Citation

Model and Result

Converting Our Model to Standard Pytorch ResNet-50

Dataset Preparation

Training

Testing

Acknowledgements

Owner

Xiaolong Wang

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

基于tensorflow 2.x的图片识别工具集

FaRL for Facial Representation Learning

M3DSSD: Monocular 3D Single Stage Object Detector

Blender Add-on that sets a Material's Base Color to one of Pantone's Colors of the Year

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle

Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021

Re-TACRED: Addressing Shortcomings of the TACRED Dataset

An all-in-one application to visualize multiple different local path planning algorithms

Code repository for Semantic Terrain Classification for Off-Road Autonomous Driving

Code for ICCV 2021 paper Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes using Scene Graphs

A New Approach to Overgenerating and Scoring Abstractive Summaries

PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021)

百度2021年语言与智能技术竞赛机器阅读理解Pytorch版baseline

Analyzes your GitHub Profile and presents you with a report on how likely you are to become the next MLH Fellow!

Flexible Option Learning - NeurIPS 2021

Invert and perturb GAN images for test-time ensembling

Compressed Video Action Recognition