Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Last update: Jul 08, 2021

This is a PyTorch implementation of the model described in our paper:

Z. Qi, S. Wang, C. Su, L. Su, W. Zhang, and Q. Huang. Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis. ACM MM 2020.

Dependencies

PyTorch 1.2.0
CUDA 9.2.148
cuDNN 7.6.2
opencv-python 4.2.0.34
Python 3.6.9

Data

Dataset Preparation

Download the pre-trained concept detector weights from Baidu (password 'wv0e') or Google Drive and put them in the folder weights/.
Download the FCVID dataset from http://bigvid.fudan.edu.cn/FCVID/.
The annotation information for each dataset is provided in the folder data/FCVID/video_labels.
Extract the frames of each video and put them in the folder data/FCVID/frames/.
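
The frame extraction step could be sketched as follows, using opencv-python from the dependency list. This is only an illustration under stated assumptions: the helper names `frame_dir_for` and `extract_frames`, the one-subfolder-per-video layout, the zero-padded JPEG file names, and the `every_n` sampling parameter are all hypothetical and are not taken from this repository. Check the data loading code for the layout it actually expects.

```python
from pathlib import Path


def frame_dir_for(video_path, frames_root="data/FCVID/frames"):
    """Return the folder that will hold the frames of one video.

    Assumption: one subfolder per video, named after the video file stem.
    """
    return Path(frames_root) / Path(video_path).stem


def extract_frames(video_path, frames_root="data/FCVID/frames", every_n=1):
    """Decode a video and save every `every_n`-th frame as a JPEG.

    Returns the number of frames written. The file naming scheme
    (000001.jpg, 000002.jpg, ...) is an assumption for illustration.
    """
    import cv2  # opencv-python; imported here so the path helper works without it

    out_dir = frame_dir_for(video_path, frames_root)
    out_dir.mkdir(parents=True, exist_ok=True)

    cap = cv2.VideoCapture(str(video_path))
    index = 0   # index of the frame just decoded
    saved = 0   # number of frames written so far
    while True:
        ok, frame = cap.read()
        if not ok:  # end of stream or decode failure
            break
        if index % every_n == 0:
            saved += 1
            cv2.imwrite(str(out_dir / f"{saved:06d}.jpg"), frame)
        index += 1
    cap.release()
    return saved
```

Calling `extract_frames` once per downloaded video file would populate data/FCVID/frames/ with one subfolder per video; `every_n` can be raised to subsample long untrimmed videos and save disk space.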

For the ActivityNet dataset (http://activity-net.org/), we use the latest released version (v1.3).

Train

python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --no_test

For other hyperparameters, please refer to opts.py.

Test

Pretrained model weights are available from Baidu (password 'szlk') or Google Drive.
Download the pre-trained weights and put them in the folder results/.
python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --resume_path pretrained_model/tdcmn_si_soa.pth --no_train --test_crop_number 1

Citation

Please cite our paper if you use this code in your own work:

@inproceedings{qi2020modeling,
  title={Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis},
  author={Qi, Zhaobo and Wang, Shuhui and Su, Chi and Su, Li and Zhang, Weigang and Huang, Qingming},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={3798--3806},
  year={2020}
}

Contact

If you have any problems with our code, feel free to contact

[email protected]

Owner

qzhb