Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Last update: Oct 18, 2022

Related tags

Deep Learning MIGCN

Overview

Multi-modal Interaction Graph Convolutioal Network for Temporal Language Localization in Videos

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Model Pipeline

Usage

Environment Settings

We use the PyTorch framework.

Python version: 3.7.0
PyTorch version: 1.4.0

Get Code

Clone the repository:

git clone https://github.com/zmzhang2000/MIGCN.git
cd MIGCN

Data Preparation

Charades-STA

Download the preprocessed annotations and features of Charades-STA with I3D features.
Save them in data/charades.

ActivityNet

Download the preprocessed annotations of ActivityNet.
Download the C3D features of ActivityNet.
Process the C3D feature according to process_activitynet_c3d() in data/preprocess/preprocess.py.
Save them in data/activitynet.

Pre-trained Models

Download the checkpoints of Charades-STA and ActivityNet.
Save them in checkpoints

Data Generation

We provide the generation procedure of all MIGCN data.

The raw data is listed in data/raw_data/download.sh.
The preprocess code is in data/preprocess.

Training

Train MIGCN on Charades-STA with I3D feature:

python main.py --dataset charades --feature i3d

Train MIGCN on ActivityNet with C3D feature:

python main.py --dataset activitynet --feature c3d

Testing

Test MIGCN on Charades-STA with I3D feature:

python main.py --dataset charades --feature i3d --test --model_load_path checkpoints/$MODEL_CHECKPOINT

Test MIGCN on ActivityNet with C3D feature:

python main.py --dataset activitynet --feature c3d --test --model_load_path checkpoints/$MODEL_CHECKPOINT

Other Hyper-parameters

List other hyper-parameters by:

python main.py -h

Reference

Please cite the following paper if MIGCN is helpful for your research

@ARTICLE{9547801,
  author={Zhang, Zongmeng and Han, Xianjing and Song, Xuemeng and Yan, Yan and Nie, Liqiang},
  journal={IEEE Transactions on Image Processing}, 
  title={Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos}, 
  year={2021},
  volume={30},
  number={},
  pages={8265-8277},
  doi={10.1109/TIP.2021.3113791}}

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Related tags

Overview

Multi-modal Interaction Graph Convolutioal Network for Temporal Language Localization in Videos

Model Pipeline

Usage

Environment Settings

Get Code

Data Preparation

Charades-STA

ActivityNet

Pre-trained Models

Data Generation

Training

Testing

Other Hyper-parameters

Reference

Owner

Zongmeng Zhang

The PyTorch implementation of Directed Graph Contrastive Learning (DiGCL), NeurIPS-2021

A clean and scalable template to kickstart your deep learning project 🚀 ⚡ 🔥

RTSeg: Real-time Semantic Segmentation Comparative Study

Running Google MoveNet Multipose Tracking models on OpenVINO.

Implementation of several Bayesian multi-target tracking algorithms, including Poisson multi-Bernoulli mixture filters for sets of targets and sets of trajectories. The repository also includes the GOSPA metric and a metric for sets of trajectories to evaluate performance.

Official Pytorch Implementation of 'Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization' (ICCV-21 Oral)

PyTorch implementation for "HyperSPNs: Compact and Expressive Probabilistic Circuits", NeurIPS 2021

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

Example Of Fine-Tuning BERT For Named-Entity Recognition Task And Preparing For Cloud Deployment Using Flask, React, And Docker

VM3000 Microphones

Torch-ngp - A pytorch implementation of the hash encoder proposed in instant-ngp

A modification of Daniel Russell's notebook merged with Katherine Crowson's hq-skip-net changes

PyTorch reimplementation of hand-biomechanical-constraints (ECCV2020)

Fast, differentiable sorting and ranking in PyTorch

Character-Input - Create a program that asks the user to enter their name and their age

Evaluating different engineering tricks that make RL work

Implementation of paper "DCS-Net: Deep Complex Subtractive Neural Network for Monaural Speech Enhancement"

Adjusting for Autocorrelated Errors in Neural Networks for Time Series

A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal

RGBD-Net - This repository contains a pytorch lightning implementation for the 3DV 2021 RGBD-Net paper.