Activity image-based video retrieval

Last update: Oct 21, 2021

Overview

Cross-modal-retrieval

Our approach focuses on the Activity Image-to-Video Retrieval (AIVR) task. We compare it against state-of-the-art single-modality hashing methods, multi-modality hashing methods, and cross-modal retrieval methods.

Single modality hashing methods

Some hashing baselines for image retrieval can be found in https://github.com/willard-yuan/hashing-baseline-for-image-retrieval.
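
As a rough illustration of what hashing-based retrieval baselines generally do at query time, the sketch below ranks database items by Hamming distance between binary codes. It is not taken from this repository or from the linked baselines; the function names `hamming_distance` and `rank_by_hamming` and the toy 8-bit codes are hypothetical and chosen only for illustration.

```python
from typing import List


def hamming_distance(code_a: int, code_b: int) -> int:
    """Count the differing bits between two binary hash codes stored as ints."""
    return bin(code_a ^ code_b).count("1")


def rank_by_hamming(query_code: int, database_codes: List[int]) -> List[int]:
    """Return database indices ordered from the nearest code to the farthest."""
    return sorted(
        range(len(database_codes)),
        key=lambda i: hamming_distance(query_code, database_codes[i]),
    )


# Toy 8-bit codes (made-up values, not produced by any real hashing model).
database_codes = [0b10110010, 0b10110011, 0b01001101, 0b11110000]
query_code = 0b10110110

ranking = rank_by_hamming(query_code, database_codes)
print(ranking)
```

In a real baseline the codes would come from a learned hash function applied to image or video features; only the ranking step is shown here.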

Multiple modalities hashing methods

For more details, refer to https://github.com/czxxjtu/Hash-Learning.github.io. Some details about the hashing methods are in the hashing-baseline-for-image-retrieval-master folder.

Cross-modal retrieval methods

The compared cross-modal retrieval methods follow the paper:

Datasets

THUMOS'14 Dataset:

https://pan.baidu.com/s/1H6c8nh_Hs7gVkhESpxtvAg (extraction code: qp26)

ActivityNet Dataset:

https://pan.baidu.com/s/1P0jRecEmplCPaTPwFoOpVQ (extraction code: pnw9)

Bibtex

When using images from our dataset, please cite our paper using the following BibTeX [PDF]:

@article{pba2020,
author    = {Ruicong Xu and Li Niu and Jianfu Zhang and Liqing Zhang},
title     = {A Proposal-based Approach for Activity Image-to-Video Retrieval},
journal   = {AAAI},
year      = {2020}}

Owner

BCMI
