Instance-level Image Retrieval using Reranking Transformers

Fuwen Tan, Jiangbo Yuan, Vicente Ordonez, ICCV 2021.

Abstract

Instance-level image retrieval is the task of searching in a large database for images that match an object in a query image. To address this task, systems usually rely on a retrieval step that uses global image descriptors, and a subsequent step that performs domain-specific refinements or reranking by leveraging operations such as geometric verification based on local features. In this work, we propose Reranking Transformers (RRTs) as a general model to incorporate both local and global features to rerank the matching images in a supervised fashion and thus replace the relatively expensive process of geometric verification. RRTs are lightweight and can be easily parallelized so that reranking a set of top matching results can be performed in a single forward-pass. We perform extensive experiments on the Revisited Oxford and Paris datasets, and the Google Landmark v2 dataset, showing that RRTs outperform previous reranking approaches while using much fewer local descriptors. Moreover, we demonstrate that, unlike existing approaches, RRTs can be optimized jointly with the feature extractor, which can lead to feature representations tailored to downstream tasks and further accuracy improvements.

Software required

The code is only tested on Linux 64:

  conda create -n rrt python=3.6
  conda activate rrt
  pip install -r requirements.txt

Organization

To use the code for experiments on Google Landmarks v2, Revisited Oxford/Paris, please refer to the folder RRT_GLD.

To use the code for experiments on Stanford Online Products, please refer to the folder RRT_SOP.

To use the code for evaluating SuperGlue on Revisited Oxford/Paris and Stanford Online Products, please refer to the repo SuperGlue.

Citing

If you find our paper/code useful, please consider citing:

@inproceedings{fwtan-instance-2021,
    author = {Fuwen Tan and Jiangbo Yuan and Vicente Ordonez},
    title = {Instance-level Image Retrieval using Reranking Transformers},
    year = {2021},
    booktitle = {International Conference on Computer Vision (ICCV)}
 }

Instance-level Image Retrieval using Reranking Transformers

Related tags

Overview

Instance-level Image Retrieval using Reranking Transformers

Abstract

Software required

Organization

Citing

Owner

UVA Computer Vision

Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)

Mitsuba 2: A Retargetable Forward and Inverse Renderer

Classify bird species based on their songs using SIamese Networks and 1D dilated convolutions.

Centroid-UNet is deep neural network model to detect centroids from satellite images.

Meta Learning for Semi-Supervised Few-Shot Classification

[ICME 2021 Oral] CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference

Near-Optimal Sparse Allreduce for Distributed Deep Learning (published in PPoPP'22)

Code for Multiple Instance Active Learning for Object Detection, CVPR 2021

[NeurIPS 2021] Source code for the paper "Qu-ANTI-zation: Exploiting Neural Network Quantization for Achieving Adversarial Outcomes"

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

Boost learning for GNNs from the graph structure under challenging heterophily settings. (NeurIPS'20)

Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

A list of all papers and resoureces on Semantic Segmentation

MINERVA: An out-of-the-box GUI tool for offline deep reinforcement learning

Multiple custom object count and detection using YOLOv3-Tiny method

Code for IntraQ, PyTorch implementation of our paper under review

NHS AI Lab Skunkworks project: Long Stayer Risk Stratification

Source code for CIKM 2021 paper for Relation-aware Heterogeneous Graph for User Profiling