Official repository of ICCV21 paper "Viewpoint Invariant Dense Matching for Visual Geolocalization"

Last update: Jan 03, 2023

Related tags

Deep Learning geo_warp

Overview

Viewpoint Invariant Dense Matching for Visual Geolocalization: PyTorch implementation

This is the implementation of the ICCV21 paper:

G Berton, C. Masone, V. Paolicelli and B. Caputo, Viewpoint Invariant Dense Matching for Visual Geolocalization

Setup

First download the baseline models which have been trained following the training procedure in the NetVLAD paper. We provide a script to download the six models used, which are a combination of 3 backbone encoders (AlexNet, VGG-16 and ResNet-50) with 2 pooling/aggregation layers (GeM and NetVLAD).

python download_pretrained_baselines.py

Then you should prepare your geo-localization dataset, so that the directory tree is as such:

dataset_name
└── images
    ├── train
    │   ├── gallery
    │   └── queries
    ├── val
    │   ├── gallery
    │   └── queries
    └── test
        ├── gallery
        └── queries

and the images are named as @UTM [email protected] [email protected]@.jpg

Dependencies

See requirements.txt

Training

You can train the model using the train.py, here's an example with the lightest/fastest model (i.e. AlexNet + GeM):

python train.py --arch alexnet --pooling gem --resume_fe pretrained_baselines/alexnet_gem.pth

For a full set of options, run python train.py -h. The script will create a folder under ./runs/default/YYYY-MM-DD_HH-mm-ss where logs and checkpoints will be saved.

Evaluation

Coming soon.

BibTeX

If you use this code in your project, please cite us using:

@InProceedings{Berton_ICCV_2021,
    author    = {Berton, Gabriele and Masone, Carlo and Paolicelli, Valerio and Caputo, Barbara},
    title     = {Viewpoint Invariant Dense Matching for Visual Geolocalization},
    booktitle = ICCV,
    month     = {October},
    year      = {2021},
    pages     = {12169-12178}
}

Official repository of ICCV21 paper "Viewpoint Invariant Dense Matching for Visual Geolocalization"

Related tags

Overview

Viewpoint Invariant Dense Matching for Visual Geolocalization: PyTorch implementation

Setup

Dependencies

Training

Evaluation

BibTeX

Owner

Gabriele Berton

Data stream analytics: Implement online learning methods to address concept drift in data streams using the River library. Code for the paper entitled "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams" accepted in IEEE GlobeCom 2021.

Winners of DrivenData's Overhead Geopose Challenge

The source code and dataset for the RecGURU paper (WSDM 2022)

这是一个mobilenet-yolov4-lite的库，把yolov4主干网络修改成了mobilenet，修改了Panet的卷积组成，使参数量大幅度缩小。

Use CLIP to represent video for Retrieval Task

[ICCV-2021] An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation

CS50x-AI - Artificial Intelligence with Python from Harvard University

Code for the paper "Reinforcement Learning as One Big Sequence Modeling Problem"

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

Code to accompany the paper "Finding Bipartite Components in Hypergraphs", which is published in NeurIPS'21.

Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting

YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)

Architecture Patterns with Python (TDD, DDD, EDM)

Unofficial Implement PU-Transformer

MatchGAN: A Self-supervised Semi-supervised Conditional Generative Adversarial Network

Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

The official implementation of the IEEE S&P`22 paper "SoK: How Robust is Deep Neural Network Image Classification Watermarking".

An essential implementation of BYOL in PyTorch + PyTorch Lightning

Extreme Lightwegith Portrait Segmentation