Code release for "COTR: Correspondence Transformer for Matching Across Images"

Last update: Jan 06, 2023

Related tags

Overview

COTR: Correspondence Transformer for Matching Across Images

This repository contains the inference code for COTR. We plan to release the training code in the future. COTR establishes correspondence in a functional and end-to-end fashion. It solves dense and sparse correspondence problem in the same framework.

Demos

Check out our demo video at here.

1. Install environment

Our implementation is based on PyTorch. Install the conda environment by: conda env create -f environment.yml.

Activate the environment by: conda activate cotr_env.

Notice that we use scipy=1.2.1 .

2. Download the pretrained weights

Down load the pretrained weights at here. Extract in to ./out, such that the weights file is at /out/default/checkpoint.pth.tar.

3. Single image pair demo

python demo_single_pair.py --load_weights="default"

Example sparse output:

Example dense output with triangulation:

Note: This example uses 10K valid sparse correspondences to densify.

4. Facial landmarks demo

python demo_face.py --load_weights="default"

Example:

5. Homography demo

python demo_homography.py --load_weights="default"

Citation

If you use this code in your research, cite the paper:

@article{jiang2021cotr,
  title={{COTR: Correspondence Transformer for Matching Across Images}},
  author={Wei Jiang and Eduard Trulls and Jan Hosang and Andrea Tagliasacchi and Kwang Moo Yi},
  booktitle={arXiv preprint},
  publisher_page={https://arxiv.org/abs/2103.14167},
  year={2021}
}

Code release for "COTR: Correspondence Transformer for Matching Across Images"

Related tags

Overview

COTR: Correspondence Transformer for Matching Across Images

Demos

1. Install environment

2. Download the pretrained weights

3. Single image pair demo

4. Facial landmarks demo

5. Homography demo

Citation

Owner

UBC Computer Vision Group

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

NeuroGen: activation optimized image synthesis for discovery neuroscience

Unofficial Alias-Free GAN implementation. Based on rosinality's version with expanded training and inference options.

Setup freqtrade/freqUI on Heroku

On-device speech-to-index engine powered by deep learning.

RefineGNN - Iterative refinement graph neural network for antibody sequence-structure co-design (RefineGNN)

Stochastic gradient descent with model building

A collection of papers about Transformer in the field of medical image analysis.

Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Paper Code：A Self-adaptive Weighted Differential Evolution Approach for Large-scale Feature Selection

ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs

Fast sparse deep learning on CPUs

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

This program was designed to detect whether someone is wearing a facemask through a live video stream.

Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"

(3DV 2021 Oral) Filtering by Cluster Consistency for Large-Scale Multi-Image Matching

DirectVoxGO reconstructs a scene representation from a set of calibrated images capturing the scene.

TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

Code for layerwise detection of linguistic anomaly paper (ACL 2021)