CAT-Net: Learning Canonical Appearance Transformations

Code to accompany our paper "How to Train a CAT: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change".

Dependencies

numpy
matpotlib
pytorch + torchvision (1.2)
Pillow
progress (for progress bars in train/val/test loops)
tensorboard + tensorboardX (for visualization)
pyslam + liegroups (optional, for running odometry/localization experiments)
OpenCV (optional, for running odometry/localization experiments)

Training the CAT

Download the ETHL dataset from here or the Virtual KITTI dataset from here
1. ETHL only: rename ethl1/2 to ethl1/2_static.
2. ETHL only: Update the local paths in tools/make_ethl_real_sync.py and run python3 tools/make_ethl_real_sync.py to generate a synchronized copy of the real sequences.
Update the local paths in run_cat_ethl/vkitti.py and run python3 run_cat_ethl/vkitti.py to start training.
In another terminal run tensorboard --port [port] --logdir [path] to start the visualization server, where [port] should be replaced by a numeric value (e.g., 60006) and [path] should be replaced by your local results directory.
Tune in to localhost:[port] and watch the action.

Running the localization experiments

Ensure the pyslam and liegroups packages are installed.
Update the local paths in make_localization_data.py and run python3 make_localization_data.py [dataset] to compile the model outputs into a localization_data directory.
Update the local paths in run_localization_[dataset].py and run python3 run_localization_[dataset].py [rgb,cat] to compute VO and localization results using either the original RGB or CAT-transformed images.
You can compute localization errors against ground truth using the compute_localization_errors.py script, which generates CSV files and several plots. Update the local paths and run python3 compute_localization_errors.py [dataset].

Citation

If you use this code in your research, please cite:

@article{2018_Clement_Learning,
  author = {Lee Clement and Jonathan Kelly},
  journal = {{IEEE} Robotics and Automation Letters},
  link = {https://arxiv.org/abs/1709.03009},
  title = {How to Train a {CAT}: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change},
  year = {2018}
}

Canonical Appearance Transformations

Related tags

Overview

CAT-Net: Learning Canonical Appearance Transformations

Dependencies

Training the CAT

Running the localization experiments

Citation

Owner

STARS Laboratory

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

Codes and scripts for "Explainable Semantic Space by Grounding Languageto Vision with Cross-Modal Contrastive Learning"

Pytorch Lightning Distributed Accelerators using Ray

TilinGNN: Learning to Tile with Self-Supervised Graph Neural Network (SIGGRAPH 2020)

deep-prae

The code of NeurIPS 2021 paper "Scalable Rule-Based Representation Learning for Interpretable Classification".

Implementation of light baking system for ray tracing based on Activision's UberBake

Python project to take sound as input and output as RGB + Brightness values suitable for DMX

Unsupervised Pre-training for Person Re-identification (LUPerson)

A powerful framework for decentralized federated learning with user-defined communication topology

iBOT: Image BERT Pre-Training with Online Tokenizer

Deep Learning and Logical Reasoning from Data and Knowledge

Supporting code for short YouTube series Neural Networks Demystified.

This repository introduces a short project about Transfer Learning for Classification of MRI Images.

Face detection using deep learning.

A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.

[ACM MM 2019 Oral] Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation

Real-time Neural Representation Fusion for Robust Volumetric Mapping

OpenFed: A Comprehensive and Versatile Open-Source Federated Learning Framework

A framework for joint super-resolution and image synthesis, without requiring real training data