Python and C++ implementation of "MarkerPose: Robust real-time planar target tracking for accurate stereo pose estimation". Accepted at LXCV @ CVPR 2021.

Last update: Nov 18, 2022

Related tags

Deep Learning MarkerPose

Overview

MarkerPose: Robust real-time planar target tracking for accurate stereo pose estimation

This is a PyTorch and LibTorch implementation of MarkerPose: a robust, real-time pose estimation method based on a planar marker of three circles and a calibrated stereo vision system for high-accuracy pose estimation.

MarkerPose method consists of three stages. In the first stage, marker points in a pixel-level accuracy, and their IDs are estimated with a SuperPoint-like network for both views. In the second stage, three square patches that contain each ellipse of the target are extracted centered in the rough 2D locations previously estimated. With EllipSegNet the contour of the ellipses is segmented for sub-pixel-level centroid estimation for the first and second view. Finally, in the last stage, with the sub-pixel matches of both views, triangulation is applied for 3D pose estimation. For more details see our paper.

Pose estimation example

To run the Python or C++ pose estimation examples, you need first to clone this repository and download the dataset. This dataset contains the stereo calibration parameters, stereo images, and pretrained weights for SuperPoint and EllipSegNet.

Clone this repo: git clone https://github.com/jhacsonmeza/MarkerPose
Download the dataset here.
Move the dataset/ folder to the cloned repo folder: mv path/to/dataset/ MarkerPose/.

The folder structure into MarkerPose/ directory should be:

MarkerPose
    ├── C++
    ├── dataset
    ├── figures
    └── Python

To know how to run the pose estimation examples, see the Python/ folder for the PyTorch version, and the C++/ folder the LibTorch version. Furthermore, the code for training SuperPoint and EllipSegNet is also available in both versions.

Citation

If you find this code useful, please consider citing:

@inproceedings{meza2021markerpose,
  title={MarkerPose: Robust Real-time Planar Target Tracking for Accurate Stereo Pose Estimation},
  author={Meza, Jhacson and Romero, Lenny A and Marrugo, Andres G},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year={2021}
}

Python and C++ implementation of "MarkerPose: Robust real-time planar target tracking for accurate stereo pose estimation". Accepted at LXCV @ CVPR 2021.

Related tags

Overview

MarkerPose: Robust real-time planar target tracking for accurate stereo pose estimation

Pose estimation example

Citation

Owner

Jhacson Meza

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

3D detection and tracking viewer (visualization) for kitti & waymo dataset

Source code of all the projects of Udacity Self-Driving Car Engineer Nanodegree.

A PyTorch Toolbox for Face Recognition

Source code and data from the RecSys 2020 article "Carousel Personalization in Music Streaming Apps with Contextual Bandits" by W. Bendada, G. Salha and T. Bontempelli

Keras like implementation of Deep Learning architectures from scratch using numpy.

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

This repository is all about spending some time the with the original problem posed by Minsky and Papert

An end-to-end image translation model with weight-map for color constancy

DLWP: Deep Learning Weather Prediction

[Machine Learning Engineer Basic Guide] 부스트캠프 AI Tech - Product Serving 자료

Speech-Emotion-Analyzer - The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

using yolox+deepsort for object-tracker

Fast and customizable reconnaissance workflow tool based on simple YAML based DSL.

ZSL-KG is a general-purpose zero-shot learning framework with a novel transformer graph convolutional network (TrGCN) to learn class representation from common sense knowledge graphs.

Official respository for "Modeling Defocus-Disparity in Dual-Pixel Sensors", ICCP 2020

Official TensorFlow code for the forthcoming paper

An updated version of virtual model making

Code for the paper "Controllable Video Captioning with an Exemplar Sentence"

A toolkit for controlling Euro Truck Simulator 2 with python to develop self-driving algorithms.