The official implementation of the paper "Siamese Transformer Pyramid Networks for Real-Time UAV Tracking", accepted at WACV 2022

Last update: Jan 08, 2023

Overview

SiamTPN

Introduction

This is the official implementation of SiamTPN (WACV 2022). The tracker integrates a pyramid feature network and a transformer into a Siamese network, achieving state-of-the-art performance (better than DiMP) while running at 30 FPS on a single CPU. When optimized with ONNX and OpenVINO, the tracker can run at 45 FPS on CPU, which makes it a promising choice for tracking on drones.

[Paper] [Raw Results] [Drone Tracking Videos] [Models]

Training

prepare data

change the path in lib/train/admin/local.py to your data location
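Before launching a long training run, it can help to confirm that the directories you entered actually exist. The following is a minimal sketch, not part of the repository: the helper `missing_dirs`, the dictionary keys, and the example paths are all hypothetical placeholders, and the real settings in lib/train/admin/local.py may be organized differently.

```python
from pathlib import Path


def missing_dirs(named_paths):
    """Return the names whose path is not an existing directory."""
    return [
        name
        for name, path in named_paths.items()
        if not Path(path).expanduser().is_dir()
    ]


# Hypothetical entries: replace them with the paths you set in local.py.
data_dirs = {
    "got10k_train": "/path/to/got10k/train",
    "workspace": "/path/to/workspace",
}

for name in missing_dirs(data_dirs):
    print(f"Directory for '{name}' not found: {data_dirs[name]}")
```

Any name printed here points at a path that should be corrected before training starts.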

# Distributed training with 4 processes on a single node (typically one per GPU)
python -m torch.distributed.launch --nproc_per_node 4 tools/run_training.py --config shufflenet_l345_192

# Single-GPU training, for testing purposes
python tools/run_training.py --config shufflenet_l345_192

Test and evaluate SiamTPN

prepare data

change the path in lib/test/evaluation/local.py to your data location

running on cpu

# Download the pretrained model and put it under the ./results/checkpoints/train/SiamTPN/ folder

python tools/test.py siamtpn shufflenet_l345_192 --dataset_name got10k_val --debug 1 --cpu 1 --epoch 100 --sequence GOT-10k_Val_000001
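To evaluate several sequences one after another, the command above can be wrapped in a small script. This is a sketch under stated assumptions: it reuses only the flags shown in the command above, the helpers `build_test_command` and `run_sequences` are hypothetical and not part of the repository, and the second sequence name in the usage comment is assumed to follow the same naming pattern as the first.

```python
import subprocess
import sys


def build_test_command(sequence, tracker="siamtpn", config="shufflenet_l345_192",
                       dataset="got10k_val", epoch=100, debug=1, cpu=1):
    """Build the argument list for one tools/test.py run on a single sequence."""
    return [
        sys.executable, "tools/test.py", tracker, config,
        "--dataset_name", dataset,
        "--debug", str(debug),
        "--cpu", str(cpu),
        "--epoch", str(epoch),
        "--sequence", sequence,
    ]


def run_sequences(sequences, **kwargs):
    """Run tools/test.py once per sequence; stop at the first failing run."""
    for sequence in sequences:
        subprocess.run(build_test_command(sequence, **kwargs), check=True)


# Usage, from the repository root (second sequence name is an assumption):
# run_sequences(["GOT-10k_Val_000001", "GOT-10k_Val_000002"])
```

Passing the arguments as a list avoids shell quoting issues, and `check=True` raises an error as soon as one run exits with a non-zero status.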

running on cpu with onnx optimization

Debug mode displays the tracking results; see tools/test.py for more details.

Currently, the ONNX version only supports CPU.

First, install onnx and onnxruntime:

pip install onnx
# for onnxruntime, download the OpenVINO build from the release [page](https://github.com/intel/onnxruntime/releases/tag/v3.1) and install it with
pip install onnxruntime_openvino-1.9.0-cp37-cp37m-linux_x86_64.whl

# please refer to the [page](https://github.com/intel/onnxruntime/releases/tag/v3.1) for OpenVINO installation details.

# Download the converted ONNX model and put it under the ./results/onnx/ folder
# or convert your own model with
python tools/onnx_search.py
python tools/onnx_template.py

python tools/test.py siamtpn_onnx shufflenet_l345_192 --dataset_name got10k_val --debug 1 --cpu 1 --epoch 100 --sequence GOT-10k_Val_000001
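If the ONNX test fails at import time, a quick check of the Python environment can narrow the problem down. The sketch below is not part of the repository; the helper names are hypothetical. It reports whether the `onnx` and `onnxruntime` packages can be found and, if ONNX Runtime is importable, lists its execution providers. With the OpenVINO build installed, one would expect an OpenVINO provider to appear in that list, though the exact name depends on the installed build.

```python
import importlib.util


def installed(names):
    """Map each package name to True if it can be found in this environment."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


def available_providers():
    """Return ONNX Runtime execution providers, or [] if it cannot be imported."""
    try:
        import onnxruntime as ort
    except ImportError:
        return []
    return list(ort.get_available_providers())


status = installed(["onnx", "onnxruntime"])
print("Packages found:", status)
print("Execution providers:", available_providers())
```

An empty provider list means `onnxruntime` could not be imported, in which case the wheel installation step is the first thing to revisit.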

Citation

Acknowledgements

Our code is implemented based on the following libraries:

Owner

Robotics and Intelligent Systems Control @ NYUAD
