training script for space time memory network

Last update: Dec 20, 2022

Related tags

Deep Learning STM-Training

Overview

Trainig Script for Space Time Memory Network

This codebase implemented training code for Space Time Memory Network with some cyclic features.

Requirement

python package

torch
python-opencv
pillow
yaml
imgaug
yacs
progress
nvidia-dali (optional)

GPU support

GPU Memory >= 12GB
CUDA >= 10.0

Data

See the doc DATASET.md for more details on data organization of our prepared dataset.

Release

We provide pre-trained model with different backbone in our codebase, results are validated on DAVIS17-val with gradient correction.

model	backbone	data backend	J	F	J & F	link	FPS
STM-Cycle	Resnet18	DALI	65.3	70.8	68.1	Google Drive	14.8
STM-Cycle	Resnet50	PIL	70.5	76.3	73.4	Google Drive	9.3

Runing

Appending the root folder to the search path of python interpreter

export PYTHONPATH=${PYTHONPATH}:./

To train the STM network, run following command.

python3 train.py --cfg config.yaml OPTION_KEY OPTION_VAL

To test the STM network, run following command

python3 test.py --cfg config.yaml initial ${PATH_TO_MODEL} OPTION_KEY OPTION_VAL

The test results will be saved as indexed png file at ${ROOT}/${output_dir}/${valset}.

To run a segmentation demo, run following command

python3 demo/demo.py --cfg demo/demo.yaml OPTION_KEY OPTION_VAL

The segmentation results will be saved at ${output_dir}.

Acknowledgement

This codebase borrows the code and structure from official STM repository

Reference

The codebase is built based on following works

@InProceedings{Oh_2019_ICCV,
author = {Oh, Seoung Wug and Lee, Joon-Young and Xu, Ning and Kim, Seon Joo},
title = {Video Object Segmentation Using Space-Time Memory Networks},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2019}
}

@InProceedings{Li_2020_NeurIPS,
author = {Li, Yuxi and Xu, Ning and Peng Jinlong and John See and Lin Weiyao},
title = {Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation},
booktitle = {Neural Information Processing System (NeurIPS)},
year = {2020}
}

training script for space time memory network

Related tags

Overview

Trainig Script for Space Time Memory Network

Requirement

python package

GPU support

Data

Release

Runing

Acknowledgement

Reference

Owner

Yuxi Li

Geometry-Free View Synthesis: Transformers and no 3D Priors

Algorithmic encoding of protected characteristics and its implications on disparities across subgroups

Official Implementation for Fast Training of Neural Lumigraph Representations using Meta Learning.

Implementation of "Semi-supervised Domain Adaptive Structure Learning"

Microscopy Image Cytometry Toolkit

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

RefineGNN - Iterative refinement graph neural network for antibody sequence-structure co-design (RefineGNN)

Send text to girlfriend in the morning

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification

A computer vision pipeline to identify the "icons" in Christian paintings

LaneDet is an open source lane detection toolbox based on PyTorch that aims to pull together a wide variety of state-of-the-art lane detection models

[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs

10th place solution for Google Smartphone Decimeter Challenge at kaggle.

Rank 3 : Source code for OPPO 6G Data Generation Challenge

Convert weight file.pth to weight file.blob

VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

Unsupervised Pre-training for Person Re-identification (LUPerson)

This repository is maintained for the scientific paper tittled " Study of keyword extraction techniques for Electric Double Layer Capacitor domain using text similarity indexes: An experimental analysis "

Robbing the FED: Directly Obtaining Private Data in Federated Learning with Modified Models

A dual benchmarking study of visual forgery and visual forensics techniques