TransCD: Scene Change Detection via Transformer-based Architecture

Last update: Dec 11, 2022

Related tags

Overview

TransCD: Scene Change Detection via Transformer-based Architecture

Requirements

Python 3.7.0  
Pytorch 1.6.0  
Visdom 0.1.8.9  
Torchvision 0.7.0

Datasets

CD2014 dataset
- paper: changedetection.net: A new change detection benchmark dataset
- paper: CDnet 2014: An Expanded Change Detection Benchmark Dataset
- dataset: http://changedetection.net/
VL-CMU-CD
- paper: Street-view change detection with deconvolutional networks
- dataset: https://ghsi.github.io/proj/RSS2016.html

Pretrained Model

Pretrained models for CDNet-2014 and VL-CMU-CD are available. You can download them from the following link.

CDNet-2014: [Baiduyun] the password is 78cp. [GoogleDrive].
- We uploaded six models trained on CDNet-2014 dataset, they are SViT_E1_D1_16, SViT_E1_D1_32, SViT_E4_D4_16, SViT_E4_D4_32, Res_SViT_E1_D1_16 and Res_SViT_E4_D4_16.
VL-CMU-CD: [Baiduyun] the password is ydzl. [GoogleDrive].
- We uploaded four models trained on VL-CMU-CD dataset, ther are SViT_E1_D1_16, SViT_E1_D1_32, Res_SViT_E1_D1_16 and Res_SViT_E1_D1_32.

Test

Before test, please download datasets and predtrained models. Copy pretrained models to folder './dataset_name/outputs/best_weights', and run the following command:

cd TransCD_ROOT
python test.py --net_cfg 
   
     --train_cfg

Use --save_changemap True to save predicted changemaps. For example:

python test.py --net_cfg SVit_E1_D1_32 --train_cfg CDNet_2014 --save_changemap True

Training

Before training, please download datasets and revise dataset path in configs.py to your path. CD TransCD_ROOT

python -m visdom.server
python train.py --net_cfg 
   
     --train_cfg

For example:

python -m visdom.server
python train.py --net_cfg Res_SViT_E1_D1_16 --train_cfg VL_CMU_CD

To display training processing, copy 'http://localhost:8097' to your browser.

Citing TransCD

If you use this repository or would like to refer the paper, please use the following BibTex entry.

@inproceddings{TransCD,
title={TransCD: Scene Change Detection via Transformer-based Architecture},
author={ZHIXUE WANG, YU ZHANG*, LIN LUO, NAN WANG},
journal={Optics Express},
yera={2021},
organization={The Optical Society},
}

Reference

-Akcay, Samet, Amir Atapour-Abarghouei, and Toby P. Breckon. "Ganomaly: Semi-supervised anomaly detection via adversarial training." Asian conference on computer vision. Springer, Cham, 2018.
-Chen, Jieneng, et al. "Transunet: Transformers make strong encoders for medical image segmentation." arXiv preprint arXiv:2102.04306 (2021).

TransCD: Scene Change Detection via Transformer-based Architecture

Related tags

Overview

TransCD: Scene Change Detection via Transformer-based Architecture

Requirements

Datasets

Pretrained Model

Test

Training

Citing TransCD

Reference

Owner

wangzhixue

OpenGAN: Open-Set Recognition via Open Data Generation

Source code of our BMVC 2021 paper: AniFormer: Data-driven 3D Animation with Transformer

Implementation for the IJCAI2021 work "Beyond the Spectrum: Detecting Deepfakes via Re-synthesis"

PyTorch implementation for paper Neural Marching Cubes.

Style transfer between images was performed using the VGG19 model

An implementation for Neural Architecture Search with Random Labels (CVPR 2021 poster) on Pytorch.

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

The Adapter-Bot: All-In-One Controllable Conversational Model

Data Augmentation with Variational Autoencoders

Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier

Spatial-Location-Constraint-Prototype-Loss-for-Open-Set-Recognition

DiscoNet: Learning Distilled Collaboration Graph for Multi-Agent Perception [NeurIPS 2021]

Bulk2Space is a spatial deconvolution method based on deep learning frameworks

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior and Non-local Spatial-Temporal Similarity

Pseudo-rng-app - whos needs science to make a random number when you have pseudoscience?

网络协议2天集训

An atmospheric growth and evolution model based on the EVo degassing model and FastChem 2.0

CLIPImageClassifier wraps clip image model from transformers

A particular navigation route using satellite feed and can help in toll operations & traffic managemen