Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022


Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository was developed with Python 3.6.12 on Ubuntu 16.04.5 LTS, using CUDA 11.2 and PyTorch 1.7.1. We use a single NVIDIA 3090 GPU for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, FoggyCityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Please follow the dataset preparation instructions in [Link].

Code

Faster R-CNN -> Thanks to jwyang [Link]; Fourier Domain Adaptation -> Thanks to Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).
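For readers unfamiliar with frequency-spectrum augmentation, the sketch below shows the basic idea behind Fourier Domain Adaptation: keep the phase of a source image and replace its low-frequency amplitude with that of a target image. It is only a minimal NumPy illustration and is not the Mix, Replace, Extend, or Disorder operations of this repository. The function name low_freq_amplitude_swap and the parameter beta are assumptions chosen for the example.

```python
import numpy as np


def low_freq_amplitude_swap(src, trg, beta=0.1):
    """Replace the low-frequency amplitude of `src` with that of `trg`.

    Illustrative sketch only (hypothetical helper, not the repository's code).
    src, trg: float arrays of identical shape (H, W, C).
    beta: half-size of the swapped window as a fraction of min(H, W).
    """
    if src.shape != trg.shape:
        raise ValueError("src and trg must have the same shape")
    if not 0.0 <= beta <= 0.5:
        raise ValueError("beta must lie in [0, 0.5]")

    # 2-D FFT over the spatial axes, one transform per channel.
    fft_src = np.fft.fft2(src, axes=(0, 1))
    fft_trg = np.fft.fft2(trg, axes=(0, 1))

    # Split into amplitude and phase; only the amplitude is modified.
    amp_src, pha_src = np.abs(fft_src), np.angle(fft_src)
    amp_trg = np.abs(fft_trg)

    # Move the zero frequency to the centre so the low band is a square.
    amp_src = np.fft.fftshift(amp_src, axes=(0, 1))
    amp_trg = np.fft.fftshift(amp_trg, axes=(0, 1))

    h, w = src.shape[:2]
    b = int(np.floor(min(h, w) * beta))
    ch, cw = h // 2, w // 2
    rows = slice(ch - b, ch + b + 1)
    cols = slice(cw - b, cw + b + 1)
    amp_src[rows, cols] = amp_trg[rows, cols]

    # Undo the shift, recombine with the source phase, and invert.
    amp_src = np.fft.ifftshift(amp_src, axes=(0, 1))
    mixed = amp_src * np.exp(1j * pha_src)
    return np.real(np.fft.ifft2(mixed, axes=(0, 1)))
```

A larger beta transfers more of the target appearance but tends to introduce visible artifacts; the result usually needs clipping back to the valid pixel range before it is saved or fed to a detector.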

Train

To train a Faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

You also need to add the augmented data to the load path by creating a new dataset_name variable.
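One possible way to organize this is sketched below. The registry layout, the helper add_augmented_dataset, the split names, and the "+" convention for joining splits are all assumptions made for illustration; check how the training script in this repository actually resolves dataset names before adapting it.

```python
# Hypothetical registry mapping a dataset name to its train/test splits.
# All names here are illustrative and are not taken from the repository.
DATASETS = {
    "pascal_voc": {"train": "voc_2007_trainval", "test": "voc_2007_test"},
}


def add_augmented_dataset(registry, base_name, aug_name, aug_train_split):
    """Return a copy of `registry` with a new entry that trains on the
    base split plus an augmented split and tests on the base test split."""
    if base_name not in registry:
        raise KeyError(f"unknown base dataset: {base_name}")
    if aug_name in registry:
        raise ValueError(f"dataset already registered: {aug_name}")

    entry = dict(registry[base_name])
    # Assumed convention: several training splits are joined with "+".
    entry["train"] = entry["train"] + "+" + aug_train_split

    updated = dict(registry)
    updated[aug_name] = entry
    return updated


datasets = add_augmented_dataset(
    DATASETS, "pascal_voc", "pascal_voc_aug", "voc_2007_trainval_aug"
)
```

The new name would then be passed through --dataset in place of pascal_voc, assuming the script is extended to recognize it.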

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ denote the ‘person’, ‘motorcycle’, and ‘bicycle’ categories, respectively.

Results on adaptation from Daytime-sunny to Dusk-rainy. Here, we directly run the released code of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.
