Learning Calibrated-Guidance for Object Detection in Aerial Images

Last update: Sep 22, 2022

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

We propose a simple yet effective Calibrated-Guidance (CG) scheme to enhance channel communications in a feature transformer fashion, which can adaptively determine the calibration weights for each channel based on the global feature affinity-pairs. Specifically, given a set of feature maps, CG first computes the feature similarity between each channel and the remaining channels as the intermediary calibration guidance. Then, re-representing each channel by aggregating all the channels weighted together via the guidance. Our CG can be plugged into any deep neural network, which is named as CG-Net. To demonstrate its effectiveness and efficiency, extensive experiments are carried out on both oriented and horizontal object detection tasks of aerial images. Results on two challenging benchmarks (i.e., DOTA and HRSC2016) demonstrate that our CG-Net can achieve state-of-the-art performance in accuracy with a fair computational overhead.

Introduction

This codebase is created to build benchmarks for object detection in aerial images. It is modified from mmdetection. The master branch works with PyTorch 1.1 or higher. If you would like to use PyTorch 0.4.1, please checkout to the pytorch-0.4.1 branch.

Results

Visualization results for oriented object detection on the test set of DOTA.

Comparison to the baseline on DOTA for oriented object detection with ResNet-101. The figures with blue boxes are the results of the baseline and pink boxes are the results of our proposed CG-Net.

Experiment

ImageNet Pretrained Model from Pytorch

The effectiveness of our proposed methods with different backbone network on the test of DOTA.

Backbone	+CG	Weight	mAP(%)
ResNet-50		download	73.26
ResNet-50	+	download	74.21
ResNet-101		download	73.06
ResNet-101	+	download	74.30
ResNet-152		download	72.78
ResNet-152	+	download	73.53

CG-Net Results in DOTA.

Backbone	Aug Rotate	Task	Weight	mAP(%)
ResNet-101	+	Oriented	download	77.89
ResNet-101	+	Horizontal	download	78.26

Installation

Please refer to INSTALL.md for installation.

Get Started

Please see GETTING_STARTED.md for the basic usage of mmdetection.

Contributing

We appreciate all contributions to improve benchmarks for object detection in aerial images.

Citing

If you use our work, please consider citing:

@InProceedings{liang2021learning,
      title={Learning Calibrated-Guidance for Object Detection in Aerial Images}, 
      author={Dong, Liang and Zongqi, Wei and Dong, Zhang and Qixiang, Geng and Liyan, Zhang and Han, Sun and Huiyu, Zhou and Mingqiang, Wei and Pan, Gao},
      booktitle ={arXiv:2103.11399},
      year={2021}
}

Thanks to the Third Party Libs

Pytorch

mmdetection

AerialDetection

Learning Calibrated-Guidance for Object Detection in Aerial Images

Related tags

Overview

Learning Calibrated-Guidance for Object Detection in Aerial Images arxiv

Introduction

Results

Experiment

Installation

Get Started

Contributing

Citing

Thanks to the Third Party Libs

Owner

Kaggle Ultrasound Nerve Segmentation competition [Keras]

Introducing neural networks to predict stock prices

Code for EMNLP 2021 paper: "Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training"

Jingju baseline - A baseline model of our project of Beijing opera script generation

Back to Event Basics: SSL of Image Reconstruction for Event Cameras

ChatBot-Pytorch - A GPT-2 ChatBot implemented using Pytorch and Huggingface-transformers

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural and Positional Representations)

Automate issue discovery for your projects against Lightning nightly and releases.

Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation

Equipped customers with insights about their EVs Hourly energy consumption and helped predict future charging behavior using LSTM model

Revisiting Self-Training for Few-Shot Learning of Language Model.

Artificial Intelligence search algorithm base on Pacman

🌊 Online machine learning in Python

Code for Transformer Hawkes Process, ICML 2020.

FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection

The Pytorch code of "Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification", CVPR 2022 (Oral).

[NeurIPS 2021] Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

Implementation of the bachelor's thesis "Real-time stock predictions with deep learning and news scraping".

A booklet on machine learning systems design with exercises