Anchor-free Oriented Proposal Generator for Object Detection

Gong Cheng, Jiabao Wang, Ke Li, Xingxing Xie, Chunbo Lang, Yanqing Yao, Junwei Han,

Introudction

Oriented object detection is a practical and challenging task in remote sensing image interpretation. Nowadays, oriented detectors mostly use horizontal boxes as intermedium to derive oriented boxes from them. However, the horizontal boxes are inclined to get a small Intersection-over-Unions (IoUs) with ground truths, which may have some undesirable effects, such as introducing redundant noise, mismatching with ground truths, detracting from the robustness of detectors, etc. In this paper, we propose a novel Anchor-free Oriented Proposal Generator (AOPG) that abandons the horizontal boxes-related operations from the network architecture. AOPG first produces coarse oriented boxes by Coarse Location Module (CLM) in an anchor-free manner and then refines them into high-quality oriented proposals. After AOPG, we apply a Fast R-CNN head to produce the final detection results. Furthermore, the shortage of large-scale datasets is also a hindrance to the development of oriented object detection. To alleviate the data insufficiency, we release a new dataset on the basis of our DIOR dataset and name it DIOR-R. Massive experiments demonstrate the effectiveness of AOPG. Particularly, without bells and whistles, we achieve the highest accuracy of 64.41%, 75.24% and 96.22% mAP on the DIOR-R, DOTA and HRSC2016 datasets respectively.

Benchmark and model zoo

Model	Backbone	Dataset	ms	rr	Lr schd	mAP	Google	Baidu Yun
AOPG	R50-FPN	DIOR-R	-	-	1x	64.41	-	-
AOPG	R50-FPN	DOTA1.0	-	-	1x	75.24	-	-
AOPG	R101-FPN	DOTA1.0	-	-	1x	75.39	-	-
AOPG	R50-FPN	DOTA1.0	√	√	1x	80.66	-	-
AOPG	R101-FPN	DOTA1.0	√	√	1x	80.19	-	-
AOPG	R50-FPN	HRSC2016	-	-	3x	96.22	-	-

You can download DIOR-R dataset at https://gcheng-nwpu.github.io/.

Installation

Please refer to install.md for installation and dataset preparation.

Get Started

Please refer to oriented_model_starting.md for training and testing.

Citation

This repo is based on OBBDetection.

If you use this repo in your research, please cite the following information.

@misc{cheng2021,
  title={Anchor-free Oriented Proposal Generator for Object Detection}, 
  author={Gong Cheng and Jiabao Wang and Ke Li and Xingxing Xie and Chunbo Lang and Yanqing Yao and Junwei Han},
  year={2021},
  eprint={2110.01931},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@article{RN37,
   author = {Li, Ke and Wan, Gang and Cheng, Gong and Meng, Liqiu and Han, Junwei},
   title = {Object detection in optical remote sensing images: A survey and a new benchmark},
   journal = {ISPRS Journal of Photogrammetry and Remote Sensing},
   volume = {159},
   pages = {296-307},
   ISSN = {0924-2716},
   DOI = {10.1016/j.isprsjprs.2019.11.023},
   year = {2020},
   type = {Journal Article}
}

Anchor-free Oriented Proposal Generator for Object Detection

Related tags

Overview

Anchor-free Oriented Proposal Generator for Object Detection

Introudction

Benchmark and model zoo

Installation

Get Started

Citation

Owner

jbwang1997

🐦 Opytimizer is a Python library consisting of meta-heuristic optimization techniques.

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

The goal of the exercises below is to evaluate the candidate knowledge and problem solving expertise regarding the main development focuses for the iFood ML Platform team: MLOps and Feature Store development.

Pytorch codes for "Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation"

Spatial Action Maps for Mobile Manipulation (RSS 2020)

Implementation of CVPR 2021 paper "Spatially-invariant Style-codes Controlled Makeup Transfer"

Unofficial implementation of Fast-SCNN: Fast Semantic Segmentation Network

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

CyTran: Cycle-Consistent Transformers for Non-Contrast to Contrast CT Translation

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

The-Secret-Sharing-Schemes - This interactive script demonstrates the Secret Sharing Schemes algorithm

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks

TensorFlow for Raspberry Pi

Deep Learning for Natural Language Processing SS 2021 (TU Darmstadt)

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

Pixray is an image generation system