Official implementation of "Dynamic Anchor Learning for Arbitrary-Oriented Object Detection" (AAAI2021).

Last update: Nov 28, 2022

Overview

DAL

This project hosts the official implementation for our AAAI 2021 paper:

Dynamic Anchor Learning for Arbitrary-Oriented Object Detection [arxiv] [comments].

Abstract

In this paper, we propose a dynamic anchor learning (DAL) method, which utilizes the newly deﬁned matching degree to comprehensively evaluate the localization potential of the anchors and carry out a more efﬁcient label assignment process. In this way, the detector can dynamically select high-quality anchors to achieve accurate object detection, and the divergence between classiﬁcation and regression will be alleviated.

Getting Started

The codes build Rotated RetinaNet with the proposed DAL method for rotation object detection. The supported datasets include: DOTA, HRSC2016, ICDAR2013, ICDAR2015, UCAS-AOD, NWPU VHR-10, VOC.

Installation

Insatll requirements:

pip install -r requirements.txt
pip install git+git://github.com/lehduong/torch-warmup-lr.git

Build the Cython and CUDA modules:

cd $ROOT/utils
sh make.sh
cd $ROOT/utils/overlaps_cuda
python setup.py build_ext --inplace

Installation for DOTA_devkit:

cd $ROOT/datasets/DOTA_devkit
sudo apt-get install swig
swig -c++ -python polyiou.i
python setup.py build_ext --inplace

Inference

You can use the following command to test a dataset. Note that weight, img_dir, dataset,hyp should be modified as appropriate.

python demo.py

Train

Move the dataset to the $ROOT directory.
Generate imageset files for daatset division via:

cd $ROOT/datasets
python generate_imageset.py

Modify the configuration file hyp.py and arguments in train.py, then start training:

python train.py

Evaluation

Different datasets use different test methods. For UCAS-AOD/HRSC2016/VOC/NWPU VHR-10, you need to prepare labels in the appropriate format in advance. Take evaluation on HRSC2016 for example:

cd $ROOT/datasets/evaluate
python hrsc2gt.py

then you can conduct evaluation:

python eval.py

Note that :

the script needs to be executed only once, but testing on different datasets needs to be executed again.
the imageset file used in hrsc2gt.py is generated from generate_imageset.py.

Main Results

Method	Dataset	Bbox	Backbone	Input Size	mAP/F1
DAL	DOTA	OBB	ResNet-101	800 x 800	71.78
DAL	UCAS-AOD	OBB	ResNet-101	800 x 800	89.87
DAL	HRSC2016	OBB	ResNet-50	416 x 416	88.60
DAL	ICDAR2015	OBB	ResNet-101	800 x 800	82.4
DAL	ICDAR2013	HBB	ResNet-101	800 x 800	81.3
DAL	NWPU VHR-10	HBB	ResNet-101	800 x 800	88.3
DAL	VOC 2007	HBB	ResNet-101	800 x 800	76.1

Detections

Citation

If you find our work or code useful in your research, please consider citing:

@article{ming2020dynamic,
  title={Dynamic Anchor Learning for Arbitrary-Oriented Object Detection},
  author={Ming, Qi and Zhou, Zhiqiang and Miao, Lingjuan and Zhang, Hongwei and Li, Linhao},
  journal={arXiv preprint arXiv:2012.04150},
  year={2020}
}

If you have any questions, please contact me via issue or email.

Official implementation of "Dynamic Anchor Learning for Arbitrary-Oriented Object Detection" (AAAI2021).

Related tags

Overview

DAL

Abstract

Getting Started

Installation

Inference

Train

Evaluation

Main Results

Detections

Citation

Owner

ming71

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

WatermarkRemoval-WDNet-WACV2021

An SMPC companion library for Syft

Post-Training Quantization for Vision transformers.

This repository contains code released by Google Research.

So-ViT: Mind Visual Tokens for Vision Transformer

Code to reproduce the experiments from our NeurIPS 2021 paper " The Limitations of Large Width in Neural Networks: A Deep Gaussian Process Perspective"

Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

Code for the TIP 2021 Paper "Salient Object Detection with Purificatory Mechanism and Structural Similarity Loss"

PyTorch implementation of MSBG hearing loss model and MBSTOI intelligibility metric

Code for Active Learning at The ImageNet Scale.

This repository contains implementations of all Machine Learning Algorithms from scratch in Python. Mathematics required for ML and many projects have also been included.

Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation

Prediction of MBA refinance Index (Mortgage prepayment)

A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations.

Official Implementation (PyTorch) of "Point Cloud Augmentation with Weighted Local Transformations", ICCV 2021

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

Tackling the Class Imbalance Problem of Deep Learning Based Head and Neck Organ Segmentation

OpenL3: Open-source deep audio and image embeddings