An official implementation of the Anchor DETR.

Last update: Dec 28, 2022

Related tags

Overview

Anchor DETR: Query Design for Transformer-Based Detector

Introduction

This repository is an official implementation of the Anchor DETR. We encode the anchor points as the object queries in DETR. Multiple patterns are attached to each anchor point to solve the difficulty: "one region, multiple objects". We also propose an attention variant RCDA to reduce the memory cost for high-resolution features.

Main Results

	feature	epochs	AP	GFLOPs	Infer Speed (FPS)
DETR	DC5	500	43.3	187	10 (12)
SMCA	multi-level	50	43.7	152	10
Deformable DETR	multi-level	50	43.8	173	15
Conditional DETR	DC5	50	43.8	195	10
Anchor DETR	DC5	50	44.3	151	16 (19)

Note:

The results are based on ResNet-50 backbone.
Inference speeds are measured on NVIDIA Tesla V100 GPU.
The values in parentheses of the Infer Speed indicate the speed with torchscript optimization.

Model

name	backbone	AP	URL
AnchorDETR-C5	R50	42.1	model / log
AnchorDETR-DC5	R50	44.3	model / log
AnchorDETR-C5	R101	43.5	model / log
AnchorDETR-DC5	R101	45.1	model / log

Note: the models and logs are also available at Baidu Netdisk with code hh13.

Usage

Installation

First, clone the repository locally:

git clone https://github.com/megvii-research/AnchorDETR.git

Then, install dependencies:

pip install -r requirements.txt

Training

To train AnchorDETR on a single node with 8 GPUs:

python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py  --coco_path /path/to/coco

Evaluation

To evaluate AnchorDETR on a single node with 8 GPUs:

python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --eval --coco_path /path/to/coco --resume /path/to/checkpoint.pth

To evaluate AnchorDETR with a single GPU:

python main.py --eval --coco_path /path/to/coco --resume /path/to/checkpoint.pth

Citation

If you find this project useful for your research, please consider citing the paper.

@misc{wang2021anchor,
      title={Anchor DETR: Query Design for Transformer-Based Detector},
      author={Yingming Wang and Xiangyu Zhang and Tong Yang and Jian Sun},
      year={2021},
      eprint={2109.07107},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contact

If you have any questions, feel free to open an issue or contact us at [email protected].

An official implementation of the Anchor DETR.

Related tags

Overview

Anchor DETR: Query Design for Transformer-Based Detector

Introduction

Main Results

Model

Usage

Installation

Training

Evaluation

Citation

Contact

Owner

MEGVII Research

GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles

EqGAN - Improving GAN Equilibrium by Raising Spatial Awareness

PyElecCL - Electron Monte Carlo Second Checks

Training data extraction on GPT-2

Confidence Propagation Cluster aims to replace NMS-based methods as a better box fusion framework in 2D/3D Object detection

TeST: Temporal-Stable Thresholding for Semi-supervised Learning

This respository includes implementations on Manifoldron: Direct Space Partition via Manifold Discovery

Council-GAN - Implementation for our paper Breaking the Cycle - Colleagues are all you need (CVPR 2020)

A simple API wrapper for Discord interactions.

Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

K-Nearest Neighbor in Pytorch

Pytorch Implementation for CVPR2018 Paper: Learning to Compare: Relation Network for Few-Shot Learning

The object detection pipeline is based on Ultralytics YOLOv5

details on efforts to dump the Watermelon Games Paprium cart

This is the code of NeurIPS'21 paper "Towards Enabling Meta-Learning from Target Models".

Edge-aware Guidance Fusion Network for RGB-Thermal Scene Parsing

Controlling Hill Climb Racing with Hand Tacking

DC3: A Learning Method for Optimization with Hard Constraints

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

Least Square Calibration for Peer Reviews