You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

Last update: Jan 03, 2023

Related tags

Deep Learning YOLOF

Overview

You Only Look One-level Feature (YOLOF), CVPR2021

A simple, fast, and efficient object detector without FPN.

This repo provides a neat implementation for YOLOF based on Detectron2. A cvpods version can be found in https://github.com/megvii-model/YOLOF.

You Only Look One-level Feature,
Qiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun

Getting Started

Our project is developed on detectron2. Please follow the official detectron2 installation.

Install mish-cuda to speed up the training and inference when using CSPDarkNet-53 as the backbone (optional)

git clone https://github.com/thomasbrandon/mish-cuda
cd mish-cuda
python setup.py build install
cd ..

Install YOLOF by:
```
python setup.py develop
```
Then link your dataset path to datasets
```
cd datasets/
ln -s /path/to/coco coco
```
Download the pretrained model in OneDrive or in the Baidu Cloud with code qr6o to train with the CSPDarkNet-53 backbone (optional)
```
mkdir pretrained_models
# download the `cspdarknet53.pth` to the `pretrained_models` directory
```

Train with yolof

python ./tools/train_net.py --num-gpus 8 --config-file ./configs/yolof_R_50_C5_1x.yaml

Test with yolof

python ./tools/train_net.py --num-gpus 8 --config-file ./configs/yolof_R_50_C5_1x.yaml --eval-only MODEL.WEIGHTS /path/to/checkpoint_file

Note that there might be API changes in future detectron2 releases that make the code incompatible.

Main results

The models listed below can be found in this onedrive link or in the BaiduCloud link with code qr6o. The FPS is tested on a 2080Ti GPU. More models will be available in the near future.

Model	COCO val mAP	FPS
YOLOF_R_50_C5_1x	37.7	36
YOLOF_R_50_DC5_1x	39.2	23
YOLOF_R_101_C5_1x	39.8	23
YOLOF_R_101_DC5_1x	40.5	17
YOLOF_CSP_D_53_DC5_3x	41.2	41

Note that, the speed reported in this repo is 2~3 FPS faster than the one reported in the cvpods version.

Citation

If you find this project useful for your research, please use the following BibTeX entry.

@inproceedings{chen2021you,
  title={You Only Look One-level Feature},
  author={Chen, Qiang and Wang, Yingming and Yang, Tong and Zhang, Xiangyu and Cheng, Jian and Sun, Jian},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

Related tags

Overview

You Only Look One-level Feature (YOLOF), CVPR2021

Getting Started

Main results

Citation

Owner

qiang chen

An NLP library with Awesome pre-trained Transformer models and easy-to-use interface, supporting wide-range of NLP tasks from research to industrial applications.

"SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image", Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang

Image-to-image regression with uncertainty quantification in PyTorch

Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad to your characters in Modo.

ColBERT: Contextualized Late Interaction over BERT (SIGIR'20)

Implementation for NeurIPS 2021 Submission: SparseFed

Focal Loss for Dense Rotation Object Detection

This code is a toolbox that uses Torch library for training and evaluating the ERFNet architecture for semantic segmentation.

A PyTorch implementation of "Signed Graph Convolutional Network" (ICDM 2018).

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

Pytorch Lightning code guideline for conferences

Re-implememtation of MAE (Masked Autoencoders Are Scalable Vision Learners) using PyTorch.

Homepage of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.

We utilize deep reinforcement learning to obtain favorable trajectories for visual-inertial system calibration.

Create Own QR code with Python

PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

PyTorch implementation of VAGAN: Visual Feature Attribution Using Wasserstein GANs

Train Dense Passage Retriever (DPR) with a single GPU

Source Code for DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances (https://arxiv.org/pdf/2012.01775.pdf)

Unofficial implementation of Point-Unet: A Context-Aware Point-Based Neural Network for Volumetric Segmentation