Multispectral Object Detection with YOLOv5

Last update: Jan 01, 2023

Overview

Multispectral-Object-Detection

Intro

Official Code for Cross-Modality Fusion Transformer for Multispectral Object Detection.

Multispectral Object Detection with Transformer and YOLOv5

Citation

If you use this repo for your research, please cite our paper:

@article{fang2021cross,
  title={Cross-Modality Fusion Transformer for Multispectral Object Detection},
  author={Fang Qingyun and Han Dapeng and Wang Zhaokui},
  journal={arXiv preprint arXiv:2111.00273},
  year={2021}
}

Installation

Python>=3.6.0 is required, with all packages in requirements.txt installed, including PyTorch>=1.7 (the same requirements as YOLOv5: https://github.com/ultralytics/yolov5).

Clone the repo

git clone https://github.com/DocF/multispectral-object-detection

Install requirements

$ cd multispectral-object-detection
$ pip install -r requirements.txt
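
Once the requirements are installed, the version constraints stated above (Python>=3.6.0, PyTorch>=1.7) can be checked with a short script. The following is only a sketch: the names `parse_version` and `check_environment` are hypothetical helpers and are not part of this repo.

```python
import re
import sys
from importlib import util

# Minimum versions taken from the Installation section of this README.
MIN_PYTHON = (3, 6, 0)
MIN_TORCH = (1, 7)


def parse_version(text):
    """Turn a version string such as '1.13.1+cu117' into (1, 13, 1)."""
    parts = []
    # Drop any local build suffix after '+', then read leading digits per field.
    for piece in str(text).split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def check_environment():
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if sys.version_info[:3] < MIN_PYTHON:
        problems.append("Python %d.%d.%d or newer is required" % MIN_PYTHON)
    if util.find_spec("torch") is None:
        problems.append("PyTorch is not installed")
    else:
        import torch  # imported lazily so the check works without PyTorch

        if parse_version(torch.__version__) < MIN_TORCH:
            problems.append("PyTorch %d.%d or newer is required" % MIN_TORCH)
    return problems


if __name__ == "__main__":
    issues = check_environment()
    for issue in issues:
        print("Problem:", issue)
    if not issues:
        print("Environment looks compatible.")
```

The script only reports problems and does not install or change anything.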

Dataset

- [FLIR] download (a new aligned version)

- [LLVIP] download

- [VEDAI] download

Run

Download the pretrained weights

YOLOv5 weights:

CFT weights:

Add the result directories

Create three directories, runs/train, runs/test and runs/detect, to save the results.
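
The three result directories can also be created from Python. This is a minimal sketch that assumes it is run from the repository root; `create_result_dirs` is a hypothetical helper name, not a function in this repo.

```python
from pathlib import Path

# Directory names taken from the step above.
RESULT_DIRS = ("runs/train", "runs/test", "runs/detect")


def create_result_dirs(repo_root="."):
    """Create the result directories under repo_root and return their paths."""
    created = []
    for relative in RESULT_DIRS:
        path = Path(repo_root) / relative
        # parents=True creates 'runs' as needed; exist_ok=True makes reruns safe.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created


if __name__ == "__main__":
    for directory in create_result_dirs():
        print("ready:", directory)
```

Running it a second time is harmless, because existing directories are left untouched.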

Change the data config

Some example configs are in data/multispectral/.

Train, Test and Detect

train: python train.py

test: python test.py

detect: python detect_twostream.py

Results

Dataset	CFT	mAP50	mAP75	mAP
FLIR		73.0	32.0	37.4
FLIR	✔️	77.7 (Δ4.7)	34.8 (Δ2.8)	40.0 (Δ2.6)
LLVIP		95.8	71.4	62.3
LLVIP	✔️	97.5 (Δ1.7)	72.9 (Δ1.5)	63.6 (Δ1.3)
VEDAI		79.7	47.7	46.8
VEDAI	✔️	85.3 (Δ5.6)	65.9 (Δ18.2)	56.0 (Δ9.2)
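
Each Δ value in the table is the CFT score minus the baseline score for the same dataset and metric. The sketch below recomputes the deltas from the numbers in the table; the `RESULTS` dictionary and the `deltas` function are illustrative names introduced here, not part of the repo.

```python
# (mAP50, mAP75, mAP) copied from the results table, without and with CFT.
RESULTS = {
    "FLIR": {"baseline": (73.0, 32.0, 37.4), "cft": (77.7, 34.8, 40.0)},
    "LLVIP": {"baseline": (95.8, 71.4, 62.3), "cft": (97.5, 72.9, 63.6)},
    "VEDAI": {"baseline": (79.7, 47.7, 46.8), "cft": (85.3, 65.9, 56.0)},
}


def deltas(baseline, cft):
    """Return the per-metric improvement, rounded to one decimal place."""
    return tuple(round(c - b, 1) for b, c in zip(baseline, cft))


if __name__ == "__main__":
    for dataset, scores in RESULTS.items():
        d50, d75, d_map = deltas(scores["baseline"], scores["cft"])
        print(f"{dataset}: mAP50 +{d50}, mAP75 +{d75}, mAP +{d_map}")
```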


Owner

Richard Fang
