Visualizing Yolov5's layers using GradCam

Last update: Jan 01, 2023

Overview

YOLO-V5 GRADCAM

I constantly desired to know to which part of an object the object-detection models pay more attention. So I searched for it, but I didn't find any for Yolov5. Here is my implementation of Grad-cam for YOLO-v5. To load the model I used the yolov5's main codes, and for computing GradCam I used the codes from the gradcam_plus_plus-pytorch repository. Please follow my GitHub account and star ⭐ the project if this functionality benefits your research or projects.

Installation

pip install -r requirements.txt

Infer

python main.py --model-path yolov5s.pt --img-path images/cat-dog.jpg --output-dir outputs

NOTE: If you don't have any weights and just want to test, don't change the model-path argument. The yolov5s model will be automatically downloaded thanks to the download function from yolov5.

NOTE: For more input arguments, check out the main.py or run the following command:

python main.py -h

Examples

Note

I checked the code, but I couldn't find an explanation for why the truck's heatmap does not show anything. Please inform me or create a pull request if you find the reason.

TO Do

Add GradCam++
Add ScoreCam
Add the functionality to the deep_utils library

References

Citation

Please cite yolov5-gradcam if it helps your research. You can use the following BibTeX entry:

@misc{deep_utils,
	title = {yolov5-gradcam},
	author = {Mohammadi Kazaj, Pooya},
	howpublished = {\url{github.com/pooya-mohammadi/yolov5-gradcam}},
	year = {2021}
}

Visualizing Yolov5's layers using GradCam

Related tags

Overview

YOLO-V5 GRADCAM

Installation

Infer

Examples

Note

TO Do

References

Citation

Owner

Pooya Mohammadi Kazaj

For visualizing the dair-v2x-i dataset

Inteligência artificial criada para realizar interação social com idosos.

공공장소에서 눈만 돌리면 CCTV가 보인다는 말이 과언이 아닐 정도로 CCTV가 우리 생활에 깊숙이 자리 잡았습니다.

Segmentation Training Pipeline

PyTorch implementation of MSBG hearing loss model and MBSTOI intelligibility metric

pytorch implementation of dftd2 & dftd3

Score refinement for confidence-based 3D multi-object tracking

FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

Multi agent DDPG algorithm written in Python + Pytorch

Simple transformer model for CIFAR10

Codes for CVPR2021 paper "PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding Mask Optimization"

Code, pre-trained models and saliency results for the paper "Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images".

Automatic packaging of the open-composite libs for OvGME

Learning to Identify Top Elo Ratings with A Dueling Bandits Approach

PyStan, a Python interface to Stan, a platform for statistical modeling. Documentation: https://pystan.readthedocs.io

Complete system for facial identity system. Include one-shot model, database operation, features visualization, monitoring

A TikTok-like recommender system for GitHub repositories based on Gorse

Implementation for Stankevičiūtė et al. "Conformal time-series forecasting", NeurIPS 2021.

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

A quick recipe to learn all about Transformers