The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Last update: Dec 04, 2022

Related tags

Overview

Feedback Convolutional Neural Network for Visual Localization and Segmentation

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation. The code is written in PyTorch, very simple to understand.

There is also a Caffe implementation, please check it if you use Caffe and Matlab.

Requirement:

Python 3
Pytorch 0.4.0

How to run:

open the ipython notebooks with jupyter notebook

then open vgg_fr.ipynb or vgg_fsp.ipynb, these are the two main files for demonstrate feedback idea.

How it looks:

If you run vgg_fsp.ipynb without modification of code, you are supposed to see below visualization:

Input image:

Image gradient with respect to the target label:

Image gradient with respect to the target label after 4 iterations of feedback selective pruning (FSP):

Files explanation:

vgg_fr.ipynb: the main file that defines the vgg feedback network with the feedback recovering mechanism and run a feedback visualization on examplar images.
vgg_fsp.ipynb: the main file that defines the vgg feedback network with the feedback selective pruning mechanism and run a feedback visualization on examplar images.
images: storing exmaplar images
imagenet1000_clsid_to_human.txt: storing image net 1000 class names, for visualization and understanding purpose
test/simple_test.ipynb: unit test for a simple feedback network, using a simple fully connected structure
test/vgg_test.ipynb: unit test for the loading of a pretrained vgg network, then check the weights copying from pretrained network to a new defined network interface

Citation

Please consider citing in your publications if it helps your research:

@inproceedings{cao2015look,
  title={Look and think twice: Capturing top-down visual attention with feedback convolutional neural networks},
  author={Cao, Chunshui and Liu, Xianming and Yang, Yi and Yu, Yinan and Wang, Jiang and Wang, Zilei and Huang, Yongzhen and Wang, Liang and Huang, Chang and Xu, Wei and others},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={2956--2964},
  year={2015}
}

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Related tags

Overview

Feedback Convolutional Neural Network for Visual Localization and Segmentation

Requirement:

How to run:

How it looks:

Files explanation:

Citation

Owner

VGGFace2-HQ - A high resolution face dataset for face editing purpose

The official implementation of EIGNN: Efficient Infinite-Depth Graph Neural Networks (NeurIPS 2021)

Code for the paper Open Sesame: Getting Inside BERT's Linguistic Knowledge.

PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".

This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.

Distributing reference energies for SMIRNOFF implementations

Keyword2Text This repository contains the code of the paper: "A Plug-and-Play Method for Controlled Text Generation"

Automatically Build Multiple ML Models with a Single Line of Code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

3D ResNets for Action Recognition (CVPR 2018)

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Efficient neural networks for analog audio effect modeling

9th place solution

Multiple custom object count and detection using YOLOv3-Tiny method

This package implements THOR: Transformer with Stochastic Experts.

An implementation of a discriminant function over a normal distribution to help classify datasets.

Out-of-boundary View Synthesis towards Full-frame Video Stabilization

这是一个mobilenet-yolov4-lite的库，把yolov4主干网络修改成了mobilenet，修改了Panet的卷积组成，使参数量大幅度缩小。

HiPAL: A Deep Framework for Physician Burnout Prediction Using Activity Logs in Electronic Health Records