Multi-Glimpse Network With Python

Last update: May 10, 2022

Overview

Multi-Glimpse Network

Our code requires Python ≥ 3.8.

Installation

For example, venv + pip:

$ python3 -m venv env
$ source env/bin/activate
(env) $ python3 -m pip install -r requirements.txt

Evaluation

Accuracy on clean images

Create ImageNet100 from ImageNet (using symbolic links).

$ python3 tools/create_imagenet100.py tools/imagenet100.txt \
    /path/to/ImageNet /path/to/ImageNet100
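
The subset step can be pictured with a minimal sketch like the one below. It is not the contents of tools/create_imagenet100.py. It assumes that the class list holds one class folder name per line and that the dataset is laid out as <root>/<split>/<class>/; the function name link_class_subset is hypothetical.

```python
import os
import tempfile
from pathlib import Path


def link_class_subset(class_list, src_root, dst_root, splits=("train", "val")):
    """Symlink the class folders named in class_list from src_root into dst_root.

    Assumed layout (not confirmed by the repository): <root>/<split>/<class>/.
    Returns the list of links that were created.
    """
    lines = Path(class_list).read_text().splitlines()
    wanted = [line.strip() for line in lines if line.strip()]
    linked = []
    for split in splits:
        out_dir = Path(dst_root) / split
        out_dir.mkdir(parents=True, exist_ok=True)
        for name in wanted:
            src = (Path(src_root) / split / name).resolve()
            dst = out_dir / name
            # Skip classes missing from the source and links that already exist.
            if src.is_dir() and not dst.is_symlink() and not dst.exists():
                os.symlink(src, dst, target_is_directory=True)
                linked.append(dst)
    return linked


if __name__ == "__main__":
    # Demo on a throwaway directory tree instead of a real ImageNet copy.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        for split in ("train", "val"):
            for name in ("class_a", "class_b", "class_c"):
                (root / "full" / split / name).mkdir(parents=True)
        class_file = root / "classes.txt"
        class_file.write_text("class_a\nclass_c\n")
        for link in link_class_subset(class_file, root / "full", root / "subset"):
            print(link.relative_to(root))
```

Because only links are created, the subset takes almost no extra disk space, but it stops working if the original ImageNet folder is moved.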

Download checkpoints from Google Drive.
Test accuracy.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Add the --flop_count flag to report the approximate number of FLOPs needed for inference on a single image (counted with fvcore).
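
For a rough sanity check of the reported number, the back-of-the-envelope sketch below may help. It assumes that --scale is the factor by which the side length of each glimpse is reduced and that the cost of a convolutional backbone grows roughly with the number of input pixels. Both are assumptions, and the function name relative_cost is hypothetical; the measured count from --flop_count is the number to trust.

```python
def relative_cost(n_iter: int, scale: float) -> float:
    """Approximate cost of n_iter downscaled glimpses relative to one full-size pass.

    Assumptions (not taken from the repository):
      * `scale` divides the side length of each glimpse, so pixels shrink by scale**2;
      * backbone FLOPs are proportional to the pixel count;
      * any overhead outside the backbone is ignored.
    """
    if n_iter < 1 or scale <= 0:
        raise ValueError("n_iter must be >= 1 and scale must be positive")
    return n_iter / (scale ** 2)


if __name__ == "__main__":
    for label, n_iter, scale in (("baseline", 1, 1.0), ("ours", 4, 2.33)):
        print(f"{label}: {relative_cost(n_iter, scale):.3f}x a full-resolution pass")
```

Under these assumptions, four glimpses at scale 2.33 cost about 0.74 times one full-resolution pass.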

Accuracy on adversarial attacks (PGD)

Test adversarial accuracy.

# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_ours --alpha 0.6 --s 0.02
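
As a reminder of what a PGD attack does, here is a minimal dependency-free sketch against a toy logistic model. It is not the attack code in main.py. The values of eps and step_size are illustrative, the random start often used with PGD is omitted, and steps=10 mirrors --step_k 10 only on the assumption that the flag sets the number of attack steps.

```python
import math


def logit(w, b, x):
    """Linear score w.x + b of the toy model."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def loss_grad_wrt_input(w, b, x, y):
    """Gradient of binary cross-entropy w.r.t. x for p = sigmoid(w.x + b)."""
    p = 1.0 / (1.0 + math.exp(-logit(w, b, x)))
    return [(p - y) * wi for wi in w]


def pgd_attack(w, b, x, y, eps=8 / 255, step_size=2 / 255, steps=10):
    """L-infinity PGD: signed-gradient ascent on the loss, projected after every step.

    eps, step_size and steps are illustrative values, not the repository's settings.
    """
    x_adv = list(x)
    for _ in range(steps):
        grad = loss_grad_wrt_input(w, b, x_adv, y)
        for i, g in enumerate(grad):
            sign = (g > 0) - (g < 0)
            stepped = x_adv[i] + step_size * sign
            # Project back into the eps-ball around the clean input ...
            stepped = min(max(stepped, x[i] - eps), x[i] + eps)
            # ... and into the valid pixel range [0, 1].
            x_adv[i] = min(max(stepped, 0.0), 1.0)
    return x_adv


if __name__ == "__main__":
    w, b = [2.0, -1.0, 0.5], -0.3      # toy model parameters
    x, y = [0.6, 0.2, 0.5], 1          # clean input with true label 1
    x_adv = pgd_attack(w, b, x, y)
    print("clean logit:      ", logit(w, b, x))
    print("adversarial logit:", logit(w, b, x_adv))
```

The adversarial input stays within eps of the clean input in every coordinate, yet the score for the true class drops.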

Accuracy on common corruptions

Create ImageNet100-C from ImageNet-C (using symbolic links).

$ python3 tools/create_imagenet100c.py tools/imagenet100.txt \
    /path/to/ImageNet-C/ /path/to/ImageNet100-C/

Test for a single corruption.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100-C/pixelate/5 \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Use the helper scripts to test all corruptions and collect the results.

# Edit tools/eval_imagenet100c.py as needed, then run it to generate the evaluation script
$ python3 tools/eval_imagenet100c.py /path/to/ImageNet100-C/ > run.sh
# Evaluate
$ bash run.sh
# Collect results
$ python3 tools/collect_imagenet100c.py
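
The idea behind the generated run.sh can be sketched as follows. This is not the contents of tools/eval_imagenet100c.py, which may work differently. It assumes the <corruption>/<severity>/ layout seen in the single-corruption example above, reuses only flags that appear in this README, and the function name corruption_commands is hypothetical.

```python
import shlex
import tempfile
from pathlib import Path


def corruption_commands(root, train_dir, model_args, severities=range(1, 6)):
    """Build one evaluation command per <corruption>/<severity> folder under root.

    Assumed layout (inferred from the example path pixelate/5): <root>/<corruption>/<severity>/.
    """
    commands = []
    for corruption in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        for level in severities:
            val_dir = corruption / str(level)
            if not val_dir.is_dir():
                continue  # severity level not present for this corruption
            commands.append(
                "python3 main.py --train_dir {} --val_dir {} "
                "--dataset imagenet --num_class 100 --test {}".format(
                    shlex.quote(str(train_dir)),
                    shlex.quote(str(val_dir)),
                    model_args,
                )
            )
    return commands


if __name__ == "__main__":
    # Demo on a throwaway tree that mimics two corruptions.
    with tempfile.TemporaryDirectory() as tmp:
        for name, level in (("pixelate", 5), ("fog", 1), ("fog", 5)):
            (Path(tmp) / name / str(level)).mkdir(parents=True)
        args = ("--n_iter 4 --scale 2.33 --model resnet18 "
                "--checkpoint resnet18_ours --alpha 0.6 --s 0.02")
        for cmd in corruption_commands(tmp, "/path/to/ImageNet100/train", args):
            print(cmd)
```

Redirecting the printed lines to a file gives a shell script in the spirit of run.sh.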

Training

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --epochs 400 --n_iter 1 --scale 1.0 \
    --model resnet18 --gpu 0,1,2,3
# Ours
$ python3 main.py $dataset --epochs 400 --n_iter 4 --scale 2.33 \
    --model resnet18 --alpha 0.6 --s 0.02  --gpu 0,1,2,3

Check TensorBoard for the training logs. When training on multiple GPUs, logged values other than the validation accuracy may be scaled by the number of GPUs.

$ tensorboard --logdir=logs

Note that we left our exploratory code in the repository for further study, e.g., self-supervised spatial guidance and the dynamic gradient re-scaling operation.
