Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Last update: Dec 26, 2022

Related tags

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Deploying deep convolutional neural networks on ultra-low power systems is challenging, because the systems put a hard limit on the size of on-chip memory. To overcome this drawback, we propose a novel Raster-Scanning Network, named RaScaNet, inspired by raster-scanning in image sensors.

RaScaNet reads only a few rows of pixels at a time using a convolutional neural network and then sequentially learns the representation of the whole image using a recurrent neural network. The proposed method requires 15.9-24.3x smaller peak memory and 5.3-12.9x smaller weight memory than the state-of-the-art tiny models. The total memory usage of RaScaNet does not exceed 60 KB, in the VWW dataset with competitive accuracy.

Conference: CVPR 2021
Paper | Video | Citation

Requirements

python 3.6
torch 1.7.0
torchvision 0.8.1
pycocotools 2.0.1
numpy 0.19.0
VWW dataset

Usage

For running the model, (only support vww dataset)

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar

With early termination,

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar --early_terminate=1
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar --early_terminate=1

Currently, we do not provide the code for training.

Result

Model	Weight Memory	Peak Memory	OPs Cnt.	Accuracy
rascanet(210x240)	47.03 KB	7.92 KB	56.34 M	91.835%
rascanet(105x120)	31.77 KB	3.60 KB	9.71 M	88.100%

Citation

@InProceedings{Yoo_2021_CVPR,
    author    = {Yoo, Jaehyoung and Lee, Dongwook and Son, Changyong and Jung, Sangil and Yoo, ByungIn and Choi, Changkyu and Han, Jae-Joon and Han, Bohyung},
    title     = {RaScaNet: Learning Tiny Models by Raster-Scanning Images},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {13673-13682}
}

License

Copyright (C) 2021 Samsung Electronics Co. LTD

This software is a property of Samsung Electronics.
No part of this software, either material or conceptual may be copied or distributed, transmitted,
transcribed, stored in a retrieval system or translated into any human or computer language in any form by any means,
electronic, mechanical, manual or otherwise, or disclosed
to third parties without the express written permission of Samsung Electronics.
(Use of the Software is restricted to non-commercial, personal or academic, research purpose only)

Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Related tags

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Requirements

Usage

Result

Citation

License

Owner

SAIT (Samsung Advanced Institute of Technology)

Anomaly detection in multi-agent trajectories: Code for training, evaluation and the OpenAI highway simulation.

Research on Tabular Deep Learning (Python package & papers)

phylotorch-bito is a package providing an interface to BITO for phylotorch

Efficiently Disentangle Causal Representations

Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, ...).

(CVPR 2022 Oral) Official implementation for "Surface Representation for Point Clouds"

CoANet: Connectivity Attention Network for Road Extraction From Satellite Imagery

🐦 Quickly annotate data from the comfort of your Jupyter notebook

An educational resource to help anyone learn deep reinforcement learning.

Iterative Training: Finding Binary Weight Deep Neural Networks with Layer Binarization

A python package simulating the quasi-2D pseudospin-1/2 Gross-Pitaevskii equation with NVIDIA GPU acceleration.

A Gura parser implementation for Python

It is a simple library to speed up CLIP inference up to 3x (K80 GPU)

QueryInst: Parallelly Supervised Mask Query for Instance Segmentation

Multi-Scale Geometric Consistency Guided Multi-View Stereo

Multi-agent reinforcement learning algorithm and environment

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

Anonymous implementation of KSL

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

PyTorch version repo for CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes