Learning recognition/segmentation models without end-to-end training. 40%-60% smaller GPU memory footprint. Same training time. Better performance.

Last update: Dec 27, 2022

InfoPro-Pytorch

The Information Propagation algorithm for training deep networks with local supervision.

(ICLR 2021) Revisiting Locally Supervised Learning: an Alternative to End-to-end Training

Update on 2021/01/25: Released pre-trained models on ImageNet and Cityscapes.

Update on 2021/01/24: Released code for image classification on CIFAR/SVHN/STL10/ImageNet and semantic segmentation on Cityscapes.

Introduction

We propose Information Propagation (InfoPro), a locally supervised deep learning algorithm motivated by an information-theoretic perspective. We split the whole deep network into multiple local modules and train each of them with a local InfoPro loss. This reduces the GPU memory footprint by 40-60% without notable extra computational cost or training time, while moderately improving performance.
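The module-wise training described above can be sketched in PyTorch as follows. This is a minimal illustration, not the code in this repository: `LocalModule`, `local_train_step`, and `build` are hypothetical names, the layer sizes are arbitrary, and a plain cross-entropy loss on an auxiliary classifier stands in for the local InfoPro loss, which is not reproduced here. It also assumes that no gradient flows between modules, which is what lets each module's activations be freed before the next module runs.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class LocalModule(nn.Module):
    """One local stage: a feature block plus an auxiliary classifier head (hypothetical)."""

    def __init__(self, in_ch, out_ch, num_classes):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(),
        )
        self.aux_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(out_ch, num_classes),
        )

    def forward(self, x):
        h = self.block(x)
        return h, self.aux_head(h)


def local_train_step(modules, optimizers, x, y):
    """Update every module from its own local loss and return the loss values."""
    losses = []
    h = x
    for module, opt in zip(modules, optimizers):
        h, logits = module(h)
        # Placeholder local objective; NOT the InfoPro loss from the paper.
        loss = F.cross_entropy(logits, y)
        opt.zero_grad()
        loss.backward()  # backpropagates through this module only
        opt.step()
        losses.append(loss.item())
        # Cut the graph so the next module cannot send gradients back here.
        h = h.detach()
    return losses


def build(num_classes=10, widths=(3, 16, 32)):
    """Create a small stack of local modules with one optimizer per module."""
    modules = [LocalModule(i, o, num_classes) for i, o in zip(widths[:-1], widths[1:])]
    optimizers = [torch.optim.SGD(m.parameters(), lr=0.1) for m in modules]
    return modules, optimizers
```

Calling `local_train_step(*build(), x, y)` with a batch of 3-channel images `x` and integer labels `y` runs one update per module. Because each module's graph is discarded before the next one is built, only one module's activations need to be held for the backward pass at a time, which is the general source of memory savings in locally supervised training.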

Citation

If you find this work valuable or use our code in your own research, please consider citing us with the following BibTeX entry:

@inproceedings{wang2021revisiting,
        title = {Revisiting Locally Supervised Learning: an Alternative to End-to-end Training},
       author = {Yulin Wang and Zanlin Ni and Shiji Song and Le Yang and Gao Huang},
    booktitle = {International Conference on Learning Representations (ICLR)},
         year = {2021},
          url = {https://openreview.net/forum?id=fAbkE6ant2}
}

Get Started

Please see the folders Experiments on CIFAR-SVHN-STL10, Experiments on ImageNet, and Semantic segmentation for specific documentation.

Results

CIFAR & STL-10

ImageNet

Semantic Segmentation

GPU Memory Cost

In the paper, we report the minimum GPU memory required to run the InfoPro* algorithm with torch.backends.cudnn.benchmark=True (for practical acceleration). Note that this figure can differ, sometimes substantially, from what nvidia-smi prints.
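One way to see why a process's own memory figures can disagree with nvidia-smi is to query PyTorch's allocator statistics directly. The helper below is a sketch under stated assumptions: `peak_memory_mib` is a hypothetical name, and this is not necessarily how the numbers in the paper were measured. It reports the peak memory occupied by tensors and the peak memory reserved by PyTorch's caching allocator; nvidia-smi additionally counts the CUDA context and any cached blocks that are not currently in use.

```python
import torch


def peak_memory_mib(step_fn, device=None):
    """Run step_fn once and return (peak allocated, peak reserved) in MiB.

    Returns None when CUDA is unavailable. Hypothetical helper for illustration.
    """
    if not torch.cuda.is_available():
        return None
    torch.cuda.reset_peak_memory_stats(device)
    step_fn()
    torch.cuda.synchronize(device)  # wait for queued kernels before reading stats
    allocated = torch.cuda.max_memory_allocated(device) / 2**20
    reserved = torch.cuda.max_memory_reserved(device) / 2**20
    return allocated, reserved
```

Passing a function that performs one full training iteration as `step_fn` gives a per-iteration peak. The allocated value is typically the closer of the two to a "minimum required" figure, while the reserved value is closer to, but still usually below, what nvidia-smi shows.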

Contact

This repo is a re-implementation of our original code. If you have any questions, please feel free to contact the authors. Yulin Wang: [email protected].

Acknowledgments

Our semantic segmentation code is from MMSegmentation. We highly appreciate their awesome work!
