Training Structured Neural Networks Through Manifold Identification and Variance Reduction

Last update: Dec 23, 2021

Related tags

Overview

Training Structured Neural Networks Through Manifold Identification and Variance Reduction

This repository is a pytorch implementation of the Regularized Modernized Dual Averaging (RMDA) algorithm for training structred neural network models. Details of the algorithm can be found in the following paper:

Zih-Syuan Huang, Ching-pei Lee, Training Structured Neural Networks Through Manifold Identification and Variance Reduction[arXiv]

When provided with a regularizer and the corresponding proximal operator, this algorithm trains a neural network model that conforms the structure induced by the regularizer. In this repository, we include the proximal operator of the L1 norm and the group-LASSO norm as illustrating examples, but users can replace them with any other proximal operators.

This repository contains:

Regularized modernized dual averaging (RMDA) algorithm.
Scheduler for learning rate, momentum scheduling and restart.
Proximal operators for the group-LASSO and L1 norms.
Training file. An exemplary wrapper for using our algorithm to train a structured neural network.

Getting started

To compile the code, you will need to install torch and torchvision.

Examples

Logistic Regression on MNIST

To run an experiment of logistic regression on MNIST, run:

python LogisticRegression_on_MNIST.py

in the Experiments directory.

Training Structured Neural Networks Through Manifold Identification and Variance Reduction

Related tags

Overview

Training Structured Neural Networks Through Manifold Identification and Variance Reduction

Getting started

Examples

Logistic Regression on MNIST

Owner

SpiroMask: Measuring Lung Function Using Consumer-Grade Masks

AdelaiDepth is an open source toolbox for monocular depth prediction.

We provided a matlab implementation for an evolutionary multitasking AUC optimization framework (EMTAUC).

Developing your First ML Workflow of the AWS Machine Learning Engineer Nanodegree Program

Subdivision-based Mesh Convolutional Networks

Chinese Advertisement Board Identification(Pytorch)

tsflex - feature-extraction benchmarking

Code for the paper Learning the Predictability of the Future

This repository contains the source codes for the paper AtlasNet V2 - Learning Elementary Structures.

U-Net for GBM

On Effective Scheduling of Model-based Reinforcement Learning

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

Official Pytorch implementation of Meta Internal Learning

Semi-supervised Domain Adaptation via Minimax Entropy

This repository contains a PyTorch implementation of the paper Learning to Assimilate in Chaotic Dynamical Systems.

A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement

SARS-Cov-2 Recombinant Finder for fasta sequences

An Object Oriented Programming (OOP) interface for Ontology Web language (OWL) ontologies.

You Only 👀 One Sequence