Autoencoder - Reducing the Dimensionality of Data with Neural Networks

Last update: Nov 17, 2022

Overview

autoencoder

An implementation of the paper "Reducing the Dimensionality of Data with Neural Networks" by G. E. Hinton and R. R. Salakhutdinov.

Notes

The aim is to minimise the squared reconstruction error using backpropagation.

If the neuron activation functions are linear, the network learns a compact encoding in the hidden units that is a linear function of the data. It minimises the same reconstruction error as PCA, but probably less efficiently. The M hidden units span the same space as the first M principal components, but they may not be orthogonal.
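The PCA relationship can be checked numerically. The sketch below is not taken from the repository: it trains a small linear autoencoder with plain gradient descent on hypothetical toy data (the data, sizes, learning rate and step count are all assumptions) and compares the subspace spanned by the decoder weights with the top M principal components.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: 200 points in 5-D, most variance along two axes.
X = rng.normal(size=(200, 5)) * np.array([3.0, 2.0, 0.5, 0.4, 0.3])
X -= X.mean(axis=0)  # centre the data, as PCA does
n, d = X.shape
M = 2  # number of hidden (code) units

# Linear autoencoder: code = X @ W_enc, reconstruction = code @ W_dec.
W_enc = 0.1 * rng.normal(size=(d, M))
W_dec = 0.1 * rng.normal(size=(M, d))

lr = 0.01  # assumed step size, chosen for this toy problem only
for _ in range(10000):
    code = X @ W_enc
    err = code @ W_dec - X                # reconstruction residual
    grad_dec = code.T @ err / n           # gradient of half the mean squared error
    grad_enc = X.T @ (err @ W_dec.T) / n
    W_enc -= lr * grad_enc
    W_dec -= lr * grad_dec

# PCA basis: top-M right singular vectors of the centred data.
_, _, Vt = np.linalg.svd(X, full_matrices=False)
pca_basis = Vt[:M]

def projector(rows):
    """Orthogonal projector onto the space spanned by the given rows."""
    q, _ = np.linalg.qr(rows.T)
    return q @ q.T

# Near zero when both sets of vectors span the same subspace.
subspace_gap = np.linalg.norm(projector(W_dec) - projector(pca_basis))
# The decoder rows themselves need not be orthonormal; inspect their Gram matrix.
gram = W_dec @ W_dec.T
print(subspace_gap)
print(gram)
```

Comparing projectors rather than the weight vectors themselves is deliberate: the hidden units are only determined up to an invertible linear mixing, so only the spanned subspace is expected to agree with PCA.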

If the activation functions are non-linear, the network can represent data that lies on a non-linear manifold, which makes it more powerful than PCA.

Deep Autoencoders

Learning time is linear (or better) in the number of training cases.

MNIST Digits

Network

Encoding: 784 (pixels) -> 1000 -> 500 -> 250 -> 30 linear units [central code layer]

Decoding: 30 linear units -> 250 -> 500 -> 1000 -> 784 pixels [reconstruction]

The encoder is first pre-trained as a stack of RBMs, ending at the 30 hidden units of the code layer. The transposes of those weights are used to initialise the decoder. The whole network is then fine-tuned with backpropagation on a cross-entropy error.

MNIST pixel intensities are continuous-valued in [0, 1] and very non-Gaussian. The RBM units are pre-trained as follows:

First layer (784 <-> 1000 units):

hidden units are binary (Bernoulli, with activation probability given by the logistic function).
visible units are linear with Gaussian noise.

Second layer (1000 <-> 500 units) and third layer (500 <-> 250):

hidden units are binary (Bernoulli, with the probability of being 1 given by the logistic function).
visible units are continuous (the activation probabilities of the hidden units in the preceding layer).
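The notes do not say how each RBM is trained. As an illustration only, the sketch below assumes one step of contrastive divergence (CD-1) for a middle-layer RBM, with real-valued visible probabilities and binary hidden units. The function name `cd1_update`, the learning rate and the random stand-in data are hypothetical.

```python
import numpy as np

def logistic(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_update(v0, W, b_vis, b_hid, rng, lr=0.1):
    """One assumed CD-1 step; updates W, b_vis and b_hid in place."""
    # Positive phase: hidden probabilities, then a Bernoulli sample.
    p_h0 = logistic(v0 @ W + b_hid)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(v0.dtype)
    # Negative phase: reconstruct the visible probabilities from the sampled
    # hidden states, then recompute the hidden probabilities.
    p_v1 = logistic(h0 @ W.T + b_vis)
    p_h1 = logistic(p_v1 @ W + b_hid)
    n = v0.shape[0]
    W += lr * (v0.T @ p_h0 - p_v1.T @ p_h1) / n
    b_vis += lr * (v0 - p_v1).mean(axis=0)
    b_hid += lr * (p_h0 - p_h1).mean(axis=0)
    # The hidden probabilities serve as visible data for the next RBM.
    return p_h0

rng = np.random.default_rng(0)
v = rng.random((16, 1000))             # stand-in for first-layer hidden probabilities
W = 0.01 * rng.normal(size=(1000, 500))
W_before = W.copy()
b_vis, b_hid = np.zeros(1000), np.zeros(500)
next_visible = cd1_update(v, W, b_vis, b_hid, rng)
print(next_visible.shape)
```

Returning the hidden probabilities rather than the binary samples mirrors the note above: the next RBM in the stack sees continuous activation probabilities as its visible data.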

Top layer (250 <-> 30):

hidden units have stochastic real-valued states drawn from a unit-variance Gaussian whose mean is determined by the input from that RBM's logistic visible units.
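Sampling the code units as described can be sketched as follows. The function name `sample_code_units`, the weight scale and the random inputs are hypothetical and stand in for a trained RBM.

```python
import numpy as np

def sample_code_units(p_vis, W, b_hid, rng):
    """Draw real-valued hidden states from a unit-variance Gaussian.

    The mean is the total input from the logistic visible units.
    """
    mean = p_vis @ W + b_hid
    sample = mean + rng.standard_normal(mean.shape)
    return sample, mean

rng = np.random.default_rng(0)
p_vis = rng.random((8, 250))            # stand-in visible probabilities in [0, 1)
W = 0.01 * rng.normal(size=(250, 30))
b_hid = np.zeros(30)
codes, means = sample_code_units(p_vis, W, b_hid, rng)
print(codes.shape)
```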

For fine-tuning, the model is "unfolded" and the stochastic activities are replaced by deterministic real-valued probabilities. The top (code) layer is linear; every other layer is logistic.
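A minimal sketch of the unfolded network follows, assuming the pre-trained RBM weights are available as a list of matrices. Biases are omitted for brevity, random matrices stand in for pre-trained weights, and the helper names `unfold`, `forward` and `cross_entropy` are hypothetical rather than taken from the repository.

```python
import numpy as np

def logistic(x):
    return 1.0 / (1.0 + np.exp(-x))

def unfold(rbm_weights):
    """Copy the stack into an encoder, plus a decoder made of the transposes."""
    encoder = [W.copy() for W in rbm_weights]
    decoder = [W.T.copy() for W in reversed(rbm_weights)]
    return encoder, decoder

def forward(x, encoder, decoder):
    """Deterministic pass: logistic layers everywhere except the linear code."""
    h = x
    for W in encoder[:-1]:
        h = logistic(h @ W)
    code = h @ encoder[-1]        # linear code layer
    h = code
    for W in decoder:
        h = logistic(h @ W)       # decoder layers, ending in pixel probabilities
    return code, h

def cross_entropy(x, recon, eps=1e-7):
    """Mean over the batch of the summed per-pixel cross-entropy."""
    recon = np.clip(recon, eps, 1.0 - eps)
    per_pixel = x * np.log(recon) + (1.0 - x) * np.log(1.0 - recon)
    return -np.mean(np.sum(per_pixel, axis=1))

rng = np.random.default_rng(0)
sizes = [784, 1000, 500, 250, 30]
# Random stand-ins for the four pre-trained RBM weight matrices.
rbm_weights = [0.01 * rng.normal(size=(a, b)) for a, b in zip(sizes[:-1], sizes[1:])]
encoder, decoder = unfold(rbm_weights)

x = rng.random((4, 784))          # stand-in for four images with pixels in [0, 1)
code, recon = forward(x, encoder, decoder)
loss = cross_entropy(x, recon)
print(code.shape, recon.shape, loss)
```

Copying the transposed weights, rather than sharing them, lets the encoder and decoder diverge during fine-tuning. Whether the repository ties or unties them is not stated in the notes.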

Owner

Jordan Burgess
