Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Last update: Oct 18, 2021

Related tags

Overview

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

This repo contains official code for the NeurIPS 2021 paper Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations by Jiayao Zhang, Hua Wang, Weijie J. Su.

Discussions welcome, please submit via Discussions. You can also read the reviews on OpenReview.

@misc{zhang2021imitating,
      title={Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations}, 
      author={Jiayao Zhang and Hua Wang and Weijie J. Su},
      year={2021},
      eprint={2110.05960},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Reproducing Experiments

Dependencies

We use Python 3.8 and pytorch for training neural nets, please use pip install -r requirements.txt (potentially in a virtual environment) to install dependencies.

Datasets

We use a dataset of geometric shapes (GeoMNIST) we constructed as well as CIFAR-10. GeoMNIST is lightweighted and will be generated when simulation runs; CIFAR-10 will be downloaded from torchvision.

Code Structure

After instsalling the dependencies, one may navigate through the two Jupyter notebooks for running experiments and producing plots and figures. Below we outline the code structure.

.
├── LICENSE                         # code license
├── README.md                       # this file
├── LE-SDE Data Analysis.ipynb      # reproducing plots and figures
├── LE-SDE Experiments.ipynb        # reproducing experiments
└── src                         # source code
    ├── data_analyzer.py            # processing experiment data
    ├── datasets.py                 # generating and loading datasets
    ├── models.py                   # definition of neural net models
    ├── plotter.py                  # generating plots and figures
    └── utils.py                    # utilities, including training pipelines
└── exp_data                    # experiment data
    ├── *.csv                       # dataframes from neural net training
    └── *.npy                       # numpy.ndarray storing LE-ODE simulations

More info regarding npy files can be found in the numpy documentation.

Reproducing Figures

Experiment Data

Although all simulations can be run on your machine, it is quite time-consuming. Data from our experiments can be downloaded from the following anonymous Dropbox links:

lesde_exp_data.tar.gz (1.02GB): *.csv files for reproducing Figures 1-4.
lesde_sim_data.tar.gz (2.54GB): *.npy files for reproducing Figure 5.

After downloading those tarballs, extract them into ./exp_data (or change the EXP_DIR variable in the notebooks accordingly).

Plotter

Once experiment data are ready, simply follow LE-SDE Data Analysis.ipynb for reproducing all figures.

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Related tags

Overview

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

Reproducing Experiments

Dependencies

Datasets

Code Structure

Reproducing Figures

Experiment Data

Plotter

Owner

Jiayao Zhang

WarpRNNT loss ported in Numba CPU/CUDA for Pytorch

TransferNet: Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network

A port of muP to JAX/Haiku

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Object-Centric Learning with Slot Attention

(CVPR 2022) A minimalistic mapless end-to-end stack for joint perception, prediction, planning and control for self driving.

PyTorch implementation of our method for adversarial attacks and defenses in hyperspectral image classification.

Histocartography is a framework bringing together AI and Digital Pathology

How to use TensorLayer

Industrial knn-based anomaly detection for images. Visit streamlit link to check out the demo.

Img-process-manual - Utilize Python Numpy and Matplotlib to realize OpenCV baisc image processing function

a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version

TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification

Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

Maximum Spatial Perturbation for Image-to-Image Translation (Official Implementation)

This project is based on our SIGGRAPH 2021 paper, ROSEFusion: Random Optimization for Online DenSE Reconstruction under Fast Camera Motion .

Variational autoencoder for anime face reconstruction