A tiny scalar-valued autograd engine and a neural net library on top of it with a PyTorch-like API

Last update: Jan 08, 2023

micrograd

A tiny Autograd engine (with a bite! :)). Implements backpropagation (reverse-mode autodiff) over a dynamically built DAG and a small neural networks library on top of it with a PyTorch-like API. Both are tiny, with about 100 and 50 lines of code respectively. The DAG only operates over scalar values, so e.g. we chop up each neuron into all of its individual tiny adds and multiplies. However, this is enough to build up entire deep neural nets doing binary classification, as the demo notebook shows. Potentially useful for educational purposes.
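To make the idea of reverse-mode autodiff over a scalar DAG concrete, here is a minimal sketch. It is not micrograd's source: the class name `Scalar` and its attributes are hypothetical, and only addition and multiplication are covered. Each operation records its inputs and a small closure that applies the chain rule; `backward()` sorts the graph topologically and runs those closures from the output back to the leaves.

```python
class Scalar:
    """Hypothetical stand-in for a scalar autograd node (not micrograd's Value)."""

    def __init__(self, data, parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents        # nodes this value was computed from
        self._backward = lambda: None  # pushes self.grad into the parents

    def __add__(self, other):
        out = Scalar(self.data + other.data, (self, other))

        def _backward():
            # d(out)/d(self) = d(out)/d(other) = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        out = Scalar(self.data * other.data, (self, other))

        def _backward():
            # d(out)/d(self) = other.data, d(out)/d(other) = self.data
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topological order: every node appears after all of its parents.
        order, seen = [], set()

        def visit(node):
            if node not in seen:
                seen.add(node)
                for parent in node._parents:
                    visit(parent)
                order.append(node)

        visit(self)
        self.grad = 1.0
        # Walk from the output back to the leaves, applying the chain rule.
        for node in reversed(order):
            node._backward()


x = Scalar(3.0)
y = Scalar(2.0)
z = x * y + x          # dz/dx = y + 1, dz/dy = x
z.backward()
print(x.grad, y.grad)
```

Gradients are accumulated with `+=` because a node such as `x` above can feed into several downstream operations, and each path contributes to its total derivative.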

Installation

pip install micrograd

Example usage

Below is a slightly contrived example showing a number of possible supported operations:

from micrograd.engine import Value

a = Value(-4.0)
b = Value(2.0)
c = a + b
d = a * b + b**3
c += c + 1
c += 1 + c + (-a)
d += d * 2 + (b + a).relu()
d += 3 * d + (b - a).relu()
e = c - d
f = e**2
g = f / 2.0
g += 10.0 / f
print(f'{g.data:.4f}') # prints 24.7041, the outcome of this forward pass
g.backward()
print(f'{a.grad:.4f}') # prints 138.8338, i.e. the numerical value of dg/da
print(f'{b.grad:.4f}') # prints 645.5773, i.e. the numerical value of dg/db
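As a sanity check that does not depend on micrograd, the same expression can be evaluated on plain floats and differentiated numerically. The sketch below uses central finite differences; the helper names `g`, `relu`, and `central_difference` and the step size `h` are assumptions made for illustration. The results should agree with the values printed above to roughly four decimal places.

```python
def relu(x):
    return max(0.0, x)


def g(a, b):
    """Plain-float version of the example expression above."""
    c = a + b
    d = a * b + b**3
    c += c + 1
    c += 1 + c + (-a)
    d += d * 2 + relu(b + a)
    d += 3 * d + relu(b - a)
    e = c - d
    f = e**2
    out = f / 2.0
    out += 10.0 / f
    return out


def central_difference(fn, args, index, h=1e-6):
    """Approximate the partial derivative of fn with respect to args[index]."""
    lo = list(args)
    hi = list(args)
    lo[index] -= h
    hi[index] += h
    return (fn(*hi) - fn(*lo)) / (2 * h)


point = (-4.0, 2.0)
g_value = g(*point)
dg_da = central_difference(g, point, 0)
dg_db = central_difference(g, point, 1)
print(f'{g_value:.4f}')
print(f'{dg_da:.4f}')
print(f'{dg_db:.4f}')
```

Finite differences need two forward passes per input, whereas a single `backward()` call yields the gradient with respect to every input at once. That difference is the main practical reason to use reverse-mode autodiff.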

Training a neural net

The notebook demo.ipynb provides a full demo of training a 2-layer neural network (MLP) binary classifier. This is achieved by initializing a neural net from the micrograd.nn module, implementing a simple SVM "max-margin" binary classification loss, and using SGD for optimization. The notebook uses a 2-layer neural net with two 16-node hidden layers and plots the resulting decision boundary on the moon dataset.
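The max-margin loss and the SGD update can be illustrated without the notebook. The sketch below is not the notebook's code: it swaps the MLP for a linear scorer, uses hand-written subgradients instead of `backward()`, and trains on four made-up points. The names `points`, `labels`, `score`, and `hinge` and the learning rate are assumptions chosen for illustration.

```python
# Toy, linearly separable data with labels in {-1, +1} (made up for this sketch).
points = [(2.0, 1.0), (1.5, 2.0), (-1.0, -1.5), (-2.0, -0.5)]
labels = [1, 1, -1, -1]

w = [0.0, 0.0]  # weights of a linear scorer (the notebook uses an MLP instead)
b = 0.0         # bias
lr = 0.1        # learning rate, an arbitrary choice


def score(x):
    return w[0] * x[0] + w[1] * x[1] + b


def hinge(x, y):
    # Max-margin loss: zero once the sample is on the correct side with margin >= 1.
    return max(0.0, 1.0 - y * score(x))


for epoch in range(20):
    for x, y in zip(points, labels):
        if hinge(x, y) > 0.0:
            # Subgradient of the hinge loss is -y * x for w and -y for b,
            # so the SGD step adds lr * y * x and lr * y.
            w[0] += lr * y * x[0]
            w[1] += lr * y * x[1]
            b += lr * y

total_loss = sum(hinge(x, y) for x, y in zip(points, labels))
accuracy = sum(y * score(x) > 0 for x, y in zip(points, labels)) / len(points)
print(total_loss, accuracy)
```

Samples that already satisfy the margin contribute zero loss and zero gradient, so only points near or across the boundary move the parameters.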

Tracing / visualization

For added convenience, the notebook trace_graph.ipynb produces graphviz visualizations. For example, calling draw_dot on the output of the code below visualizes a simple 2D neuron, showing both the data (left number in each node) and the gradient (right number in each node).

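from micrograd import nn
from micrograd.engine import Value

# draw_dot is defined in trace_graph.ipynb, not in the micrograd package
n = nn.Neuron(2)
x = [Value(1.0), Value(-2.0)]
y = n(x)
dot = draw_dot(y)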
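For readers who want to see what such a visualization involves, the sketch below walks a computation graph and emits Graphviz DOT text with the data and gradient of each node. It is not the notebook's `draw_dot`: the `Node` class and the `to_dot` function are hypothetical, and the output is a plain string rather than a graphviz object.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical graph node; micrograd's Value stores similar information."""
    data: float
    grad: float = 0.0
    op: str = ""
    children: tuple = ()


def to_dot(root):
    """Return DOT source for the graph reachable from root."""
    nodes, edges, seen = [], [], set()

    def visit(node):
        if id(node) in seen:
            return
        seen.add(id(node))
        nodes.append(node)
        for child in node.children:
            edges.append((child, node))
            visit(child)

    visit(root)

    lines = ["digraph G {", "  rankdir=LR;"]
    for n in nodes:
        # Record-shaped node: data on the left, gradient on the right.
        lines.append(
            f'  n{id(n)} [shape=record, '
            f'label="{{ data {n.data:.4f} | grad {n.grad:.4f} }}"];'
        )
    for child, parent in edges:
        lines.append(f"  n{id(child)} -> n{id(parent)};")
    lines.append("}")
    return "\n".join(lines)


x1 = Node(1.0, grad=0.5)
w1 = Node(0.5, grad=1.0)
prod = Node(x1.data * w1.data, grad=1.0, op="*", children=(x1, w1))
dot_source = to_dot(prod)
print(dot_source)
```

The resulting text can be pasted into any Graphviz renderer. Nodes are keyed by `id()` so that a value reused in several places still appears only once in the picture.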

Running tests

To run the unit tests you will have to install PyTorch, which the tests use as a reference for verifying the correctness of the calculated gradients. Then simply:

python -m pytest

License

MIT

Owner

Andrej
