Pure python implementation reverse-mode automatic differentiation

Last update: Sep 12, 2022

Related tags

Overview

MiniGrad

A minimal implementation of reverse-mode automatic differentiation (a.k.a. autograd / backpropagation) in pure Python.

Inspired by Andrej Karpathy's micrograd, but with more comments and less cleverness. Thanks for the wonderful reference implementation and tests!

Overview

Create a Scalar.

a = Scalar(1.5)

Do some calculations.

b = Scalar(-4.0)
c = a**3 / 5
d = c + (b**2).relu()

Compute the gradients.

d.backward()

Plot the computational graph.

draw_graph(d)

Repo Structure

demo.ipynb: Demo notebook of MiniGrad's functionality.
tests.ipynb: Test notebook to verify gradients against PyTorch and JAX. Install both to run tests.
minigrad/minigrad.py: The entire autograd logic in one (~100 loc) numeric class. See section below for details.
minigrad/visualize.py: This just draws nice-looking computational graphs. Install Graphviz to run it.
requirements.txt: MiniGrad requires no external modules to run. This file just sets up my dev environment.

Implementation

MiniGrad is implemented in one small (~100 loc) Python class, using no external modules.

The entirety of the auto-differentiation logic lives in the Scalar class in minigrad.py.

A Scalar wraps a float/int and overrides its arithmetic magic methods in order to:

Stitch together a define-by-run computational graph when doing arithmetic operations on a Scalar
Hard code the derivative functions of arithmetic operations
Keep track of ∂self/∂parent between adjacent nodes
Compute ∂output/∂self with the chain rule on demand (when .backward() is called)

This is called reverse-mode automatic differentiation. It's great when you have few outputs and many inputs, since it computes all derivatives of one output in one pass. This is also how TensorFlow and PyTorch normally compute gradients.

(Forward-mode automatic differentiation also exists, and has the opposite advantage.)

Not in Scope

This project is just for fun, so the following are not planned:

Vectorization
Higher order derivatives (i.e. Scalar.grad is a Scalar itself)
Forward-mode automatic differentiation
Neural network library on top of MiniGrad

Pure python implementation reverse-mode automatic differentiation

Related tags

Overview

MiniGrad

Overview

Repo Structure

Implementation

Not in Scope

Owner

Kenny Song

CATE: Computation-aware Neural Architecture Encoding with Transformers

Much faster than SORT(Simple Online and Realtime Tracking), a little worse than SORT

A Python reference implementation of the CF data model

PyTorch reimplementation of hand-biomechanical-constraints (ECCV2020)

Code for the paper "Combining Textual Features for the Detection of Hateful and Offensive Language"

CRISCE: Automatically Generating Critical Driving Scenarios From Car Accident Sketches

A PyTorch implementation of the architecture of Mask RCNN

A motion detection system with RaspberryPi, OpenCV, Python

CV backbones including GhostNet, TinyNet and TNT, developed by Huawei Noah's Ark Lab.

📚 A collection of all the Deep Learning Metrics that I came across which are not accuracy/loss.

Framework for Spectral Clustering on the Sparse Coefficients of Learned Dictionaries

PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

A Deep Reinforcement Learning Framework for Stock Market Trading

ML models implementation practice

HistoKT: Cross Knowledge Transfer in Computational Pathology

The PyTorch implementation of Directed Graph Contrastive Learning (DiGCL), NeurIPS-2021

RSNA Intracranial Hemorrhage Detection with python

Link prediction using Multiple Order Local Information (MOLI)

Free-duolingo-plus - Duolingo account creator that uses your invite code to get you free duolingo plus

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset