A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Owner

Ilya Kostrikov

Post doc

Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)

News SRU++, a new SRU variant, is released. [tech report] [blog] The experimental code and SRU++ implementation are available on the dev branch which

2.1k Jan 01, 2023

High-fidelity performance metrics for generative models in PyTorch

5 Oct 24, 2021

You like pytorch? You like micrograd? You love tinygrad! ❤️

For something in between a pytorch and a karpathy/micrograd This may not be the best deep learning framework, but it is a deep learning framework. Due

9.7k Jan 05, 2023

Riemannian Adaptive Optimization Methods with pytorch optim

geoopt Manifold aware pytorch.optim. Unofficial implementation for “Riemannian Adaptive Optimization Methods” ICLR2019 and more. Installation Make sur

642 Jan 03, 2023

Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep learning projects in PyTorch.

Tez: a simple pytorch trainer NOTE: Currently, we are not accepting any pull requests! All PRs will be closed. If you want a feature or something does

1.1k Jan 04, 2023

higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual training steps.

higher is a library providing support for higher-order optimization, e.g. through unrolled first-order optimization loops, of "meta" aspects of these

1.5k Jan 03, 2023

A pure Python implementation of Compact Bilinear Pooling and Count Sketch for PyTorch.

Compact Bilinear Pooling for PyTorch. This repository has a pure Python implementation of Compact Bilinear Pooling and Count Sketch for PyTorch. This

234 Dec 07, 2022

Use Jax functions in Pytorch with DLPack

106 Dec 17, 2022

PyTorch wrappers for using your model in audacity!

130 Dec 14, 2022

A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-precision, and PyTorch extensions.

56 Sep 13, 2022

Bunch of optimizer implementations in PyTorch

76 Jan 03, 2023

Learning Sparse Neural Networks through L0 regularization

Example implementation of the L0 regularization method described at Learning Sparse Neural Networks through L0 regularization, Christos Louizos, Max W

202 Nov 10, 2022

A PyTorch implementation of EfficientNet

EfficientNet PyTorch Quickstart Install with pip install efficientnet_pytorch and load a pretrained EfficientNet with: from efficientnet_pytorch impor

7.2k Jan 06, 2023

Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS

(Generic) EfficientNets for PyTorch A 'generic' implementation of EfficientNet, MixNet, MobileNetV3, etc. that covers most of the compute/parameter ef

1.5k Jan 01, 2023

Pytorch bindings for Fortran

46 Dec 29, 2022

Pytorch implementation of Distributed Proximal Policy Optimization

Pytorch-DPPO Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 Using PPO with clip loss (from https

164 Jan 05, 2023

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Intro PyTorch implementation of Learning to learn by gradient descent by gradient descent. Run python main.py TODO Initial implementation Toy data LST

300 Dec 11, 2022

Fast and Easy-to-use Distributed Graph Learning for PyTorch Geometric

221 Dec 22, 2022

TorchShard is a lightweight engine for slicing a PyTorch tensor into parallel shards

TorchShard is a lightweight engine for slicing a PyTorch tensor into parallel shards. It can reduce GPU memory and scale up the training when the model has massive linear layers (e.g., ViT, BERT and

275 Nov 22, 2022

Over9000 optimizer

Optimizers and tests Every result is avg of 20 runs. Dataset LR Schedule Imagenette size 128, 5 epoch Imagewoof size 128, 5 epoch Adam - baseline OneC

405 Nov 27, 2022

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Related tags

Overview

Intro

Run

TODO

Owner

Ilya Kostrikov

Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)

High-fidelity performance metrics for generative models in PyTorch

You like pytorch? You like micrograd? You love tinygrad! ❤️

Riemannian Adaptive Optimization Methods with pytorch optim

Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep learning projects in PyTorch.

higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual training steps.

A pure Python implementation of Compact Bilinear Pooling and Count Sketch for PyTorch.

Use Jax functions in Pytorch with DLPack

PyTorch wrappers for using your model in audacity!

A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-precision, and PyTorch extensions.

Bunch of optimizer implementations in PyTorch

Learning Sparse Neural Networks through L0 regularization

A PyTorch implementation of EfficientNet

Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS

Pytorch bindings for Fortran

Pytorch implementation of Distributed Proximal Policy Optimization

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Fast and Easy-to-use Distributed Graph Learning for PyTorch Geometric

TorchShard is a lightweight engine for slicing a PyTorch tensor into parallel shards

Over9000 optimizer