Implementation for ACProp (Momentum Centering and Asynchronous Update for Adaptive Gradient Methods, NeurIPS 2021)

Last update: Jun 11, 2022

Overview

This repository contains code to reproduce the results of the NeurIPS 2021 paper "Momentum Centering and Asynchronous Update for Adaptive Gradient Methods".

This repo depends heavily on the official implementations of AdaBound (https://github.com/Luolc/AdaBound) and AdaBelief (https://github.com/juntang-zhuang/Adabelief-Optimizer).

Dependencies

python 3.7
pytorch 1.1.0
torchvision 0.3.0
jupyter notebook
AdaBelief (please install with "pip install adabelief-pytorch==0.2.0")

Visualization of pre-trained curves

Please use the Jupyter notebook "visualization.ipynb" to visualize the training and test curves of the different optimizers.

Training and evaluation code

(1) Train a network with:

CUDA_VISIBLE_DEVICES=0 python main.py --model vgg --optim acprop --lr 1e-3 --eps 1e-8 --beta1 0.9 --beta2 0.999 --momentum 0.9

--model: name of the model; choices include ['vgg', 'resnet', 'densenet']
--optim: name of the optimizer; choices include ['acprop', 'adashift', 'sgd', 'adam', 'adamw', 'adabelief', 'radam']
--lr: learning rate
--eps: epsilon value used by the optimizer. Note that Yogi uses a default of 1e-03, while other optimizers typically use 1e-08
--beta1, --beta2: beta values in adaptive optimizers
--momentum: momentum used for SGD
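To make the flags above concrete, here is a minimal sketch of how such a command line could be parsed with argparse. It is not the code in main.py: the function name build_parser, the help strings, and the default values (taken from the example command above) are assumptions made for illustration.

```python
import argparse

# Choices copied from the flag descriptions above.
MODELS = ["vgg", "resnet", "densenet"]
OPTIMIZERS = ["acprop", "adashift", "sgd", "adam", "adamw", "adabelief", "radam"]


def build_parser():
    """Hypothetical parser mirroring the documented flags (not the repo's own code)."""
    parser = argparse.ArgumentParser(description="Train a network with a chosen optimizer")
    parser.add_argument("--model", default="vgg", choices=MODELS, help="name of the model")
    parser.add_argument("--optim", default="acprop", choices=OPTIMIZERS, help="name of the optimizer")
    # Defaults below are assumed from the example command, not read from main.py.
    parser.add_argument("--lr", type=float, default=1e-3, help="learning rate")
    parser.add_argument("--eps", type=float, default=1e-8, help="epsilon used by the optimizer")
    parser.add_argument("--beta1", type=float, default=0.9, help="first beta of adaptive optimizers")
    parser.add_argument("--beta2", type=float, default=0.999, help="second beta of adaptive optimizers")
    parser.add_argument("--momentum", type=float, default=0.9, help="momentum used for SGD")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(["--model", "vgg", "--optim", "acprop", "--lr", "1e-3"])
print(vars(args))
```

Restricting --model and --optim with choices means a misspelled name is rejected before training starts rather than failing later.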

(2) Visualize the results using the notebook "visualization.ipynb".

Owner

Juntang Zhuang
