Efficient Sharpness-aware Minimization for Improved Training of Neural Networks

Code for “Efficient Sharpness-aware Minimization for Improved Training of Neural Networks”

Requirements

The code is implemented in PyTorch and has been tested in the following environment:

python = 3.8.8
torch = 1.8.0
torchvision = 0.9.0

What is in this repository

Code for ESAM on the CIFAR-10 and CIFAR-100 datasets.

How to use it

import torch
from utils.layer_dp_sam import ESAM

base_optimizer = torch.optim.SGD(model.parameters(), lr=args.learning_rate, momentum=0.9, weight_decay=args.weight_decay)
optimizer = ESAM(paras, base_optimizer, rho=args.rho, weight_dropout=args.weight_dropout, adaptive=args.isASAM, nograd_cutoff=args.nograd_cutoff, opt_dropout=args.opt_dropout, temperature=args.temperature)

--beta: the SWP (Stochastic Weight Perturbation) hyperparameter

--gamma: the SDS (Sharpness-sensitive Data Selection) hyperparameter
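To give a feel for what the SWP hyperparameter controls, here is a minimal pure-Python sketch. It is not the repository's implementation. It assumes that SWP keeps each coordinate of the SAM-style perturbation rho * g / ||g|| with probability beta and rescales the kept coordinates by 1/beta so that the expected perturbation is unchanged. The function name swp_perturbation and the flat-list representation of gradients are hypothetical.

```python
import math
import random


def swp_perturbation(grads, rho, beta, rng):
    """Sparsified SAM-style perturbation for a flat list of gradients.

    Assumption (not taken from this repository): each coordinate is kept
    with probability `beta`, and kept coordinates are divided by `beta`
    so the expectation matches the dense perturbation rho * g / ||g||.
    """
    norm = math.sqrt(sum(g * g for g in grads))
    if norm == 0.0:
        # A zero gradient gives no direction to perturb along.
        return [0.0 for _ in grads]
    dense = [rho * g / norm for g in grads]
    # rng.random() is in [0, 1), so beta = 1.0 keeps every coordinate.
    return [e / beta if rng.random() < beta else 0.0 for e in dense]


rng = random.Random(0)
grads = [0.3, -0.4, 0.0, 1.2]
eps = swp_perturbation(grads, rho=0.05, beta=0.5, rng=rng)
print(eps)
```

With beta = 1.0 the sketch reduces to the dense perturbation, whose norm equals rho. Smaller beta values perturb fewer weights per step.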

During training, loss_fct must be created with reduction="none" so that it returns instance-wise losses. defined_backward is the backward function used for DDP and mixed-precision training.

loss_fct = torch.nn.CrossEntropyLoss(reduction="none")
def defined_backward():
    if args.fp16:
        # amp here is NVIDIA Apex's amp module (from apex import amp)
        with amp.scale_loss(loss, optimizer0) as scaled_loss:
            scaled_loss.backward()
    else:
        loss.backward()

paras = [inputs,targets,loss_fct,model,defined_backward]
optimizer.paras = paras
optimizer.step()
predictions_logits,loss = optimizer.returnthings
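Instance-wise losses matter because a data-selection step such as SDS needs one loss value per sample. The following pure-Python sketch is illustrative only and is not the repository's code. It assumes that selection keeps the gamma fraction of samples whose loss increases the most after the weights are perturbed. The function name select_sharpness_sensitive and the example loss values are hypothetical.

```python
def select_sharpness_sensitive(loss_before, loss_after, gamma):
    """Indices of the gamma fraction of samples with the largest loss increase.

    Assumption (not taken from this repository): samples are ranked by
    loss_after - loss_before, and at least one sample is always kept.
    """
    if not 0.0 < gamma <= 1.0:
        raise ValueError("gamma must be in (0, 1]")
    if len(loss_before) != len(loss_after):
        raise ValueError("loss lists must have the same length")
    increase = [after - before for before, after in zip(loss_before, loss_after)]
    keep = max(1, int(gamma * len(increase)))
    ranked = sorted(range(len(increase)), key=lambda i: increase[i], reverse=True)
    return sorted(ranked[:keep])


# Per-sample losses, as a loss with reduction="none" would provide them.
loss_before = [0.9, 0.2, 1.5, 0.4]
loss_after = [1.0, 0.8, 1.9, 0.3]
selected = select_sharpness_sensitive(loss_before, loss_after, gamma=0.5)
print(selected)  # [1, 2]
```

A loss reduced to a single scalar (the default "mean" reduction) would not allow this kind of per-sample ranking.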

Example

bash run.sh

Reference Code

[1] SAM

Owner

Angusdu