MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Last update: Jan 16, 2022

Related tags

Overview

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Using mixup data augmentation as reguliraztion and tuning the hyper parameters of ResNet 50 models to achieve 94.57% test accuracy on CIFAR-10 Dataset. Link to paper

network	error %
resnet-50	6.97
resnet-110	6.61
resnet-164	5.93
resnet-1001	7.61
This method	5.43

Overview

Change the wandb api key to valid api key.
Python 3.8 and pytorch 1.9 (works on older versions as well)
main.py is to train model
sweep.py and sweep_config.py are for hyperparameter optimization for experiment tracking wandb is used please change api key
pred.py is to run the trained model on the custom data. (Appropriately provide model paths)

Important

If you want to run sweep.py then you must use wandb apikey and if you want to run main.py use wandb to log the experiment for comparision else comment out wandb part.

Training


# Start training with:

python main.py (Added --run_name optional argument for better tracking experiments)

  

# You can manually resume the training with:

python main.py --resume --lr=0.01

Hyperparameters sweep


# Start sweep with:

python sweep.py

  

# Provide appropriate hyperparameters range in sweep_config.py (Config written in py file to use the power of math package for sweep configs)

Running on custom dataset


# Convert traget data of (N*32*32*3) into (N*3*32*32) shape and pass through the model:

python pred.py (Provide path of the saved models)

Other files

mixup.py contains functions to claculate loss of mixup predictions as you cant use nn.CrossEntropyLoss
utils.py contain somehelper functions
dataloader.py is a torch class based dataloader of our train data (CIFAR-10 data)
private_loader.py is a torch class based dataloader of our private data.
Transformations are done using torchtransforms in main.py and sweep.py files depending on usage.

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Related tags

Overview

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Overview

Important

Training

Hyperparameters sweep

Running on custom dataset

Other files

Owner

Bhanu

Pytorch Implementation of "Desigining Network Design Spaces", Radosavovic et al. CVPR 2020.

PyTorch implementation of Histogram Layers from DeepHist: Differentiable Joint and Color Histogram Layers for Image-to-Image Translation

Implementation of parameterized soft-exponential activation function.

PyTorch implementation of: Michieli U. and Zanuttigh P., "Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations", CVPR 2021.

a spacial-temporal pattern detection system for home automation

MPViT:Multi-Path Vision Transformer for Dense Prediction

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

RRL: Resnet as representation for Reinforcement Learning

Code and Resources for the Transformer Encoder Reasoning Network (TERN)

Pathdreamer: A World Model for Indoor Navigation

Numerai tournament example scripts using NN and optuna

FuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space OptimizationFuseDream: Training-Free Text-to-Image Generationwith Improved CLIP+GAN Space Optimization

I created My own Virtual Artificial Intelligence named genesis, He can assist with my Tasks and also perform some analysis,,

PyTorch implementation of Pay Attention to MLPs

CTF challenges and write-ups for MicroCTF 2021.

Pytorch implementation of VAEs for heterogeneous likelihoods.

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning

This repository contains python code necessary to replicated the experiments performed in our paper "Invariant Ancestry Search"

Code for "Multi-Compound Transformer for Accurate Biomedical Image Segmentation"

Neighbor2Seq: Deep Learning on Massive Graphs by Transforming Neighbors to Sequences