ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Last update: Oct 02, 2022

Overview

This repository is the official implementation of the empirical research presented in the supplementary material of the paper, ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees.

Requirements

To install requirements:

pip install -r requirements.txt

Please install Python before running the above setup command. The code was tested on Python 3.8.10.

Create a folder to store all the models and results:

mkdir ckeckpoint

Training

To fully replicate the results below, train all the models by running the following two commands:

./train_cuda0.sh

./train_cuda1.sh

We used two separate scripts because we had two NVIDIA GPUs and we wanted to run two training processes for different models at the same time. If you have more GPUs or resources, you can submit multiple jobs and let them run in parallel.
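With more than two GPUs, the same idea can be generalized by a small launcher. The following is a minimal sketch and not part of the repository. It assumes that the training script runs on whichever GPU is exposed through the standard CUDA_VISIBLE_DEVICES environment variable (the provided shell scripts may select devices differently). The helper names assign_jobs, run_queue, and launch are hypothetical.

```python
import os
import subprocess
from concurrent.futures import ThreadPoolExecutor


def assign_jobs(commands, gpu_ids):
    """Split commands into one queue per GPU, in round-robin order."""
    queues = {gpu: [] for gpu in gpu_ids}
    for i, cmd in enumerate(commands):
        queues[gpu_ids[i % len(gpu_ids)]].append(cmd)
    return queues


def run_queue(gpu_id, queue):
    """Run the commands of one queue back to back on a single GPU."""
    # Assumption: the training script uses the GPU made visible here.
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu_id))
    return [subprocess.run(cmd, env=env).returncode for cmd in queue]


def launch(commands, gpu_ids):
    """Run all queues in parallel, one worker thread per GPU."""
    queues = assign_jobs(commands, gpu_ids)
    with ThreadPoolExecutor(max_workers=len(gpu_ids)) as pool:
        futures = {gpu: pool.submit(run_queue, gpu, q)
                   for gpu, q in queues.items()}
        return {gpu: f.result() for gpu, f in futures.items()}


# Example (not executed here):
# launch([["python", "main.py", "--data", "CIFAR10",
#          "--model", "CIFAR10_BNResNEst_ResNet_110"]], gpu_ids=[0, 1])
```

Each GPU processes its own queue sequentially, so a slow job delays only the jobs queued behind it on the same device. The return value maps each GPU id to the exit codes of its jobs.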

To train a model with different seeds (initializations), run the command in the following form:

python main.py --data <dataset> --model <DNN_model> --mu <learning_rate>

The above command uses the default seed list. You can also specify your own seeds, as in the following example:

python main.py --data CIFAR10 --model CIFAR10_BNResNEst_ResNet_110 --seed_list 8 9
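To sweep several models or seed lists from Python rather than by hand, the command line above can be assembled programmatically. This is a minimal sketch that uses only the flags shown in this README (--data, --model, --mu, --seed_list). The helper name build_train_command is hypothetical and not part of the repository.

```python
def build_train_command(data, model, mu=None, seed_list=None):
    """Assemble the argument list for main.py from the documented flags."""
    cmd = ["python", "main.py", "--data", data, "--model", model]
    if mu is not None:
        cmd += ["--mu", str(mu)]
    if seed_list:
        # The seeds are passed as separate space-delimited values.
        cmd += ["--seed_list"] + [str(seed) for seed in seed_list]
    return cmd


cmd = build_train_command(
    "CIFAR10", "CIFAR10_BNResNEst_ResNet_110", seed_list=[8, 9]
)
print(" ".join(cmd))
# prints: python main.py --data CIFAR10 --model CIFAR10_BNResNEst_ResNet_110 --seed_list 8 9
```

The returned list can be handed to subprocess.run(cmd, check=True) to start training. Omitting seed_list leaves the flag out so that the default seed list applies.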

Run this command to see how to customize your training or hyperparameters:

python main.py --help

Evaluation

To evaluate all trained models on the benchmarks reported in the tables below, run:

./eval.sh

To evaluate a single model, run:

python eval.py --data <dataset> --model <DNN_model> --seed_list <seed>

Pre-trained models

All pre-trained models can be downloaded from this Google Drive link. Every last_model.pt file is a fully trained model.
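Once the download is unpacked, the fully trained checkpoints can be located with a short script. The sketch below assumes nothing about the folder layout beyond the file name last_model.pt stated above. The function name find_final_checkpoints is hypothetical.

```python
from pathlib import Path


def find_final_checkpoints(root):
    """Return all last_model.pt files under root, sorted by path."""
    return sorted(Path(root).rglob("last_model.pt"))


# Example (not executed here), with the unpacked download as root:
# for path in find_final_checkpoints("path/to/unpacked/download"):
#     print(path)
```

The .pt extension suggests a PyTorch file, so torch.load(path, map_location="cpu") is a plausible way to open each result. Whether a file holds a full model or a state dictionary is not stated here, so inspect the loaded object before using it.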

Results

Image Classification on CIFAR-10

Architecture	Standard	ResNEst	BN-ResNEst	A-ResNEst
WRN-16-8	95.56% (11M)	94.39% (11M)	95.48% (11M)	95.29% (8.7M)
WRN-40-4	95.45% (9.0M)	94.58% (9.0M)	95.61% (9.0M)	95.48% (8.4M)
ResNet-110	94.46% (1.7M)	92.77% (1.7M)	94.52% (1.7M)	93.97% (1.7M)
ResNet-20	92.60% (0.27M)	91.02% (0.27M)	92.56% (0.27M)	92.47% (0.24M)
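
The CIFAR-10 table can be read as a trade-off between accuracy and model size. As a worked example, the sketch below copies the Standard and A-ResNEst columns from the table above and computes the accuracy difference and the relative parameter reduction. It assumes each entry is an accuracy followed by the parameter count in millions. The names cifar10 and compare are illustrative only.

```python
# (accuracy in %, parameters in millions), copied from the CIFAR-10 table.
cifar10 = {
    "WRN-16-8":   {"Standard": (95.56, 11.0), "A-ResNEst": (95.29, 8.7)},
    "WRN-40-4":   {"Standard": (95.45, 9.0),  "A-ResNEst": (95.48, 8.4)},
    "ResNet-110": {"Standard": (94.46, 1.7),  "A-ResNEst": (93.97, 1.7)},
    "ResNet-20":  {"Standard": (92.60, 0.27), "A-ResNEst": (92.47, 0.24)},
}


def compare(row):
    """Return (accuracy difference in points, parameter reduction in %)."""
    (acc_std, par_std), (acc_a, par_a) = row["Standard"], row["A-ResNEst"]
    acc_diff = round(acc_a - acc_std, 2)
    par_cut = round(100.0 * (par_std - par_a) / par_std, 1)
    return acc_diff, par_cut


for name, row in cifar10.items():
    acc_diff, par_cut = compare(row)
    print(f"{name}: {acc_diff:+.2f} points, {par_cut:.1f}% fewer parameters")
```

The parameter counts in the table are rounded, so the percentages are approximate. ResNet-110, for instance, shows 1.7M in both columns and therefore yields a reduction of 0.0% here.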

Image Classification on CIFAR-100

Architecture	Standard	ResNEst	BN-ResNEst	A-ResNEst
WRN-16-8	79.14% (11M)	75.43% (11M)	78.99% (11M)	78.74% (8.9M)
WRN-40-4	79.08% (9.0M)	75.16% (9.0M)	78.97% (9.0M)	78.62% (8.7M)
ResNet-110	74.08% (1.7M)	69.08% (1.7M)	73.95% (1.7M)	72.53% (1.9M)
ResNet-20	68.56% (0.28M)	64.73% (0.28M)	68.47% (0.28M)	68.16% (0.27M)

BibTeX

@inproceedings{chen2021resnests,
  title={{ResNEsts} and {DenseNEsts}: Block-based {DNN} Models with Improved Representation Guarantees},
  author={Chen, Kuan-Lin and Lee, Ching-Hua and Garudadri, Harinath and Rao, Bhaskar D.},
  booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

Owner

Kuan-Lin (Jason) Chen
