Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

Last update: Apr 24, 2022

Related tags

Overview

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

1. Classification Task

PyTorch implementation of DisturbLabel: Regularizing CNN on the Loss Layer [CVPR 2016] extended with Directional DisturbLabel method.

This classification code is built on top of https://github.com/amirhfarzaneh/disturblabel-pytorch/blob/master/README.md project and utilizes implementation from ResNet 18 from https://github.com/huyvnphan/PyTorch_CIFAR10

Directional DisturbLabel

  if args.mode == 'ddl' or args.mode == 'ddldr':
      out = F.softmax(output, dim=1)
      norm = torch.norm(out, dim=1)
      out = out / norm[:, None]
      idx = []
      for i in range(len(out)):
          if out[i,target[i]] > .5:
              idx.append(i)
              
      if len(idx) > 0:
          target[idx] = disturb(target[idx]).to(device)

Usage

python main_ddl.py --mode=dl --alpha=20

Most important arguments

--dataset - which data to use

Possible values:

value	dataset
MNIST	MNIST
FMNIST	Fashion MNIST
CIFAR10	CIFAR-10
CIFAR100	CIFAR-100
ART	Art Images: Drawing/Painting/Sculptures/Engravings
INTEL	Intel Image Classification

Default: MNIST

-- mode - regularization method applied

Possible values:

value	method
noreg	Without any regularization
dl	Vanilla DistrubLabel
ddl	Directional DisturbLabel
dropout	Dropout
dldr	DistrubLabel+Dropout
ddldl	Directional DL+Dropout

Default: ddl

--alpha - alpha for vanilla Distrub label and Directional DisturbLabel

Possible values: int from 0 to 100. Default: 20

--epochs - number of training epochs

Default: 100

2. Regression Task

DisturbValue

def noise_generator(x, alpha):
    noise = torch.normal(0, 1e-8, size=(len(x), 1))
    noise[torch.randint(0, len(x), (int(len(x)*(1-alpha)),))] = 0

    return noise

DisturbError

def disturberror(outputs, values):
    epsilon = 1e-8
    e = values - outputs
    for i in range(len(e)):
        if (e[i] < epsilon) & (e[i] >= 0):
            values[i] = values[i] + e[i] / 4
        elif (e[i] > -epsilon) & (e[i] < 0):
            values[i] = values[i] - e[i] / 4

    return values

Datasets

Boston: 506 instances, 13 features
Bike Sharing: 731 instances, 13 features
Air Quality(AQ): 9357 instances, 10 features
make_regression(MR): 5000 instances, 30 features (random sample for regression)
Housing Price - Kaggle(HP): 1460 instances, 81 features
Student Performance (SP): 649 instances, 13 features (20 - categorical were dropped)
Superconductivity Dataset (SD): 21263 instances, 81 features
Communities & Crime (CC): 1994 instances, 100 features
Energy Prediction (EP): 19735 instancies, 27 features

Experiment Setting

Model: MLP which has 3 hidden layers

Result: Averaged over 20 runs

Hyperparameters: Using grid search options

Usage

python main_new.py --de y --dataset "bike" --dv_annealing y --epoch 100 --T 80
python main_new.py --de y --dv y --dataset "bike" -epoch 100
python main_new.py --de y --l2 y --dataset "air" -epoch 100
python main_new.py --dv y --dv_annealing y --dataset "air" -epoch 100 #for annealing setting dv should be "y"

--dataset: 'bike', 'air', 'boston', 'housing', 'make_sklearn', 'superconduct', 'energy', 'crime', 'students'
--dropout, --dv(disturbvalue), --de(disturberror), --l2, --dv_annealing: (string) y / n
--lr: (float)
--batch_size, --epoch, --T(cos annealing T): (int)
-- default dv_annealing: alpha_min = 0.05, alpha_max = 0.12, T_i = 80

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

Related tags

Overview

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

1. Classification Task

Directional DisturbLabel

Usage

Most important arguments

2. Regression Task

DisturbValue

DisturbError

Datasets

Experiment Setting

Model: MLP which has 3 hidden layers

Result: Averaged over 20 runs

Hyperparameters: Using grid search options

Usage

Owner

Yongho Kim

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

PyTorch implementation of our Adam-NSCL algorithm from our CVPR2021 (oral) paper "Training Networks in Null Space for Continual Learning"

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

MediaPipeのPythonパッケージのサンプルです。2020/12/11時点でPython実装のある4機能(Hands、Pose、Face Mesh、Holistic)について用意しています。

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

Pansharpening by convolutional neural networks in the full resolution framework

CCP dataset from Clothing Co-Parsing by Joint Image Segmentation and Labeling

GANmouflage: 3D Object Nondetection with Texture Fields

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Sequence-to-Sequence learning using PyTorch

This is a re-implementation of TransGAN: Two Pure Transformers Can Make One Strong GAN (CVPR 2021) in PyTorch.

Multi-Glimpse Network With Python

Official Pytorch implementation of "Unbiased Classification Through Bias-Contrastive and Bias-Balanced Learning (NeurIPS 2021)

Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"

TransZero++: Cross Attribute-guided Transformer for Zero-Shot Learning

Models, datasets and tools for Facial keypoints detection

Implementation of our recent paper, WOOD: Wasserstein-based Out-of-Distribution Detection.

DeepLab resnet v2 model in pytorch

Dynamic wallpaper generator.

A GridMixup augmentation, inspired by GridMask and CutMix