We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Last update: Sep 08, 2022

Related tags

Overview

This repository provides the implementation for the paper "Improved Regularization and Robustness for Fine-tuning in Neural Networks", which will be presented as a poster paper in NeurIPS'21.

In this work, we propose a regularized self-labeling approach that combines regularization and self-training methods for improving the generalization and robustness properties of fine-tuning. Our approach includes two components:

First, we encode layer-wise regularization to penalize the model weights at different layers of the neural net.
Second, we add self-labeling that relabels data points based on current neural net's belief and reweights data points whose confidence is low.

Requirements

Python >= 3.6
PyTorch >= 1.7
Optuna >= 2.5
Numpy

Usage

Our algorithm is based on layer-wise regularization and self label-correction and label-weighting.

As an example, here are the test accuracy results on the Indoor dataset with independent label noise:

Method	Noise = 20%	Noise = 40%	Noise = 60%	Noise = 80%
Ours	75.21 $\pm$ 0.46	68.13 $\pm$ 0.16	57.59 $\pm$ 0.55	34.08 $\pm$ 0.79
Fine-tuning	65.02 $\pm$ 0.39	57.49 $\pm$ 0.39	44.60 $\pm$ 0.95	27.09 $\pm$ 0.19

Run following code to replicate above results:

python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 7.80246991703043 --reg_predictor 14.077402847906 \
    --noise_rate 0.2 --train_correct_label --reweight_epoch 5 --reweight_temp 2.0 --correct_epoch 10 --correct_thres 0.9 

python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 8.47139398080791 --reg_predictor 19.0191127114923 \
    --noise_rate 0.4 --train_correct_label --reweight_epoch 5 --reweight_temp 2.0 --correct_epoch 10 --correct_thres 0.9 

python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 10.7576018531961 --reg_predictor 19.8157649727473 \
    --noise_rate 0.6 --train_correct_label --reweight_epoch 5 --reweight_temp 2.0 --correct_epoch 10 --correct_thres 0.9 
    
python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 9.2031662757248 --reg_predictor 6.41568500472423 \
    --noise_rate 0.8 --train_correct_label --reweight_epoch 5 --reweight_temp 1.5 --correct_epoch 10 --correct_thres 0.9

Data Preparation

We use seven image datasets in our paper. We list the link for downloading these datasets and describe how to prepare data to run our code below.

Aircrafts: download and extract into ./data/aircrafts
- remove the class 257.clutter out of the data directory
CUB-200-2011: download and extract into ./data/CUB_200_2011/
Caltech-256: download and extract into ./data/caltech256/
Stanford-Cars: download and extract into ./data/StanfordCars/
Stanford-Dogs: download and extract into ./data/StanfordDogs/
Flowers: download and extract into ./data/flowers/
MIT-Indoor: download and extract into ./data/Indoor/

Our code automatically handles the split of the datasets.

Citation

If you find this repository useful, consider citing our work titled above.

Acknowledgment

Thanks to the authors of mars-finetuning and WS-DAN.PyTorch for providing their implementation publicly available.

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Related tags

Overview

Overview

Requirements

Usage

Data Preparation

Citation

Acknowledgment

Owner

NEU-StatsML-Research

EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling

A spatial genome aligner for analyzing multiplexed DNA-FISH imaging data.

A TensorFlow implementation of FCN-8s

An unofficial styleguide and best practices summary for PyTorch

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

The tl;dr on a few notable transformer/language model papers + other papers (alignment, memorization, etc).

Source code for the paper "Periodic Traveling Waves in an Integro-Difference Equation With Non-Monotonic Growth and Strong Allee Effect"

Neurons Dataset API - The official dataloader and visualization tools for Neurons Datasets.

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

No Code AI/ML platform

Training DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)

Homepage of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.

PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"

KDD CUP 2020 Automatic Graph Representation Learning: 1st Place Solution

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing at EMNLP 2021.

Code for the paper “The Peril of Popular Deep Learning Uncertainty Estimation Methods”

A Streamlit component to render ECharts.

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

Tensorflow 2.x implementation of Vision-Transformer model