Code for the paper "Curriculum Dropout", ICCV 2017

Last update: Jan 02, 2022

Overview

Curriculum Dropout

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. However, we show that using a fixed dropout probability during training is a suboptimal choice. We propose a time scheduling for the probability of retaining neurons in the network. This induces an adaptive regularization scheme that smoothly increases the difficulty of the optimization problem. This idea of "starting easy" and adaptively increasing the difficulty of the learning problem has its roots in curriculum learning and allows one to train better models. Indeed, we prove that our optimization strategy implements a very general curriculum scheme, by gradually adding noise to both the input and intermediate feature representations in the network architecture. The method, named Curriculum Dropout, yields to better generalization.

Code

Each sub-folder (...in progress...) is named after the dataset analyzed and equipped with its own README. The provided code runs with Python 2.7 (should run with Python 3 as well, not tested). For the installation of tensorflow-gpu please refer to the website.

The following command should install the main dependencies on most Linux (Ubuntu) machines

sudo apt-get install python-dev python-pip && sudo pip install -r requirements.txt

Download and extract MNIST

The script download.sh downloads and extracts mnist. Deafult storing directory is ~/mnist.

sudo chmod a+x download.sh
./download.sh

Move the mnist/ folder wherever you like (e.g. /mydata) and then tell the training scripts where to find it

echo /mydata >> data_dir.txt

Reference

If you use this code as part of any published research, please acknowledge the following paper:

"Curriculum Dropout"
Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, René Vidal and Vittorio Murino pdf

@InProceedings{Morerio2017dropout,
    title={Curriculum Dropout},
    author={Morerio, Pietro and Cavazza, Jacopo and Volpi, Riccardo and Vidal, Ren\'e and Murino, Vittorio},
    booktitle = {ICCV},
    year={2017}
}

License

This repository is released under the GNU GENERAL PUBLIC LICENSE.

Code for the paper "Curriculum Dropout", ICCV 2017

Related tags

Overview

Curriculum Dropout

Code

Download and extract MNIST

Reference

License

Owner

Pietro Morerio

Space-event-trace - Tracing service for spaceteam events

This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification".

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

Official PyTorch implementation of Less is More: Pay Less Attention in Vision Transformers.

WRENCH: Weak supeRvision bENCHmark

SOTR: Segmenting Objects with Transformers [ICCV 2021]

This is the implementation of "SELF SUPERVISED REPRESENTATION LEARNING WITH DEEP CLUSTERING FOR ACOUSTIC UNIT DISCOVERY FROM RAW SPEECH" submitted to ICASSP 2022

Non-Attentive-Tacotron - This is Pytorch Implementation of Google's Non-attentive Tacotron.

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

QuanTaichi evaluation suite

Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equations

Distinguishing Commercial from Editorial Content in News

Distributed Deep learning with Keras & Spark

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Code for TIP 2017 paper --- Illumination Decomposition for Photograph with Multiple Light Sources.

Code for "FPS-Net: A convolutional fusion network for large-scale LiDAR point cloud segmentation".

CondNet: Conditional Classifier for Scene Segmentation

COIN the currently largest dataset for comprehensive instruction video analysis.

FewBit — a library for memory efficient training of large neural networks