SelfAugment extends MoCo to include automatic unsupervised augmentation selection.

Last update: Oct 26, 2022

SelfAugment

@misc{reed2020selfaugment,
      title={SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning}, 
      author={Colorado Reed and Sean Metzger and Aravind Srinivas and Trevor Darrell and Kurt Keutzer},
      year={2020},
      eprint={2009.07724},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

SelfAugment extends MoCo to include automatic unsupervised augmentation selection. It also adds support for pretraining on several new datasets and a wandb integration.

Using your own dataset.

To use your own dataset, carefully check the three main scripts and adapt each one to your data:

main_moco.py
main_lincls.py
faa.py

Some things to check:

Ensure that the image sizing for your dataset is right. If your images are 32x32 (e.g. CIFAR10), use the CIFAR10-style model, which has a 3x3 input conv and resizes images to 28x28 rather than the 224x224 used for ImageNet. This can make a big difference!
If you want SelfAugment to run quickly, consider using a small subset of your full dataset. For ImageNet, for example, we use only 50,000 random images. With a subset, you may need to run unsupervised pretraining for more epochs than you usually do: we usually scale the number of MoCov2 epochs so that the total number of iterations on the subset is the same as, or a bit smaller than, on the full dataset.
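The subset-and-epoch scaling described above can be sketched as follows. This is a minimal illustration, not code from this repository: the helper names `pick_subset` and `scaled_epochs` and all the numbers in the example are assumptions made for the sketch.

```python
import math
import random


def pick_subset(num_images, subset_size, seed=0):
    """Return a sorted list of `subset_size` distinct image indices."""
    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    return sorted(rng.sample(range(num_images), subset_size))


def scaled_epochs(full_size, subset_size, full_epochs, batch_size):
    """Epochs on the subset that match the iteration count of
    `full_epochs` epochs on the full dataset (rounded up)."""
    full_iters = full_epochs * math.ceil(full_size / batch_size)
    iters_per_subset_epoch = math.ceil(subset_size / batch_size)
    return math.ceil(full_iters / iters_per_subset_epoch)


if __name__ == "__main__":
    # Illustrative numbers only (hypothetical dataset and schedule).
    subset = pick_subset(num_images=1_000_000, subset_size=50_000)
    print(len(subset))
    print(scaled_epochs(full_size=1_000_000, subset_size=50_000,
                        full_epochs=10, batch_size=250))
```

The value returned by `scaled_epochs` is the matching point; choosing a somewhat smaller number of epochs corresponds to the "a bit smaller" case.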

Base augmentation.

To find the base augmentation, use slm_utils/submit_single_augmentations.py.

This produces 16 models, each the result of self-supervised training using ONLY one of the augmentations. slm_utils/submit_single_augmentations.py is currently set up for ImageNet, so it uses a subset of the data for this step.

Then you will need to train a rotation classifier for each model. This can be done using main_lincls.py.
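For readers unfamiliar with rotation classifiers, here is a small sketch of how rotation-prediction examples are commonly built, assuming the usual four-way formulation (0, 90, 180, 270 degrees). The functions below are illustrative and use plain nested lists; main_lincls.py may construct its rotation data differently.

```python
def rotate90(image):
    """Rotate a 2D image (a list of rows) by 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def rotation_examples(image):
    """Return [(rotated_image, label)] with label k meaning k * 90 degrees."""
    examples = []
    current = [list(row) for row in image]  # copy so the input is untouched
    for label in range(4):
        examples.append((current, label))
        current = rotate90(current)
    return examples


if __name__ == "__main__":
    tiny_image = [[1, 2],
                  [3, 4]]
    for rotated, label in rotation_examples(tiny_image):
        print(label, rotated)
```

The classifier is then trained to predict the label from the rotated image, so no human annotations are needed.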

Train 5 Folds of MoCov2 on the folds of your data.

To get started, train 5 MoCo models using only the base augmentation. To do this, run python slm_utils/submit_moco_folds.py.
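As a rough picture of what "folds of your data" can look like, the sketch below splits image indices into 5 folds in the standard cross-validation way, where each fold holds out one fifth of the data. This is an assumption about the split, and `make_folds` is a hypothetical helper; check slm_utils/submit_moco_folds.py for how the repository actually defines its folds.

```python
import random


def make_folds(num_images, k=5, seed=0):
    """Return k (train_indices, held_out_indices) pairs covering all images."""
    indices = list(range(num_images))
    random.Random(seed).shuffle(indices)  # reproducible shuffle
    # Fold i holds out every k-th shuffled index starting at position i,
    # so held-out sizes differ by at most one.
    held_out = [indices[i::k] for i in range(k)]
    folds = []
    for i in range(k):
        held = set(held_out[i])
        train = [idx for idx in indices if idx not in held]
        folds.append((train, held_out[i]))
    return folds


if __name__ == "__main__":
    for fold_id, (train, held) in enumerate(make_folds(103)):
        print(fold_id, len(train), len(held))
```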

Run SelfAug

Now, you must run SelfAug on your dataset. Note - some changes in dataloaders may be necessary depending on your dataset.

@Colorado, working on making this process cleaner.

For now, you will need to open faa_search_single_aug_minmax_w.py and edit the config there. I will change this to argparse soon. The most critical part is entering your checkpoint names, in fold order, under config.checkpoints.

The loss can be rotation, icl, or icl_and_rotation. If you use icl_and_rotation, you will need to normalize the loss_weights in the loss_weight dict so that the weight for each type of loss is 1/(average of that loss across the k folds). I would just use the losses reported in wandb (the rotation train loss, and the ICL loss from pretraining). Finally, Ray is maximizing the negative loss, so a negative weight means that the loss with that weight will be maximized.
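The normalization and sign convention above can be sketched like this. It is one reading of the description, not the repository's code: `normalized_weights` and `search_objective` are hypothetical helpers, the per-fold loss values are made up, and which loss gets the negative sign is chosen here purely for illustration.

```python
from statistics import mean


def normalized_weights(fold_losses, signs):
    """Weight per loss type: sign / (average of that loss across folds).

    fold_losses: {loss_name: [loss value for each fold]}
    signs:       {loss_name: +1 or -1}
    """
    return {name: signs[name] / mean(values)
            for name, values in fold_losses.items()}


def search_objective(losses, weights):
    """Negative weighted loss, i.e. the quantity the search maximizes.

    A positive weight pushes that loss down; a negative weight pushes it up.
    """
    return -sum(weights[name] * losses[name] for name in weights)


if __name__ == "__main__":
    # Made-up per-fold losses, e.g. as read from wandb.
    fold_losses = {
        "rotation": [1.0, 1.5, 0.5, 1.25, 0.75],
        "icl": [4.0, 6.0, 5.0, 5.5, 4.5],
    }
    signs = {"rotation": 1, "icl": -1}  # illustrative choice of signs
    weights = normalized_weights(fold_losses, signs)
    print(weights)
    print(search_objective({"rotation": 0.9, "icl": 5.5}, weights))
```

After normalization, each loss contributes on a comparable scale (about 1 at its average value), so neither term dominates the search just because its raw magnitude is larger.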

Retrain using new augmentations found by SelfAug.

Just make sure to change the augmentation path in the load_policies function in get_faa_transforms.py to the pickle file with your new augmentations. Then submit the job using slm_utils/submit_faa_moco.py.
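Before submitting a long job, it can help to confirm that the pickle file you point load_policies at actually exists and loads. The check below is a generic sketch: `load_policy_file` is a hypothetical helper, not part of get_faa_transforms.py, and it makes no assumption about the structure of the pickled policies. Only unpickle files you produced yourself or otherwise trust.

```python
import pickle
from pathlib import Path


def load_policy_file(path):
    """Load and return whatever object was pickled at `path`."""
    path = Path(path)
    if not path.is_file():
        raise FileNotFoundError(f"No policy file at {path}")
    with path.open("rb") as handle:
        return pickle.load(handle)


if __name__ == "__main__":
    import sys

    # Usage: python check_policies.py path/to/your_policies.pkl
    policies = load_policy_file(sys.argv[1])
    print(type(policies))
```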

Owner

Colorado Reed
