An essential implementation of BYOL in PyTorch + PyTorch Lightning

Last update: Sep 27, 2022

Related tags

Overview

Essential BYOL

A simple and complete implementation of Bootstrap your own latent: A new approach to self-supervised Learning in PyTorch + PyTorch Lightning.

Good stuff:

good performance (~67% linear eval accuracy on CIFAR100)
minimal code, easy to use and extend
multi-GPU / TPU and AMP support provided by PyTorch Lightning
ImageNet support (needs testing)
linear evaluation is performed during training without any additional forward pass
logging with Wandb

Performance

Linear Evaluation Accuracy

Here is the accuracy after training for 1000 epochs:

Dataset	[email protected]	[email protected]
CIFAR10	91.1%	99.8%
CIFAR100	67.0%	90.5%

Training and Validation Curves

CIFAR10

CIFAR100

Environment

conda create --name essential-byol python=3.8
conda activate essential-byol
conda install pytorch=1.7.1 torchvision=0.8.2 cudatoolkit=XX.X -c pytorch
pip install pytorch-lightning==1.1.6 pytorch-lightning-bolts==0.3 wandb opencv-python

The code has been tested using these versions of the packages, but it will probably work with slightly different environments as well. When your run the code (see below for commands), PyTorch Lightning will probably throw a warning, advising you to install additional packages as gym, sklearn and matplotlib. They are not needed for this implementation to work, but you can install them to get rid of the warnings.

Datasets

Three datasets are supported:

CIFAR10
CIFAR100
ImageNet

For imagenet you need to pass the appropriate --data_dir, while for CIFAR you can just pass --download to download the dataset.

Commands

The repo comes with minimal model specific arguments, check main.py for info. We also support all the arguments of the PyTorch Lightning trainer. Default parameters are optimized for CIFAR100 but can also be used for CIFAR10.

Sample commands for running CIFAR100 on a single GPU setup:

python main.py \
    --gpus 1 \
    --dataset CIFAR100 \
    --batch_size 256 \
    --max_epochs 1000 \
    --arch resnet18 \
    --precision 16 \
    --comment wandb-comment

and multi-GPU setup:

python main.py \
    --gpus 2 \
    --distributed_backend ddp \
    --sync_batchnorm \
    --dataset CIFAR100 \
    --batch_size 256 \
    --max_epochs 1000 \
    --arch resnet18 \
    --precision 16 \
    --comment wandb-comment

Logging

Logging is performed with Wandb, please create an account, and follow the configuration steps in the terminal. You can pass your username using --entity. Training and validation stats are logged at every epoch. If you want to completely disable logging use --offline.

Contribute

Help is appreciated. Stuff that needs work:

test ImageNet performance
exclude bias and bn from LARS adaptation (see comments in the code)

An essential implementation of BYOL in PyTorch + PyTorch Lightning

Related tags

Overview

Essential BYOL

Performance

Linear Evaluation Accuracy

Training and Validation Curves

CIFAR10

CIFAR100

Environment

Datasets

Commands

Logging

Contribute

Owner

Enrico Fini

A PoC Corporation Relationship Knowledge Graph System on top of Nebula Graph.

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

Learn other languages using artificial intelligence with python.

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Large scale embeddings on a single machine.

The source codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'

BraTs-VNet - BraTS(Brain Tumour Segmentation) using V-Net

CVPR2021 Content-Aware GAN Compression

MQBench Quantization Aware Training with PyTorch

Pytorch implementation of various High Dynamic Range (HDR) Imaging algorithms

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Used to record WKU's utility bills on a regular basis.

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

Make your master artistic punk avatar through machine learning world famous paintings.

A system used to detect whether a person is wearing a medical mask or not.

Hand tracking demo for DIY Smart Glasses with a remote computer doing the work

An essential implementation of BYOL in PyTorch + PyTorch Lightning

Related tags

Overview

Essential BYOL

Performance

Linear Evaluation Accuracy

Training and Validation Curves

CIFAR10

CIFAR100

Environment

Datasets

Commands

Logging

Contribute

Owner

Enrico Fini

A PoC Corporation Relationship Knowledge Graph System on top of Nebula Graph.

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

Learn other languages ​​using artificial intelligence with python.

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

Large scale embeddings on a single machine.

The source codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'

BraTs-VNet - BraTS(Brain Tumour Segmentation) using V-Net

CVPR2021 Content-Aware GAN Compression

MQBench Quantization Aware Training with PyTorch

Pytorch implementation of various High Dynamic Range (HDR) Imaging algorithms

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Used to record WKU's utility bills on a regular basis.

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

Make your master artistic punk avatar through machine learning world famous paintings.

A system used to detect whether a person is wearing a medical mask or not.

Hand tracking demo for DIY Smart Glasses with a remote computer doing the work

Learn other languages using artificial intelligence with python.