Parameter-ensemble-differential-evolution - Shows how to do parameter ensembling using differential evolution.

Last update: May 04, 2022

Overview

Ensembling parameters with differential evolution

This repository shows how to ensemble the parameters of two trained neural networks using differential evolution. The steps are:

Train two networks (architecturally same) on the same dataset (CIFAR-10 used here) but from two different random initializations.
Ensemble their weights using the following formula:
```
w_t = w_o * ema + (1 - ema) * w_p
```
w_o and w_p represent the learned parameters of the two networks.
Randomly initialize a network (same architecture as above) and set its parameters to w_t, computed with the formula above.
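
The interpolation in step 2 can be sketched in plain Python. This is a minimal illustration, not the code from the notebook: the layer names and values are made up, and weights are stored as flat lists of floats instead of framework tensors.

```python
def interpolate_weights(w_o, w_p, ema):
    """Return w_t = w_o * ema + (1 - ema) * w_p, layer by layer.

    w_o and w_p are hypothetical stand-ins for the weights of two trained
    networks: dicts mapping a layer name to a flat list of floats.
    """
    if w_o.keys() != w_p.keys():
        raise ValueError("both networks must share the same architecture")
    w_t = {}
    for name in w_o:
        if len(w_o[name]) != len(w_p[name]):
            raise ValueError(f"shape mismatch in layer {name!r}")
        w_t[name] = [
            ema * a + (1.0 - ema) * b for a, b in zip(w_o[name], w_p[name])
        ]
    return w_t


# Toy weights (assumed values); a real model would supply its own tensors.
w_o = {"dense/kernel": [1.0, 2.0], "dense/bias": [0.0]}
w_p = {"dense/kernel": [3.0, 6.0], "dense/bias": [1.0]}
w_t = interpolate_weights(w_o, w_p, ema=0.25)
print(w_t)  # {'dense/kernel': [2.5, 5.0], 'dense/bias': [0.75]}
```

With ema = 1 the result equals w_o, and with ema = 0 it equals w_p, so ema controls how much each network contributes.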

ema is usually chosen by the developer in an empirical manner. This project uses differential evolution to find it.
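
To give a feel for how differential evolution can search for ema, here is a dependency-free sketch of the DE/rand/1 scheme over a single parameter. It is an assumption-laden toy: the function name `differential_evolution_1d`, its defaults, and the `toy_loss` objective are hypothetical, and the objective stands in for whatever score (for example a validation loss of the interpolated network) one would actually optimize.

```python
import random


def differential_evolution_1d(objective, low, high, pop_size=12,
                              generations=60, mutation=0.7, seed=0):
    """Minimise a scalar objective over [low, high] with DE/rand/1.

    With a single parameter, binomial crossover always selects the mutant,
    so each trial is simply the clipped mutant.
    """
    rng = random.Random(seed)
    population = [rng.uniform(low, high) for _ in range(pop_size)]
    scores = [objective(x) for x in population]
    for _ in range(generations):
        for i in range(pop_size):
            # Pick three distinct members other than the current one.
            others = [j for j in range(pop_size) if j != i]
            a, b, c = rng.sample(others, 3)
            mutant = population[a] + mutation * (population[b] - population[c])
            trial = min(max(mutant, low), high)  # keep ema inside the bounds
            trial_score = objective(trial)
            # Greedy selection: keep the trial only if it is at least as good.
            if trial_score <= scores[i]:
                population[i], scores[i] = trial, trial_score
    best = min(range(pop_size), key=scores.__getitem__)
    return population[best], scores[best]


# Hypothetical stand-in for "loss of the network built with this ema".
w_o, w_p = [1.0, 2.0, 3.0], [3.0, 0.0, 5.0]
target = [0.3 * a + 0.7 * b for a, b in zip(w_o, w_p)]  # optimum at ema = 0.3


def toy_loss(ema):
    w_t = [ema * a + (1.0 - ema) * b for a, b in zip(w_o, w_p)]
    return sum((t - g) ** 2 for t, g in zip(w_t, target))


best_ema, best_loss = differential_evolution_1d(toy_loss, 0.0, 1.0)
print(f"best ema: {best_ema:.3f}, loss: {best_loss:.2e}")
```

In practice a library routine such as SciPy's `scipy.optimize.differential_evolution` would usually replace the hand-written loop; the sketch only makes the mutate, clip, and select steps visible.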

Below are the top-1 accuracies (on the CIFAR-10 test set) of the two individually trained models along with their ensembled variant:

Model one: 63.23%
Model two: 63.42%
Ensembled: 63.35%

With the more conventional average prediction ensembling, I was able to get to 64.92%. This is way better than what I got by ensembling the parameters. Nevertheless, the purpose of this project was to just try out an idea.
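
For comparison, average prediction ensembling leaves the weights alone and averages the class probabilities of the two models instead. A minimal sketch, assuming each model outputs one probability vector per image; the numbers below are invented for illustration and are not results from this project.

```python
def average_predictions(probs_one, probs_two):
    """Average two models' class probabilities and return predicted classes."""
    labels = []
    for p1, p2 in zip(probs_one, probs_two):
        mean = [(a + b) / 2.0 for a, b in zip(p1, p2)]
        # Index of the largest averaged probability.
        labels.append(max(range(len(mean)), key=mean.__getitem__))
    return labels


# Hypothetical softmax outputs for three images and three classes.
probs_one = [[0.6, 0.3, 0.1], [0.2, 0.5, 0.3], [0.4, 0.35, 0.25]]
probs_two = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6], [0.1, 0.6, 0.3]]
print(average_predictions(probs_one, probs_two))  # [0, 2, 1]
```

On the second and third images the two models disagree, and the averaged probabilities side with whichever model is more confident.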

Reproducing the results

Make sure the dependencies in requirements.txt are installed. Then train two models, with your working directory at the root of this project:

$ git clone https://github.com/sayakpaul/parameter-ensemble-differential-evolution
$ cd parameter-ensemble-differential-evolution
$ pip install -qr requirements.txt
$ for i in `seq 1 2`; do python train.py; done

Then just follow the ensemble-parameters.ipynb notebook. You can also use the networks I trained. Instructions are available inside the notebook.

Releases (v0.1.0)

v0.1.0 (Jan 2, 2022)

Owner

Sayak Paul
