Experiments for distributed optimization algorithms

Last update: Dec 04, 2022

Overview

Network-Distributed Algorithm Experiments

This repository contains a set of optimization algorithms and objective functions, and all code needed to reproduce experiments in:

"DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization" [PDF]. (code is in this file [link])
"Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction" [PDF]. (code is in the previous version of this repo [link])

Due to the random data generation procedure, resulting graphs may be slightly different from those appeared in the paper, but conclusions remain the same.

If you find this code useful, please cite our papers:

@article{li2021destress,
  title={DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization},
  author={Li, Boyue and Li, Zhize and Chi, Yuejie},
  journal={arXiv preprint arXiv:2110.01165},
  year={2021}
}

@article{li2020communication,
  title={Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction},
  author={Li, Boyue and Cen, Shicong and Chen, Yuxin and Chi, Yuejie},
  journal={Journal of Machine Learning Research},
  volume={21},
  pages={1--51},
  year={2020}
}

Implemented objective functions

The gradient implementations of all objective functions are checked numerically.

Linear regression

Linear regression with random generated data. The objective function is $f(w) = \frac{1}{N} \sum_i (y_i - x_i^\top w)^2$

Logistic regression

Logistic regression with $l$-2 or nonconvex regularization with random generated data or the Gisette dataset or datasets from libsvmtools. The objective function is $$ f(w) = - \frac{1}{N} * \Big(\sum_i y_i \log \frac{1}{1 + exp(w^T x_i)} + (1 - y_i) \log \frac{exp(w^T x_i)}{1 + exp(w^T x_i)} \Big) + \frac{\lambda}{2} | w |_2^2 + \alpha \sum_j \frac{w_j^2}{1 + w_j^2} $$

One-hidden-layer fully-connected neural netowrk

One-hidden-layer fully-connected neural network with softmax loss on the MNIST dataset.

Implemented optimization algorithms

Centralized optimization algorithms

Gradient descent
Stochastic gradient descent
Nesterov's accelerated gradient descent
SVRG
SARAH

Distributed optimization algorithms (i.e. with parameter server)

ADMM
DANE

Decentralized optimization algorithms

Decentralized gradient descent
Decentralized stochastic gradient descent
Decentralized gradient descent with gradient tracking
EXTRA
NIDS
Network-DANE/SARAH/SVRG
GT-SARAH
DESTRESS

Experiments for distributed optimization algorithms

Related tags

Overview

Network-Distributed Algorithm Experiments

Implemented objective functions

Linear regression

Logistic regression

One-hidden-layer fully-connected neural netowrk

Implemented optimization algorithms

Centralized optimization algorithms

Distributed optimization algorithms (i.e. with parameter server)

Decentralized optimization algorithms

Owner

Boyue Li

Official code of "Mitigating the Mutual Error Amplification for Semi-Supervised Object Detection"

Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Official implementation of the paper Image Generators with Conditionally-Independent Pixel Synthesis https://arxiv.org/abs/2011.13775

SIMULEVAL A General Evaluation Toolkit for Simultaneous Translation

DeiT: Data-efficient Image Transformers

A project that uses optical flow and machine learning to detect aimhacking in video clips.

A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.

Code repository accompanying the paper "On Adversarial Robustness: A Neural Architecture Search perspective"

Learning Features with Parameter-Free Layers (ICLR 2022)

ReferFormer - Official Implementation of ReferFormer

Neural style in TensorFlow! 🎨

Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)

This tutorial repository is to introduce the functionality of KGTK to first-time users

Example of semantic segmentation in Keras

Few-Shot Graph Learning for Molecular Property Prediction

基于Paddlepaddle复现yolov5，支持PaddleDetection接口

Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.

Source code for our paper "Molecular Mechanics-Driven Graph Neural Network with Multiplex Graph for Molecular Structures"

"NAS-Bench-301 and the Case for Surrogate Benchmarks for Neural Architecture Search".

Self-supervised learning (SSL) is a method of machine learning