Deep Distributed Control of Port-Hamiltonian Systems

Last update: Aug 17, 2022

Related tags

Deep Learning DeepDisCoPH

Overview

De(e)pendable Distributed Control of Port-Hamiltonian Systems (DeepDisCoPH)

This repository is associated to the paper [1] and it contains:

The full paper manuscript.
The code to reproduce numerical experiments.

Summary

By embracing the compositional properties of port-Hamiltonian (pH) systems, we characterize deep Hamiltonian control policies with built-in closed-loop stability guarantees — irrespective of the interconnection topology and the chosen neural network parameters. Furthermore, our setup enables leveraging recent results on well-behaved neural ODEs to prevent the phenomenon of vanishing gradients by design [2]. The numerical experiments described in the report and available in this repository corroborate the dependability of the proposed DeepDisCoPH architecture, while matching the performance of general neural network policies.

Report

The report as well as the corresponding Appendices can be found in the docs folder.

Installation of DeepDisCoPH

The following lines indicates how to install the Deep Distributed Control for Port-Hamiltonian Systems (DeepDisCoPH) package.

git clone https://github.com/DecodEPFL/DeepDisCoPH.git

cd DeepDisCoPH

python setup.py install

Basic usage

To train distributed controllers for the 12 robots in the xy-plane:

./run.py --model [MODEL]

where available values for MODEL are distributed_HDNN, distributed_HDNN_TI and distributed_MLP.

To plot the norms of the backward sensitivity matrices (BSMs) when training a distributed H-DNN as the previous example, run:

./bsm.py --layer [LAYER]

where available values for LAYER are 1,2,...,100. If LAYER=-1, then it is set to N. The LAYER parameter indicates the layer number at which we consider the loss function is evaluated.

Examples: formation control with collision avoidance

The following gifs show the trajectories of the robots before and after the training of a distributed H-DNN controller. The goal is to reach the target positions within T = 5 seconds while avoiding collisions.

Training performed for t in [0,5]. Trajectories shown for t in [0,6], highlighting that robots stay close to the desired position when the time horizon is extended (grey background).

Early stopping of the training

We verify that DeepDisCoPH controllers ensure closed-loop stability by design even during exploration. We train the DeepDisCoPH controller for 25%, 50% and 75% of the total number of iterations and report the results in the following gifs.

Training performed for t in [0,5]. Trajectories shown for t in [0,15]. The extended horizon, i.e. when t in [5,15], is shown with grey background. Partially trained distributed controllers exhibit suboptimal behavior, but never compromise closed-loop stability.

References

[1] Luca Furieri, Clara L. Galimberti, Muhammad Zakwan and Giancarlo Ferrrari Trecate. "Distributed neural network control with dependability guarantees: a compositional port-Hamiltonian approach", under review.

[2] Clara L. Galimberti, Luca Furieri, Liang Xu and Giancarlo Ferrrari Trecate. "Hamiltonian Deep Neural Networks Guaranteeing Non-vanishing Gradients by Design," arXiv:2105.13205, 2021.

Deep Distributed Control of Port-Hamiltonian Systems

Related tags

Overview

De(e)pendable Distributed Control of Port-Hamiltonian Systems (DeepDisCoPH)

Summary

Report

Installation of DeepDisCoPH

Basic usage

Examples: formation control with collision avoidance

Early stopping of the training

References

Owner

Dependable Control and Decision group - EPFL

NOD: Taking a Closer Look at Detection under Extreme Low-Light Conditions with Night Object Detection Dataset

The dataset of tweets pulling from Twitters with keyword: Hydroxychloroquine, location: US, Time: 2020

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

A rough implementation of the paper "A Steering Algorithm for Redirected Walking Using Reinforcement Learning"

Official code of our work, AVATAR: A Parallel Corpus for Java-Python Program Translation.

Building blocks for uncertainty-aware cycle consistency presented at NeurIPS'21.

NCVX (NonConVeX): A User-Friendly and Scalable Package for Nonconvex Optimization in Machine Learning.

[ICCV 2021 Oral] SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer

Joint Gaussian Graphical Model Estimation: A Survey

Easy and Efficient Object Detector

Taichi Course Homework Template

Pytorch implementation of Nueral Style transfer

An implementation of Fastformer: Additive Attention Can Be All You Need in TensorFlow

MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;

Developing your First ML Workflow of the AWS Machine Learning Engineer Nanodegree Program

Project looking into use of autoencoder for semi-supervised learning and comparing data requirements compared to supervised learning.

Code for database and frontend of webpage for Neural Fields in Visual Computing and Beyond.

Malware Bypass Research using Reinforcement Learning

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

Official implementation of "GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators" (NeurIPS 2020)