Application of the L2HMC algorithm to simulations in lattice QCD.

Last update: Dec 14, 2022

Overview

l2hmc-qcd

📊 Slides

Recent talk on Training Topological Samplers for Lattice Gauge Theory from the Machine Learning for High Energy Physics, on and off the Lattice @ ect* Trento (09/30/2021)

📒 Example Notebook

Accepted to the Deep Learning for Simulation (SimDL) Workshop at ICLR 2021
- 📚 : arXiv:2105.03418
- 📊 : poster

Overview

The L2HMC algorithm aims to improve upon HMC by optimizing a carefully chosen loss function which is designed to minimize autocorrelations within the Markov Chain, thereby improving the efficiency of the sampler.

This work is based on the original implementation: brain-research/l2hmc/.

A detailed description of the L2HMC algorithm can be found in the paper:

Generalizing Hamiltonian Monte Carlo with Neural Network

by Daniel Levy, Matt D. Hoffman and Jascha Sohl-Dickstein.

Broadly, given an analytically described target distribution, π(x), L2HMC provides a statistically exact sampler that:

Quickly converges to the target distribution (fast burn-in).
Quickly produces uncorrelated samples (fast mixing).
Is able to efficiently mix between energy levels.
Is capable of traversing low-density zones to mix between modes (often difficult for generic HMC).

L2HMC for LatticeQCD

Goal: Use L2HMC to efficiently generate gauge configurations for calculating observables in lattice QCD.

A detailed description of the (ongoing) work to apply this algorithm to simulations in lattice QCD (specifically, a 2D U(1) lattice gauge theory model) can be found in doc/main.pdf.

Organization

Dynamics / Network

The base class for the augmented L2HMC leapfrog integrator is implemented in the BaseDynamics (a tf.keras.Model object).

The GaugeDynamics is a subclass of BaseDynamics containing modifications for the 2D U(1) pure gauge theory.

The network is defined in l2hmc-qcd/network/functional_net.py.

Network Architecture

An illustration of the leapfrog layer updating (x, v) --> (x', v') can be seen below.

Lattice

Lattice code can be found in lattice.py, specifically the GaugeLattice object that provides the base structure on which our target distribution exists.

Additionally, the GaugeLattice object implements a variety of methods for calculating physical observables such as the average plaquette, ɸₚ, and the topological charge Q,

Training

The training loop is implemented in l2hmc-qcd/utils/training_utils.py .

To train the sampler on a 2D U(1) gauge model using the parameters specified in bin/train_configs.json:

$ python3 /path/to/l2hmc-qcd/l2hmc-qcd/train.py --json_file=/path/to/l2hmc-qcd/bin/train_configs.json

Or via the bin/train.sh script provided in bin/.

Features

Distributed training (via horovod): If horovod is installed, the model can be trained across multiple GPUs (or CPUs) by:

#!/bin/bash

TRAINER=/path/to/l2hmc-qcd/l2hmc-qcd/train.py
JSON_FILE=/path/to/l2hmc-qcd/bin/train_configs.json

horovodrun -np ${PROCS} python3 ${TRAINER} --json_file=${JSON_FILE}

Contact

Code author: Sam Foreman

Pull requests and issues should be directed to: saforem2

Citation

If you use this code or found this work interesting, please cite our work along with the original paper:

@misc{foreman2021deep,
      title={Deep Learning Hamiltonian Monte Carlo}, 
      author={Sam Foreman and Xiao-Yong Jin and James C. Osborn},
      year={2021},
      eprint={2105.03418},
      archivePrefix={arXiv},
      primaryClass={hep-lat}
}

@article{levy2017generalizing,
  title={Generalizing Hamiltonian Monte Carlo with Neural Networks},
  author={Levy, Daniel and Hoffman, Matthew D. and Sohl-Dickstein, Jascha},
  journal={arXiv preprint arXiv:1711.09268},
  year={2017}
}

Acknowledgement

This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under contract DE_AC02-06CH11357. This work describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the work do not necessarily represent the views of the U.S. DOE or the United States Government. Declaration of Interests - None.

Comments

Remove upper bound on python_requires

(I'm moving between meetings so can iterate on this more later, so excuse the very brief Issue for now).

At the moment the project has an upper bound on python_requires

https://github.com/saforem2/l2hmc-qcd/blob/2eb6ee63cc0c53b187e6d716f4c12f418c8b8515/setup.py#L165

Assuming that you're intending l2hmc to be a library and not an application, then I would highly recommend removing this for the reasons summarized in Henry's detailed blog post on the subject.

Congrats on getting l2hmc up on PyPI though! :snake: :rocket:

opened by matthewfeickert 2
Alpha
Pull upstream alpha branch into main

Major changes

new src/ hierarchical module organization

Contains skeleton implementation of 4D SU(3) lattice gauge model

src/l2hmc/lattice/gauge/lattice.py

Framework independent configuration

Unified configuration system simplifies logic, same configs used for both tensorflow and pytorch experiments

Plan to be able to specify which backend to use through config option

Unified (and framework independent) configurations between tensorflow and pytorch implementations

Definitions can be found in l2hmc-qcd/src/l2hmc/configs.py

Note: This is still very much a WIP. Many existing features still need to be re-implemented / updated into new code in src/.

Todo

[ ] Write unit tests

[ ] Use simple configs for end-to-end workflow test + integrate into CI

[ ] dynamic learning rate scheduling

[ ] Test 4D SU(3) numpy code

[ ] Write tensorflow and pytorch implementations of LatticeSU3 objects

[ ] Improved / simplified ( / trainable?) annealing schedule

[ ] Distributed training support

[ ] horovod

[ ] DDP for pytorch implementation

[ ] DeepSpeed from Microsoft??

[ ] Testing / inference logic

[ ] Automatic checkpointing

[ ] Metric logging

[ ] Tensorboard?

[ ] Sacred?

[ ] build custom dashboard? plot.ly?

[ ] Setup packaging / distribution through pip

[ ] Resolve issue
opened by saforem2 1
Alpha
Major upgrades to how training is initialized in l2hmc-qcd/utils/training_utils.py, particularly when trying to restore a model from an existing checkpoint.

Significant upgrades to logging mechanics in l2hmc-qcd/utils/logger.py and l2hmc-qcd/utils/logger_config.py which now use a RichHandler to nicely format log messages characterized by severity, including automatic file rotation, etc.

Improvements to test suite in l2hmc-qcd/tests/test_training.py, more robust tests on larger set of possible cases

TODO: Automate using github actions for CI

Improvements to l2hmc-qcd/dynamics/gauge_dynamics.py but still a WIP
opened by saforem2 1
Rich
General improvements, rewrote logging methods to use Rich for better formatting.

Adds dynamic (trainable) step size eps for each separate x and v updates, seems to generally increase the total energy towards the middle of the trajectory but it remains unclear if this corresponds to an improvement in the tunneling rate

Adds methods for calculating autocorrelations of the topological charge, as well as notebooks for generating the plots

Updates to the writeup in doc/main.pdf

Will likely be last changes to writeup before public release of official draft
opened by saforem2 1
Dev
Updates to README

Ability to load network with new training instance

Updates to doc/, removes old sections related to debugging the bias in the plaquette
opened by saforem2 1
Saveable model
Complete rewrite of dynamics.xnet and dynamics.vnet models to use tf.keras.functional Models.

Additional changes include:

Non-Compact Projection update for gauge fields

Ability to specify convolution structure to be prepended at beginning of gauge network
opened by saforem2 1
Dev

Removes models/gauge_model.py entirely.

Instead, a base dynamics class is implemented in dynamics/dynamics.py, and an example subclass is provided in dynamics/gauge_dynamics.py.

opened by saforem2 1
Split networks

Major rewrite of existing codebase.

This pull request updates everything to be compatible with tensorflow >= 2.2 and removes a bunch of redundant legacy code.

opened by saforem2 1
Dev
Dynamics object is now compatible with tf >= 2.0

Running inference on trained model with tensorflow now creates identical graphs and summary files to numpy inference code

Inference with numpy now uses object oriented structure

Adds LaTeX + PDF documentation in doc/
opened by saforem2 1
Cooley dev

Adds new GaugeNetwork architecture as the default for training GaugeModel

Additionally, replaces pickle with joblib for saving data as .z compressed files (as opposed to .pkl files).

opened by saforem2 1
Testing

Implemented nnehmc_loss calculation for an alternative loss function using the approach suggested in https://infoscience.epfl.ch/record/264887/files/robust_parameter_estimation.pdf.

This modified loss function can be chosen (instead of the standard loss described in the original paper) by passing --use_nnehmc_loss as a command line argument.

opened by saforem2 1

Packaging and PyPI distribution?

As you've made a library and are using it as such:

# snippet from toy_distributions.ipynb

# append parent directory to `sys.path`
# to load from modules in `../l2hmc-qcd/`
module_path = os.path.join('..')
if module_path not in sys.path:
    sys.path.append(module_path)

# Local imports
from utils.attr_dict import AttrDict
from utils.training_utils import train_dynamics
from dynamics.config import DynamicsConfig
from dynamics.base_dynamics import BaseDynamics
from dynamics.generic_dynamics import GenericDynamics
from network.config import LearningRateConfig
from config import (State, NetWeights, MonteCarloStates,
                    BASE_DIR, BIN_DIR, TF_FLOAT)

from utils.distributions import (plot_samples2D, contour_potential,
                                 two_moons_potential, sin_potential,
                                 sin_potential1, sin_potential2)

do you have any plans and/or interest in packaging it as a Python library so it can either be pip installed from GitHub or be distributed on PyPI?

opened by matthewfeickert 5

Releases(0.12.0)

0.12.0(Aug 9, 2022)

Source code(tar.gz)
Source code(zip)
0.8.0(Apr 14, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.7.0...0.8.0
Source code(tar.gz)
Source code(zip)
0.7.0(Apr 14, 2022)

pypi release: v0.7.0

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.4.0...0.7.0
Source code(tar.gz)
Source code(zip)
0.4.0(Apr 8, 2022)

Full Changelog: https://github.com/saforem2/l2hmc-qcd/compare/0.3.0...0.4.0
Source code(tar.gz)
Source code(zip)

Owner

Sam Foreman

Computational science Postdoc at Argonne National Laboratory working on applying machine learning to simulations in lattice QCD.

GitHub Repository https://samforeman.me/l2hmc-qcd

EdiBERT is a generative model based on a bi-directional transformer, suited for image manipulation

EdiBERT, a generative model for image editing EdiBERT is a generative model based on a bi-directional transformer, suited for image manipulation. The

16 Dec 07, 2022

DvD-TD3: Diversity via Determinants for TD3 version

DvD-TD3: Diversity via Determinants for TD3 version The implementation of paper Effective Diversity in Population Based Reinforcement Learning. Instal

3 Feb 11, 2022

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning

Here is deepparse. Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning. Use deepparse to Use the pr

192 Dec 20, 2022

PyTorch implementation of CVPR'18 - Perturbative Neural Networks

This is an attempt to reproduce results in Perturbative Neural Networks paper. See original repo for details.

57 May 14, 2021

Symbolic Music Generation with Diffusion Models

Symbolic Music Generation with Diffusion Models Supplementary code release for our work Symbolic Music Generation with Diffusion Models. Installation

119 Jan 07, 2023

BboxToolkit is a tiny library of special bounding boxes.

BboxToolkit is a light codebase collecting some practical functions for the special-shape detection, such as oriented detection

73 Jan 01, 2023

PyTorch implementation of Densely Connected Time Delay Neural Network

Densely Connected Time Delay Neural Network PyTorch implementation of Densely Connected Time Delay Neural Network (D-TDNN) in our paper "Densely Conne

64 Oct 11, 2022

This is an open source library implementing hyperbox-based machine learning algorithms

hyperbox-brain is a Python open source toolbox implementing hyperbox-based machine learning algorithms built on top of scikit-learn and is distributed

21 Dec 14, 2022

Geneva is an artificial intelligence tool that defeats censorship by exploiting bugs in censors

1.5k Jan 06, 2023

Histology images query (unsupervised)

110-1-NTU-DBME5028-Histology-images-query Final Project: Histology images query (unsupervised) Kaggle: https://www.kaggle.com/c/histology-images-query

1 Jan 05, 2022

Meli Data Challenge 2021 - First Place Solution

My solution for the Meli Data Challenge 2021

23 Mar 09, 2022

TalkingHead-1KH is a talking-head dataset consisting of YouTube videos

TalkingHead-1KH Dataset TalkingHead-1KH is a talking-head dataset consisting of YouTube videos, originally created as a benchmark for face-vid2vid: On

173 Dec 29, 2022

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

G2LTex This repository contains the implementation of "Texture Mapping for 3D Reconstruction with RGB-D Sensor (CVPR2018)" based on mvs-texturing. Due

129 Dec 30, 2022

YOLOv7 - Framework Beyond Detection

🔥🔥🔥🔥 YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥

3k Jan 01, 2023

Codes for AAAI22 paper "Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum"

Paper For more details, please see our paper Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum which has been accepted a

14 Sep 30, 2022

Unofficial PyTorch implementation of Google AI's VoiceFilter system

VoiceFilter Note from Seung-won (2020.10.25) Hi everyone! It's Seung-won from MINDs Lab, Inc. It's been a long time since I've released this open-sour

883 Jan 07, 2023

A sequence of Jupyter notebooks featuring the 12 Steps to Navier-Stokes

CFD Python Please cite as: Barba, Lorena A., and Forsyth, Gilbert F. (2018). CFD Python: the 12 steps to Navier-Stokes equations. Journal of Open Sour

2.6k Dec 30, 2022

Implement of "Training deep neural networks via direct loss minimization" in PyTorch for 0-1 loss

This is the implementation of "Training deep neural networks via direct loss minimization" published at ICML 2016 in PyTorch. The implementation targe

1 Jan 18, 2022

The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition

BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition Boyan Zhou, Quan Cui, Xiu-Shen Wei*, Zhao-Min Chen This repo

616 Dec 21, 2022

SmoothGrad implementation in PyTorch

SmoothGrad implementation in PyTorch PyTorch implementation of SmoothGrad: removing noise by adding noise. Vanilla Gradients SmoothGrad Guided backpro

143 Jan 05, 2023

Application of the L2HMC algorithm to simulations in lattice QCD.

Related tags

Overview

l2hmc-qcd

📊 Slides

📒 Example Notebook

Overview

L2HMC for LatticeQCD

Organization

Dynamics / Network

Network Architecture

Lattice

Training

Features

Contact

Citation

Acknowledgement

Comments

Major changes

Todo

Releases(0.12.0)

0.12.0(Aug 9, 2022)

0.8.0(Apr 14, 2022)

0.7.0(Apr 14, 2022)

0.4.0(Apr 8, 2022)

Owner

Sam Foreman

EdiBERT is a generative model based on a bi-directional transformer, suited for image manipulation

DvD-TD3: Diversity via Determinants for TD3 version

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning

PyTorch implementation of CVPR'18 - Perturbative Neural Networks

Symbolic Music Generation with Diffusion Models

BboxToolkit is a tiny library of special bounding boxes.

PyTorch implementation of Densely Connected Time Delay Neural Network

This is an open source library implementing hyperbox-based machine learning algorithms

Geneva is an artificial intelligence tool that defeats censorship by exploiting bugs in censors

Histology images query (unsupervised)

Meli Data Challenge 2021 - First Place Solution

TalkingHead-1KH is a talking-head dataset consisting of YouTube videos

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

YOLOv7 - Framework Beyond Detection

Codes for AAAI22 paper "Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum"

Unofficial PyTorch implementation of Google AI's VoiceFilter system

A sequence of Jupyter notebooks featuring the 12 Steps to Navier-Stokes

Implement of "Training deep neural networks via direct loss minimization" in PyTorch for 0-1 loss

The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition

SmoothGrad implementation in PyTorch