The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

Last update: Oct 19, 2022

Overview

Enformer TPU training script (wip)

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters, in an effort to migrate the model to pytorch.

This was pieced together from the Deepmind Enformer repository, the colab training notebook, as well as Basenji sequence augmentation code

It accounts for:

distributed TPU training
distributed datasets
distributed validation
gradient clipping
cross replica batchnorms
dataset augmentation

Training takes about 3 days on v3-64

Todo

fix script for differences in sequence length in basenji training data, which is ~130k vs ~190k bp as in paper

Citations

@article {Avsec2021.04.07.438649,
    author  = {Avsec, {\v Z}iga and Agarwal, Vikram and Visentin, Daniel and Ledsam, Joseph R. and Grabska-Barwinska, Agnieszka and Taylor, Kyle R. and Assael, Yannis and Jumper, John and Kohli, Pushmeet and Kelley, David R.},
    title   = {Effective gene expression prediction from sequence by integrating long-range interactions},
    elocation-id = {2021.04.07.438649},
    year    = {2021},
    doi     = {10.1101/2021.04.07.438649},
    publisher = {Cold Spring Harbor Laboratory},
    URL     = {https://www.biorxiv.org/content/early/2021/04/08/2021.04.07.438649},
    eprint  = {https://www.biorxiv.org/content/early/2021/04/08/2021.04.07.438649.full.pdf},
    journal = {bioRxiv}
}

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

Related tags

Overview

Enformer TPU training script (wip)

Todo

Citations

Owner

Phil Wang

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

ObjDetApp deploys a pytorch model for object detection

A toolkit for developing and comparing reinforcement learning algorithms.

A basic implementation of Layer-wise Relevance Propagation (LRP) in PyTorch.

Classify bird species based on their songs using SIamese Networks and 1D dilated convolutions.

95.47% on CIFAR10 with PyTorch

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models

Multi-objective gym environments for reinforcement learning.

Space Invaders For Python

The modify PyTorch version of Siam-trackers which are speed-up by TensorRT.

Red Team tool for exfiltrating files from a target's Google Drive that you have access to, via Google's API.

OREO: Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning (NeurIPS 2021)

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Pre-trained NFNets with 99% of the accuracy of the official paper

Retinal Vessel Segmentation with Pixel-wise Adaptive Filters (ISBI 2022)

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

SeisComP/SeisBench interface to enable deep-learning (re)picking in SeisComP

This is a simple plugin for Vim that allows you to use OpenAI Codex.

NICE-GAN — Official PyTorch Implementation Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

Related tags

Overview

Enformer TPU training script (wip)

Todo

Citations

Owner

Phil Wang

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

*ObjDetApp* deploys a pytorch model for object detection

A toolkit for developing and comparing reinforcement learning algorithms.

A basic implementation of Layer-wise Relevance Propagation (LRP) in PyTorch.

Classify bird species based on their songs using SIamese Networks and 1D dilated convolutions.

95.47% on CIFAR10 with PyTorch

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models

Multi-objective gym environments for reinforcement learning.

Space Invaders For Python

The modify PyTorch version of Siam-trackers which are speed-up by TensorRT.

Red Team tool for exfiltrating files from a target's Google Drive that you have access to, via Google's API.

OREO: Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning (NeurIPS 2021)

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Pre-trained NFNets with 99% of the accuracy of the official paper

Retinal Vessel Segmentation with Pixel-wise Adaptive Filters (ISBI 2022)

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

SeisComP/SeisBench interface to enable deep-learning (re)picking in SeisComP

This is a simple plugin for Vim that allows you to use OpenAI Codex.

NICE-GAN — Official PyTorch Implementation Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation

ObjDetApp deploys a pytorch model for object detection