Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

Last update: Dec 29, 2022

gtn_applications

An applications library using GTN. Current examples include:

Offline handwriting recognition
Automatic speech recognition

Installing

1. Build Python bindings for the GTN library.
2. conda activate gtn_env # using the same environment from Step 1
3. conda install pytorch torchvision -c pytorch
4. pip install -r requirements.txt

Training

The following example shows how to train on the IAM offline handwriting recognition benchmark.

First register for access to the IAM dataset, then download it with the script below, passing the email and password you registered with:

./datasets/download/iamdb.sh <path_to_data> <email> <password>

Then update the configuration JSON configs/iamdb/tds2d.json to point to the data path used above:

  "data" : {
    "dataset" : "iamdb",
    "data_path" : "<path_to_data>",
    "num_features" : 64
  },
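The same edit can be scripted rather than done by hand. The following is a minimal sketch, assuming the config keeps the "data" / "data_path" layout shown above; the helper name set_data_path is hypothetical and not part of the repository.

```python
import json
from pathlib import Path


def set_data_path(config_path, data_path):
    """Point the "data_path" entry of a training config at a dataset directory.

    Hypothetical helper: assumes the JSON file has a top-level "data" object
    containing a "data_path" key, as in the snippet above.
    """
    config_file = Path(config_path)
    config = json.loads(config_file.read_text())
    # Only "data" -> "data_path" is changed; all other settings are kept.
    config["data"]["data_path"] = str(data_path)
    # Write the config back with two-space indentation.
    config_file.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

For example, set_data_path("configs/iamdb/tds2d.json", "/path/to/data") would rewrite the config in place. Note that the file is re-serialized, so its whitespace may differ from the original.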

Single GPU training can be run with:

python train.py --config configs/iamdb/tds2d.json

To run distributed training with multiple GPUs:

python train.py --config configs/iamdb/tds2d.json --world_size <NUM_GPUS>
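When launching runs from a script, the two commands above can be assembled programmatically. This is a small sketch that uses only the --config and --world_size flags shown above; the helper name build_train_command is hypothetical.

```python
import sys


def build_train_command(config_path, num_gpus=1):
    """Build the argument list for train.py (hypothetical helper).

    Uses only the flags shown in this README: --config always, and
    --world_size when more than one GPU is requested.
    """
    cmd = [sys.executable, "train.py", "--config", str(config_path)]
    if num_gpus > 1:
        # Distributed training: one process per GPU.
        cmd += ["--world_size", str(num_gpus)]
    return cmd
```

The returned list can be passed to subprocess.run(cmd, check=True) from the repository root.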

For a list of options, run:

python train.py -h

Contributing

Use Black to format Python code.

First install:

pip install black

Then run with:

black <file>.py

License

GTN is licensed under an MIT license. See LICENSE.

Owner

Facebook Research
