ParaGen

ParaGen is a PyTorch deep learning framework for parallel sequence generation. Beyond sequence generation, ParaGen also supports various NLP tasks, including sequence-level classification, extraction, and generation.

Requirements and Installation

Install the third-party dependencies:

apt-get install libopenmpi-dev libssl-dev openssh-server

To install ParaGen from source:

cd ParaGen
pip install -e .

For distributed training, you need to make sure horovod has been installed.

# Installing horovod requires CMake (https://cmake.org/install/).
pip install horovod

Install lightseq for faster training:

pip install lightseq
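
To confirm which of the optional packages ended up installed, a quick check like the one below can help. It is a minimal sketch that uses only the Python standard library; the helper name `optional_package_status` is hypothetical and not part of ParaGen.

```python
import importlib.util


def optional_package_status(names):
    """Return a dict mapping each top-level package name to True if it is importable."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # horovod is needed for distributed training; lightseq speeds up training.
    for name, found in optional_package_status(["horovod", "lightseq"]).items():
        print(f"{name}: {'installed' if found else 'missing'}")
```

A package reported as missing only matters if you need the feature it enables.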

Getting Started

Before using ParaGen, it helps to get an overview of how it works.

ParaGen is designed as a task-oriented framework, where the task is the core of all the code. A specific task selects all the components it needs to support itself, such as model architectures, training strategies, datasets, and data processing. Any component within ParaGen can be customized, while the existing modules and methods serve as a plug-in library.
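
The idea of a task selecting its components from a plug-in library can be illustrated with a small registry. This is only a sketch of the general pattern: `REGISTRY`, `register`, `Task`, and the toy classes are hypothetical names chosen for illustration and are not taken from ParaGen's code.

```python
# Hypothetical sketch: none of these names come from ParaGen's actual API.
REGISTRY = {"model": {}, "dataset": {}}


def register(kind, name):
    """Decorator that adds a class to the plug-in library under `kind`."""
    def wrap(cls):
        REGISTRY[kind][name] = cls
        return cls
    return wrap


@register("model", "toy_transformer")
class ToyTransformer:
    def __init__(self, hidden_size=8):
        self.hidden_size = hidden_size


@register("dataset", "toy_text")
class ToyTextDataset:
    def __init__(self, path="data.txt"):
        self.path = path


class Task:
    """A task picks its components by name and builds them from a config."""

    def __init__(self, config):
        self.components = {
            kind: REGISTRY[kind][spec["class"]](**spec.get("args", {}))
            for kind, spec in config.items()
        }


task = Task({
    "model": {"class": "toy_transformer", "args": {"hidden_size": 16}},
    "dataset": {"class": "toy_text"},
})
print(type(task.components["model"]).__name__, task.components["model"].hidden_size)
```

In a design like this, swapping a component means changing a name in the config, and a custom component only needs to be registered under a new name.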

Because tasks are the core of ParaGen, it works in various modes, such as train, evaluate, preprocess, and serve. A task behaves differently in each mode by reorganizing its components, without any code modification.
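
One way to picture a task reorganizing its components per mode is a lookup from mode to the components that mode assembles. The sketch below is an assumption made for illustration: `ToyTask`, `MODE_COMPONENTS`, and the component names are hypothetical, and the real mapping in ParaGen may differ.

```python
# Hypothetical sketch: class, attribute, and component names are illustrative only.
class ToyTask:
    """Assembles a different set of components depending on the mode."""

    # Which components each mode assembles (an assumption for illustration).
    MODE_COMPONENTS = {
        "preprocess": ["dataset"],
        "train": ["dataset", "model", "trainer"],
        "evaluate": ["dataset", "model", "evaluator"],
        "serve": ["model"],
    }

    def __init__(self, mode):
        if mode not in self.MODE_COMPONENTS:
            raise ValueError(f"unknown mode: {mode}")
        self.mode = mode

    def build(self):
        """Return the names of the components assembled for this mode."""
        return list(self.MODE_COMPONENTS[self.mode])


for mode in ("train", "evaluate", "preprocess", "serve"):
    print(mode, "->", ToyTask(mode).build())
```

The task class itself stays the same across modes; only the mode argument changes which components are put together.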

Please refer to examples for detailed instructions.

ParaGen Usage and Contribution

We welcome any experimental algorithms built on ParaGen. To contribute:

Install ParaGen;
Create your own paragen-plugin libraries under third_party;
Experiment with your own algorithms;
Write a reproducible shell script;
Create a merge request and assign any of us as reviewers.


Owner

Bytedance Inc.
