poutyne-transformers

Train 🤗 -transformers models with Poutyne.

Installation

pip install poutyne-transformers

Example

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from datasets import load_dataset
from torch.utils.data import DataLoader
from torch import optim
from poutyne import Model
from poutyne_transformers import TransformerCollator, model_loss, ModelWrapper

print('Loading model & tokenizer.')
transformer = AutoModelForSequenceClassification.from_pretrained('distilbert-base-cased', num_labels=2, return_dict=True)
tokenizer = AutoTokenizer.from_pretrained('distilbert-base-cased')

print('Loading & preparing dataset.')
dataset = load_dataset("imdb")
dataset = dataset.map(lambda entry: tokenizer(entry['text'], add_special_tokens=True, padding='max_length', truncation=True), batched=True)
dataset = dataset.remove_columns(['text'])
dataset.set_format('torch')

collate_fn = TransformerCollator()
train_dataloader = DataLoader(dataset['train'], batch_size=16, collate_fn=collate_fn)
test_dataloader = DataLoader(dataset['test'], batch_size=16, collate_fn=collate_fn)

print('Preparing training.')
wrapped_transformer = ModelWrapper(transformer)
optimizer = optim.AdamW(wrapped_transformer.parameters(), lr=5e-5)
device = torch.device('cuda:0' if torch.cuda.is_available() else "cpu")
model = Model(wrapped_transformer, optimizer, loss_function=model_loss, device=device)

print('Starting training.')
model.fit_generator(train_dataloader, test_dataloader, epochs=1)

Train 🤗-transformers model with Poutyne.

Related tags

Overview

poutyne-transformers

Installation

Example

Owner

Lennart Keller

KR-FinBert And KR-FinBert-SC

Задания КЕГЭ по информатике 2021 на Python

Code for Editing Factual Knowledge in Language Models

Stanford CoreNLP provides a set of natural language analysis tools written in Java

Full Spectrum Bioinformatics - a free online text designed to introduce key topics in Bioinformatics using the Python

Library for fast text representation and classification.

Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

An assignment from my grad-level data mining course demonstrating some experience with NLP/neural networks/Pytorch

(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.

DeepAmandine is an artificial intelligence that allows you to talk to it for hours, you won't know the difference.

The simple project to separate mixed voice (2 clean voices) to 2 separate voices.

Source code for CsiNet and CRNet using Fully Connected Layer-Shared feedback architecture.

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Code for PED: DETR For (Crowd) Pedestrian Detection

Black for Python docstrings and reStructuredText (rst).

Knowledge Oriented Programming Language

test

Exploration of BERT-based models on twitter sentiment classifications

Anomaly Detection 이상치 탐지 전처리 모듈