A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models

Last update: Oct 23, 2022

Related tags

Overview

wav2vec-toolkit

A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models

This repository accompanies the 🤗 HuggingFace Community Paper on finetuning Wav2Vec2 XLSR for low-resource languages [link]

How to contribute

(Mostly identical to the huggingface/datasets contributing guide)

Fork the repository by clicking on the 'Fork' button on the repository's page. This creates a copy of the code under your GitHub user account.

Clone your fork to your local disk, and add the base repository as a remote:

git clone [email protected]:<your Github handle>/wav2vec-toolkit.git
cd wav2vec-toolkit
git remote add upstream https://github.com/anton-l/wav2vec-toolkit.git

Create a new branch to hold your development changes:
```
git checkout -b a-descriptive-name-for-my-changes
```
do not work on the master branch.
Set up a development environment by running the following command in a virtual environment:
```
pip install -e ".[dev]"
```
(If wav2vec-toolkit was already installed in the virtual environment, remove it with pip uninstall wav2vec_toolkit before reinstalling it in editable mode with the -e flag.)
Develop the features on your branch.
Format your code. Run black and isort so that your newly added files look nice with the following command:
```
black --line-length 119 --target-version py36 src scripts
isort src scripts
```
Once you're happy with your implementation, add your changes and make a commit to record your changes locally:
```
git add .
git commit
```
It is a good idea to sync your copy of the code with the original repository regularly. This way you can quickly account for changes:
```
git fetch upstream
git rebase upstream/main
```
Push the changes to your account using:
```
git push -u origin a-descriptive-name-for-my-changes
```
Once you are satisfied, go the webpage of your fork on GitHub. Click on "Pull request" to send your to the project maintainers for review.

A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models

Related tags

Overview

wav2vec-toolkit

How to contribute

Owner

Anton Lozhkov

The FinQA dataset from paper: FinQA: A Dataset of Numerical Reasoning over Financial Data

Code of paper: A Recurrent Vision-and-Language BERT for Navigation

The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN

An ultra fast tiny model for lane detection, using onnx_parser, TensorRTAPI, torch2trt to accelerate. our model support for int8, dynamic input and profiling. (Nvidia-Alibaba-TensoRT-hackathon2021)

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

a test times augmentation toolkit based on paddle2.0.

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

This library is testing the ethics of language models by using natural adversarial texts.

Exploration of BERT-based models on twitter sentiment classifications

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

Smart discord chatbot integrated with Dialogflow

CATs: Semantic Correspondence with Transformers

Translate U is capable of translating the text present in an image from one language to the other.

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

Applied Natural Language Processing in the Enterprise - An O'Reilly Media Publication

NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Malaya-Speech is a Speech-Toolkit library for bahasa Malaysia, powered by Deep Learning Tensorflow.

Use Tensorflow2.7.0 Build OpenAI'GPT-2

AudioCLIP Extending CLIP to Image, Text and Audio