A Convolutional Transformer for Keyword Spotting

Last update: Jan 27, 2022

Related tags

Overview

☢️ Audiomer ☢️

Audiomer: A Convolutional Transformer for Keyword Spotting

[ `arXiv` ]	[ `Previous SOTA` ]	[ `Model Architecture` ]

Results on SpeechCommands

Model Architecture

Performer Conv-Attention

Usage

To reproduce the results in the paper, follow the instructions:

To download the Speech Commands v2 dataset, run: python3 datamodules/SpeechCommands12.py
To train Audiomer-S and Audiomer-L on all three datasets thrice, run: python3 run_expts.py
To evaluate a model on a dataset, run: python3 evaluate.py --checkpoint_path /path/to/checkpoint.ckpt --model <model type> --dataset <name of dataset>.
For example: python3 evaluate.py --checkpoint_path ./epoch=300.ckpt --model S --dataset SC20

System requirements

NVIDIA GPU with CUDA
Python 3.6 or higher.
pytorch_lightning
torchaudio
performer_pytorch

Owner

GitHub Repository

Pretrained Cost Model for Distributed Constraint Optimization Problems

Pretrained Cost Model for Distributed Constraint Optimization Problems Requirements PyTorch 1.9.0 PyTorch Geometric 1.7.1 Directory structure baseline

2 Aug 28, 2022

An Open-Source Tool for Automatic Disease Diagnosis..

OpenMedicalChatbox An Open-Source Package for Automatic Disease Diagnosis. Overview Due to the lack of open source for existing RL-base automated diag

8 Nov 08, 2022

Audio Source Separation is the process of separating a mixture into isolated sounds from individual sources

Audio Source Separation is the process of separating a mixture into isolated sounds from individual sources (e.g. just the lead vocals).

14 Nov 07, 2022

The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends

Who's there? The spiritual successor to knockknock for PyTorch Lightning, to get a notification when your training is complete or when it crashes duri

70 Oct 06, 2022

BERT model training impelmentation using 1024 A100 GPUs for MLPerf Training v1.1

Pre-trained checkpoint and bert config json file Location of checkpoint and bert config json file This MLCommons members Google Drive location contain

12 Apr 27, 2022

An Implementation of Fully Convolutional Networks in Tensorflow.

Update An example on how to integrate this code into your own semantic segmentation pipeline can be found in my KittiSeg project repository. tensorflo

1.1k Dec 12, 2022

Extremely simple and fast extreme multi-class and multi-label classifiers.

napkinXC napkinXC is an extremely simple and fast library for extreme multi-class and multi-label classification, that focus of implementing various m

43 Nov 14, 2022

Meta-learning for NLP

Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks Code for training the meta-learning models and fine-tuning on downstr

43 Nov 08, 2022

Liecasadi - liecasadi implements Lie groups operation written in CasADi

liecasadi liecasadi implements Lie groups operation written in CasADi, mainly di

14 Nov 05, 2022

PyTorch implementation of normalizing flow models

242 Jan 02, 2023

Transformer model implemented with Pytorch

transformer-pytorch Transformer model implemented with Pytorch Attention is all you need-[Paper] Architecture Self-Attention self_attention.py class

12 Sep 03, 2022

Rotation-Only Bundle Adjustment

ROBA: Rotation-Only Bundle Adjustment Paper, Video, Poster, Presentation, Supplementary Material In this repository, we provide the implementation of

51 Nov 29, 2022

The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022

Contrastive Spatio Temporal Pretext Learning for Self-supervised Video Representation (AAAI 2022) The code for paper "Contrastive Spatio-Temporal Pret

8 Jun 30, 2022

A Convolutional Transformer for Keyword Spotting

Related tags

Overview

☢️ Audiomer ☢️

Results on SpeechCommands

Model Architecture

Performer Conv-Attention

Usage

System requirements

Owner

Pretrained Cost Model for Distributed Constraint Optimization Problems

An Open-Source Tool for Automatic Disease Diagnosis..

Audio Source Separation is the process of separating a mixture into isolated sounds from individual sources

The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends

BERT model training impelmentation using 1024 A100 GPUs for MLPerf Training v1.1

An Implementation of Fully Convolutional Networks in Tensorflow.

Extremely simple and fast extreme multi-class and multi-label classifiers.

Meta-learning for NLP

Liecasadi - liecasadi implements Lie groups operation written in CasADi

PyTorch implementation of normalizing flow models

Transformer model implemented with Pytorch

Rotation-Only Bundle Adjustment

The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

Person Re-identification

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Official Repository for Machine Learning class - Physics Without Frontiers 2021

FedScale: Benchmarking Model and System Performance of Federated Learning

ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.