On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

With the spirit of reproducible research, this repository contains codes required to produce the results in the manuscript:

P. Berjon, A. Nag, P. Brodeur, M. Checkley, A. Klinkert, S. Dev, On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition, under review.

Please cite the above paper if you intent to use whole/part of the code. This code is only for academic and research purposes.

Spectrogram of an English sample.

Code Organization

All codes are written in python3.

Dependencies

The following libraries should be installed before the execution of the codes.

numpy: pip install numpy
matplotlib: pip install matplotlib
opencv-python 4.3.0.36: pip install opencv-python
cv2: pip install cv2
tqdm: pip install tqdm
pycm: pip install pycm

Data

The audio samples can be found here : https://www.kaggle.com/rtatman/speech-accent-archive. GitHub doesn't allow me to give you all the spectrogram files generated, so I decided to give you at max 100 of each accent as an example. If you need all of it, feel free to use the dataset of spectrograms present on my personal Kaggle profile : https://www.kaggle.com/pberjon/accent-data.

Scripts

Each Python Notebook gives one contribution of the article (and one model I have used):

accentrecognitionsvm.ipynb : the SVM
accentrecognitioncnn2.ipynb : the 2-layer CNN
accentrecognitioncnn4.ipynb : the 4-layer CNN

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

Related tags

Overview

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

Code Organization

Dependencies

Data

Scripts

Owner

[AAAI 2022] Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

Official code for Next Check-ins Prediction via History and Friendship on Location-Based Social Networks (MDM 2018)

Pytorch implementation of "Geometrically Adaptive Dictionary Attack on Face Recognition" (WACV 2022)

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"

Raster Vision is an open source Python framework for building computer vision models on satellite, aerial, and other large imagery sets

Towards Representation Learning for Atmospheric Dynamics (AtmoDist)

Speech Recognition using DeepSpeech2.

The Fundamental Clustering Problems Suite (FCPS) summaries 54 state-of-the-art clustering algorithms, common cluster challenges and estimations of the number of clusters as well as the testing for cluster tendency.

Collection of common code that's shared among different research projects in FAIR computer vision team.

[IROS2021] NYU-VPR: Long-Term Visual Place Recognition Benchmark with View Direction and Data Anonymization Influences

This is the implementation of "SELF SUPERVISED REPRESENTATION LEARNING WITH DEEP CLUSTERING FOR ACOUSTIC UNIT DISCOVERY FROM RAW SPEECH" submitted to ICASSP 2022

Moving Object Segmentation in 3D LiDAR Data: A Learning-based Approach Exploiting Sequential Data

Code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization,

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Unsupervised Learning of Video Representations using LSTMs

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"

PyTorch Implementation of Region Similarity Representation Learning (ReSim)

Aerial Imagery dataset for fire detection: classification and segmentation (Unmanned Aerial Vehicle (UAV))

Fastshap: A fast, approximate shap kernel