Skip to content

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Notifications You must be signed in to change notification settings

Rootroo-ltd/FinnishDialectIdentification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Finnish Dialect Identification

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text". We present a text based model and a text + audio based model for automatically detecting Finnish dialects.

Proudly presented by Rootroo Ltd

The data

The data consists of several Finnish dialects, their transcriptions and audio files.

data size

The results

Here you can see the results of our models

data size

Business solutions

Rootroo logo

If you need NLP solutions for smaller languages like Finnish, we have your back! Rootroo offers consulting related to a variety of NLP tasks. We have a strong academic background in the state-of-the-art AI solutions for every NLP need. Just contact us, we won't bite.

The code, data and models

Everything has been released on Zenodo. Check out the Zenodo repository.

Cite

If you use the data, code or models, please cite our paper:

Hämäläinen, Mika; Alnajjar, Khalid; Partanen, Niko & Rueter, Jack (Accepted). Finnish Dialect Identification: The Effect of Audio and Text. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP).

About

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published