The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Last update: Dec 25, 2021

Overview

Finnish Dialect Identification

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text". We present a text based model and a text + audio based model for automatically detecting Finnish dialects.

Proudly presented by Rootroo Ltd

The data

The data consists of several Finnish dialects, their transcriptions and audio files.

The results

Here you can see the results of our models

Business solutions

If you need NLP solutions for smaller languages like Finnish, we have your back! Rootroo offers consulting related to a variety of NLP tasks. We have a strong academic background in the state-of-the-art AI solutions for every NLP need. Just contact us, we won't bite.

The code, data and models

Everything has been released on Zenodo. Check out the Zenodo repository.

Cite

If you use the data, code or models, please cite our paper:

Hämäläinen, Mika; Alnajjar, Khalid; Partanen, Niko & Rueter, Jack (Accepted). Finnish Dialect Identification: The Effect of Audio and Text. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP).

The repository for our EMNLP 2021 paper "Finnish Dialect Identification: The Effect of Audio and Text"

Related tags

Overview

Finnish Dialect Identification

The data

The results

Business solutions

The code, data and models

Cite

Owner

Rootroo Ltd

A variational Bayesian method for similarity learning in non-rigid image registration (CVPR 2022)

Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning

ICCV2021 Papers with Code

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss （ATVGnet）

Image process framework based on plugin like imagej, it is esay to glue with scipy.ndimage, scikit-image, opencv, simpleitk, mayavi...and any libraries based on numpy

Code repository for the paper Computer Vision User Entity Behavior Analytics

LoFTR:Detector-Free Local Feature Matching with Transformers CVPR 2021

Implementation of Online Label Smoothing in PyTorch

Code and data for "TURL: Table Understanding through Representation Learning"

Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized

Experiments with Fourier layers on simulation data.

YOLOX Win10 Project

Notebook and code to synthesize complex and highly dimensional datasets using Gretel APIs.

TensorFlow CNN for fast style transfer

Automatic detection and classification of Covid severity degree in LUS (lung ultrasound) scans

A set of tools for converting a darknet dataset to COCO format working with YOLOX

Vehicles Counting using YOLOv4 + DeepSORT + Flask + Ngrok

This tutorial repository is to introduce the functionality of KGTK to first-time users

This is the code of using DQN to play Sekiro .