PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Last update: Nov 12, 2021

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

NLP applications for code-mixed (CM) or mix-lingual text have gained a significant momentum recently, the main reason being the prevalence of language mixing in social media communications in multi-lingual societies like India, Mexico, Europe, parts of USA etc. Word embeddings are basic building blocks of any NLP system today, yet, word embedding for CM languages is an unexplored territory. The major bottleneck for CM word embeddings is switching points, where the language switches. These locations lack in contextually and statistical systems fail to model this phenomena due to high variance in the seen examples. In this paper we present our initial observations on applying switching point based positional encoding techniques for CM language, specifically Hinglish (Hindi - English). Results are only marginally better than SOTA, but it is evident that positional encoding could be an effective way to train position sensitive language models for CM text.

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

@inproceedings{ali-etal-relative,
title = {PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages},
author = {Mohsin Ali and Kandukuri Sai Teja and Sumanth Manduru and Parth Patwa and Amitava Das}
booktitle =  {Proceedings of the AAAI Conference on Artificial Intelligence},
year = {2022},}

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Related tags

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

Owner

Mohsin Ali, Mohammed

PyTorch/GPU re-implementation of the paper Masked Autoencoders Are Scalable Vision Learners

Pipeline for employing a Lightweight deep learning models for LOW-power systems

A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.

code for EMNLP 2019 paper Text Summarization with Pretrained Encoders

a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

This is a Keras implementation of a CNN for estimating age, gender and mask from a camera.

PyTorch implementation for COMPLETER: Incomplete Multi-view Clustering via Contrastive Prediction (CVPR 2021)

Neural networks applied in recognizing guitar chords using python, AutoML.NET with C# and .NET Core

PyTorch implementation of MSBG hearing loss model and MBSTOI intelligibility metric

Speech Recognition using DeepSpeech2.

MIMO-UNet - Official Pytorch Implementation

A torch.Tensor-like DataFrame library supporting multiple execution runtimes and Arrow as a common memory format

pybaum provides tools to work with pytrees which is a concept burrowed from JAX.

CAPRI: Context-Aware Interpretable Point-of-Interest Recommendation Framework

Contrastively Disentangled Sequential Variational Audoencoder

Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Simple PyTorch hierarchical models.

The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classiﬁer')