A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

Implementation of the paper A Structured Self-Attentive Sentence Embedding, published at ICLR 2017: https://arxiv.org/abs/1703.03130

USAGE:

For binary sentiment classification on the IMDB dataset, run: python classification.py "binary"

For multiclass classification on the Reuters dataset, run: python classification.py "multiclass"

You can change the model parameters in the model_params.json file. Other training parameters, such as the number of attention hops, can be configured in the config.json file.

To use pretrained GloVe embeddings, set the use_embeddings parameter to "True" (the default is False). Do not forget to download glove.6B.50d.txt and place it in the glove folder.
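For readers unfamiliar with the GloVe text format, here is a minimal sketch of how such a file could be parsed into an embedding matrix. Each line of the file holds a token followed by its vector components, separated by spaces. The function names `load_glove` and `build_embedding_matrix` are hypothetical and are not taken from this repository; the zero-vector fallback for unknown words is also an assumption.

```python
import io


def load_glove(handle, dim=50):
    """Parse GloVe text format: one token per line followed by `dim` floats."""
    vectors = {}
    for line in handle:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def build_embedding_matrix(vocab, vectors, dim=50):
    """Row i holds the vector for vocab[i]; words missing from GloVe get zeros."""
    return [vectors.get(word, [0.0] * dim) for word in vocab]


# Tiny in-memory stand-in for the real file (3 dimensions instead of 50).
sample = io.StringIO("good 0.1 0.2 0.3\nbad -0.1 -0.2 -0.3\n")
vectors = load_glove(sample, dim=3)
matrix = build_embedding_matrix(["good", "unseen", "bad"], vectors, dim=3)
print(matrix)
```

With the real file, the same function could be called on `open("glove/glove.6B.50d.txt", encoding="utf-8")` with the default `dim=50`.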

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights
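
The first two items can be illustrated with a short NumPy sketch of the formulation in the paper: the attention matrix is A = softmax(W_s2 tanh(W_s1 H^T)), the sentence embedding matrix is M = A H, and the penalty is the squared Frobenius norm of A A^T - I. NumPy is used here purely for illustration; this is not the repository's own model code, and the function names, shapes, and random inputs are assumptions.

```python
import numpy as np


def self_attention(H, W_s1, W_s2):
    """H: (n, 2u) hidden states. Returns A: (r, n) and M: (r, 2u)."""
    scores = W_s2 @ np.tanh(W_s1 @ H.T)                   # (r, n)
    scores = scores - scores.max(axis=1, keepdims=True)   # numerical stability
    weights = np.exp(scores)
    A = weights / weights.sum(axis=1, keepdims=True)      # softmax over the n tokens
    M = A @ H                                             # one embedding row per hop
    return A, M


def frobenius_penalty(A):
    """Squared Frobenius norm of (A A^T - I); discourages redundant hops."""
    r = A.shape[0]
    diff = A @ A.T - np.eye(r)
    return float(np.sum(diff ** 2))


# Hypothetical sizes: n tokens, hidden size 2u, attention size d_a, r hops.
rng = np.random.default_rng(0)
n, two_u, d_a, r = 6, 8, 5, 3
H = rng.standard_normal((n, two_u))
W_s1 = rng.standard_normal((d_a, two_u))
W_s2 = rng.standard_normal((r, d_a))

A, M = self_attention(H, W_s1, W_s2)
print(A.shape, M.shape, frobenius_penalty(A))
```

In training, the penalty would typically be scaled by a coefficient and added to the classification loss; the value of that coefficient is a hyperparameter not stated here.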

Instead of pruning, averaging over the sentence embeddings is used.
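
One plausible reading of this step, shown below as a sketch, is that the r rows of the sentence embedding matrix (one per attention hop) are averaged into a single vector before classification. This interpretation is an assumption, and the function name `average_hops` is hypothetical.

```python
import numpy as np


def average_hops(M):
    """Collapse an (r, 2u) embedding matrix into a single (2u,) vector."""
    return M.mean(axis=0)


# Two hops over a 3-dimensional hidden state (toy values).
M = np.array([[1.0, 2.0, 3.0],
              [3.0, 4.0, 5.0]])
sentence_vector = average_hops(M)
print(sentence_vector)
```

Averaging keeps the input to the classifier at size 2u regardless of the number of hops, which avoids the large fully connected layer that pruning is meant to shrink.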

Visualization:

After training, the model is tested on 100 test points. The attention weights for these 100 test points are retrieved and visualized over the text as heatmaps. A file, visualization.html, is saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.
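
To give a rough idea of how attention weights can be turned into an HTML heatmap, here is a minimal sketch. It is not the visualization code by Zhouhan Lin that this repository uses; the function name `render_attention_html`, the red color scale, and the sample tokens and weights are all assumptions for illustration.

```python
import html


def render_attention_html(tokens, weights):
    """Return HTML where each token's background opacity follows its weight."""
    peak = max(weights) or 1.0  # avoid division by zero if all weights are 0
    spans = []
    for token, weight in zip(tokens, weights):
        alpha = weight / peak
        spans.append(
            '<span style="background-color: rgba(255, 0, 0, {:.2f})">{}</span>'.format(
                alpha, html.escape(token)
            )
        )
    return " ".join(spans)


# Hypothetical sentence and attention weights for a single hop.
tokens = ["a", "truly", "great", "movie"]
weights = [0.05, 0.15, 0.60, 0.20]
page = "<html><body><p>{}</p></body></html>".format(
    render_attention_html(tokens, weights)
)
print(page)
```

Writing `page` to a file such as visualization/visualization.html and opening it in a browser would show the most attended word with the strongest highlight.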

Below is a screenshot of the visualization on a few data points.

Training accuracy: 93.4%. Tested on 1000 points with 90.2% accuracy.

Owner

Kaushal Shetty
