SAINT-pytorch

A Simple pyTorch implementation of "Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing" based on https://arxiv.org/abs/2002.07033.

SAINT: Separated Self-AttentIve Neural Knowledge Tracing. SAINT has an encoder-decoder structure where exercise and response embedding sequence separately enter the encoder and the decoder respectively, which allows to stack attention layers multiple times.

SAINT model architecture

Usage

import torch
import torch.nn as nn
import torch.nn.functional as F
import numpy as np
import copy

from saint import saint, random_data

seq_len = 100
total_ex = 1200
total_cat = 234
total_in = 2

in_ex, in_cat, in_de = random_data(64, 
                                seq_len , 
                                total_ex, 
                                total_cat, 
                                total_in)


model = saint(dim_model=128,
            num_en=6,
            num_de=6,
            heads_en=8,
            heads_de=8,
            total_ex=total_ex,
            total_cat=total_cat,
            total_in=total_in )

outs = model(in_ex, in_cat, in_de)

print(outs.shape)
# torch.Size([64, 100, 1])

Parameters

dim_model: int.
Dimension of model ( embeddings, attention, linear layers).
num_en: int.
Number of encoder layers.
num_de: int.
Number of decoder layers.
heads_en: int.
Number of heads in multi-head attention block in each layer of encoder.
heads_de: int.
Number of heads in multi-head attention block in each layer of decoder.
total_ex: int.
Total number of unique excercise.
total_cat: int.
Total number of unique concept categories.
total_in: int.
Total number of unique interactions.

todo

change positional embedding to sine.

Citations

@article{choi2020towards,
  title={Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing},
  author={Choi, Youngduck and Lee, Youngnam and Cho, Junghyun and Baek, Jineon and Kim, Byungsoo and Cha, Yeongmin and Shin, Dongmin and Bae, Chan and Heo, Jaewe},
  journal={arXiv preprint arXiv:2002.07033},
  year={2020}
}

@misc{vaswani2017attention,
    title   = {Attention Is All You Need},
    author  = {Ashish Vaswani and Noam Shazeer and Niki Parmar and Jakob Uszkoreit and Llion Jones and Aidan N. Gomez and Lukasz Kaiser and Illia Polosukhin},
    year    = {2017},
    eprint  = {1706.03762},
    archivePrefix = {arXiv},
    primaryClass = {cs.CL}
}

SAINT PyTorch implementation

Related tags

Overview

SAINT-pytorch

SAINT model architecture

Usage

Parameters

todo

Citations

Owner

Arshad Shaikh

The official implementation of "BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?, ACL 2021 main conference"

Legal text retrieval for python

Twitter-Sentiment-Analysis - Twitter sentiment analysis for india's top online retailers(2019 to 2022)

Python library for parsing resumes using natural language processing and machine learning

A very simple framework for state-of-the-art Natural Language Processing (NLP)

문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations

Tools for curating biomedical training data for large-scale language modeling

Yes it's true :broken_heart:

The ibet-Prime security token management system for ibet network.

CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조

Generating new names based on trends in data using GPT2 (Transformer network)

Levenshtein and Hamming distance computation

Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization

Share constant definitions between programming languages and make your constants constant again

Code for paper: An Effective, Robust and Fairness-awareHate Speech Detection Framework

This project uses word frequency and Term Frequency-Inverse Document Frequency to summarize a text.

ETM - R package for Topic Modelling in Embedding Spaces