Siamese-nn-semantic-text-similarity - A repository containing comprehensive Neural Networks based PyTorch implementations for the semantic text similarity task

Last update: Dec 15, 2022

Overview

Siamese Deep Neural Networks for Semantic Text Similarity PyTorch

A repository containing comprehensive Neural Networks based PyTorch implementations for the semantic text similarity task, including architectures such as:

Siamese LSTM
Siamese BiLSTM with Attention
Siamese Transformer
Siamese BERT.

Usage

install dependencies

pip install -r requirements.txt

download spacy en model for tokenization

python -m spacy download en

Siamese LSTM

Siamese LSTM Example

 ## init siamese lstm
    siamese_lstm = SiameseLSTM(
        batch_size=batch_size,
        output_size=output_size,
        hidden_size=hidden_size,
        vocab_size=vocab_size,
        embedding_size=embedding_size,
        embedding_weights=embedding_weights,
        lstm_layers=lstm_layers,
        device=device,
    )

    ## define optimizer
    optimizer = torch.optim.Adam(params=siamese_lstm.parameters())
   
   ## train model
    train_model(
        model=siamese_lstm,
        optimizer=optimizer,
        dataloader=sick_dataloaders,
        data=sick_data,
        max_epochs=max_epochs,
        config_dict={"device": device, "model_name": "siamese_lstm"},
    )

Siamese BiLSTM with Attention

Siamese BiLSTM with Attention Example

     ## init siamese lstm
     siamese_lstm_attention = SiameseBiLSTMAttention(
        batch_size=batch_size,
        output_size=output_size,
        hidden_size=hidden_size,
        vocab_size=vocab_size,
        embedding_size=embedding_size,
        embedding_weights=embedding_weights,
        lstm_layers=lstm_layers,
        self_attention_config=self_attention_config,
        fc_hidden_size=fc_hidden_size,
        device=device,
        bidirectional=bidirectional,
    )
    
    ## define optimizer
    optimizer = torch.optim.Adam(params=siamese_lstm_attention.parameters())
   
   ## train model
    train_model(
        model=siamese_lstm_attention,
        optimizer=optimizer,
        dataloader=sick_dataloaders,
        data=sick_data,
        max_epochs=max_epochs,
        config_dict={
            "device": device,
            "model_name": "siamese_lstm_attention",
            "self_attention_config": self_attention_config,
        },
    )

Siamese Transformer

Siamese Transformer Example

    ## init siamese bilstm with attention
    siamese_transformer = SiameseTransformer(
        batch_size=batch_size,
        vocab_size=vocab_size,
        embedding_size=embedding_size,
        nhead=attention_heads,
        hidden_size=hidden_size,
        transformer_layers=transformer_layers,
        embedding_weights=embedding_weights,
        device=device,
        dropout=dropout,
        max_sequence_len=max_sequence_len,
    )

    ## define optimizer
    optimizer = torch.optim.Adam(params=siamese_transformer.parameters())
   
   ## train model
    train_model(
        model=siamese_transformer,
        optimizer=optimizer,
        dataloader=sick_dataloaders,
        data=sick_data,
        max_epochs=max_epochs,
        config_dict={"device": device, "model_name": "siamese_transformer"},
    )

Siamese BERT

Siamese BERT Example

    from siamese_sts.siamese_net.siamese_bert import BertForSequenceClassification
    ## init siamese bert
    siamese_bert = BertForSequenceClassification.from_pretrained(model_name)

    ## train model
    trainer = transformers.Trainer(
        model=siamese_bert,
        args=transformers.TrainingArguments(
            output_dir="./output",
            overwrite_output_dir=True,
            learning_rate=1e-5,
            do_train=True,
            num_train_epochs=num_epochs,
            # Adjust batch size if this doesn't fit on the Colab GPU
            per_device_train_batch_size=batch_size,
            save_steps=3000,
        ),
        train_dataset=sick_dataloader,
    )
    trainer.train()

Siamese-nn-semantic-text-similarity - A repository containing comprehensive Neural Networks based PyTorch implementations for the semantic text similarity task

Related tags

Overview

Siamese Deep Neural Networks for Semantic Text Similarity PyTorch

Usage

Siamese LSTM

Siamese BiLSTM with Attention

Siamese Transformer

Siamese BERT

Owner

Shahrukh Khan

Scribble-Supervised LiDAR Semantic Segmentation, CVPR 2022 (ORAL)

My course projects for the 2021 Spring Machine Learning course at the National Taiwan University (NTU)

⚡ H2G-Net for Semantic Segmentation of Histopathological Images

Python package for visualizing the loss landscape of parameterized quantum algorithms.

Versatile Generative Language Model

Deep learning-based approach to discovering Granger causality networks in multivariate time series

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning"

Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning

Ejemplo Algoritmo Viterbi - Example of a Viterbi algorithm applied to a hidden Markov model on DNA sequence

EFENet: Reference-based Video Super-Resolution with Enhanced Flow Estimation

Compositional and Parameter-Efficient Representations for Large Knowledge Graphs

Pneumonia Detection using machine learning - with PyTorch

Repository of continual learning papers

Intent parsing and slot filling in PyTorch with seq2seq + attention

CS50x-AI - Artificial Intelligence with Python from Harvard University

Awesome Graph Classification - A collection of important graph embedding, classification and representation learning papers with implementations.

Unsupervised Image to Image Translation with Generative Adversarial Networks

ViDT: An Efficient and Effective Fully Transformer-based Object Detector

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

Mitsuba 2: A Retargetable Forward and Inverse Renderer