An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

Last update: Oct 21, 2022

Related tags

Overview

pl_prompt_sst

An example project using OpenPrompt under the framework of pytorch-lightning for a training prompt-based text classification model on SST2 sentiment analysis dataset. Leveraging the pytorch-lightning features like logging, gradient accumulation and early stopping, etc. Can be used as a template for further development.

Run

Install requirement

pip install -r requirements.txt

Setup the prompt to use in sst2/prompt_config.json

{
    "template_text": "{\"placeholder\": \"text_a\"} In summary, the film was {\"mask\"}.",
    "label_words": [["bad"], ["good"]]
}

Adjust the arguments in run.sh or the code below for your need, and run it.

CUDA_VISIBLE_DEVICES=0 python -u main.py --input_dir ./sst2 \
                                         --prompt_config_dir ./sst2/prompt_config.json \
                                         --model_class bert \
                                         --model_name_or_path prajjwal1/bert-tiny \
                                         --lr 2e-4
                                         --bs 32 \
                                         --max_seq_length 64 \
                                         --patience 4 \
                                         --accumulation 2 \
                                         --seed 666

In my preliminary experiment with the settings above, the model achieve 0.822 F1 compared to 0.820 without prompt.

Note

Can only be executed after this fix on state_dict()

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

Related tags

Overview

pl_prompt_sst

Run

Note

Owner

Zhiling Zhang

Korean extractive summarization. 2021 AI 텍스트 요약 온라인 해커톤 화성갈끄니까팀 코드

Seq2seq attn - Use the Seq2Seq method to implement machine translation and introduce Attention mechanism to improve the results

Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.

A raytrace framework using taichi language

Implementation of legal QA system based on SentenceKoBART

The Classical Language Toolkit

Py65 65816 - Add support for the 65C816 to py65

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.

Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"

Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning

Pytorch-Named-Entity-Recognition-with-BERT

PIZZA - a task-oriented semantic parsing dataset

Python bot created with Selenium that can guess the daily Wordle word correct 96.8% of the time.

Pattern Matching in Python

The source code of HeCo

Simple NLP based project without any use of AI

This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs".

Crie tokens de autenticação íntegros e seguros com UToken.

Multilingual finetuning of Machine Translation model on low-resource languages. Project for Deep Natural Language Processing course.

PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation