Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

Last update: Dec 30, 2022

Related tags

Overview

PTR

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

If you use the code, please cite the following paper:

@article{han2021ptr,
  title={PTR: Prompt Tuning with Rules for Text Classification},
  author={Han, Xu and Zhao, Weilin and Ding, Ning and Liu, Zhiyuan and Sun, Maosong},
  journal={arXiv preprint arXiv:2105.11259},
  year={2021}
}

Requirements

The model is implemented using PyTorch. The versions of packages used are shown below.

numpy>=1.18.0
scikit-learn>=0.22.1
scipy>=1.4.1
torch>=1.3.0
tqdm>=4.41.1
transformers>=4.0.0

Baselines

Some baselines, especially the baselines using entity markers, come from the project [RE_improved_baseline].

Datasets

We provide all the datasets and prompts used in our experiments.

Run the experiments

(1) For TACRED

mkdir results
cd results
mkdir tacred
cd tacred
mkdir train
mkdir val
mkdir test
cd ..
cd ..
cd code_script
bash run_large_tacred.sh

(2) For TACREV

mkdir results
cd results
mkdir tacrev
cd tacrev
mkdir train
mkdir val
mkdir test
cd ..
cd ..
cd code_script
bash run_large_tacrev.sh

(3) For RETACRED

mkdir results
cd results
mkdir retacred
cd retacred
mkdir train
mkdir val
mkdir test
cd ..
cd ..
cd code_script
bash run_large_retacred.sh

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

Related tags

Overview

PTR

Requirements

Baselines

Datasets

Run the experiments

(1) For TACRED

(2) For TACREV

(3) For RETACRED

Owner

THUNLP

A framework for evaluating Knowledge Graph Embedding Models in a fine-grained manner.

DeLighT: Very Deep and Light-Weight Transformers

A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"

A tool helps build a talk preview image by combining the given background image and talk event description

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

Command Line Text-To-Speech using Google TTS

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Machine learning models from Singapore's NLP research community

This is a GUI program that will generate a word search puzzle image

Sentiment Classification using WSD, Maximum Entropy & Naive Bayes Classifiers

jiant is an NLP toolkit

An A-SOUL Text Generator Based on CPM-Distill.

基于pytorch_rnn的古诗词生成

OpenChat: Opensource chatting framework for generative models

Treemap visualisation of Maya scene files

A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models

Speech Recognition Database Management with python

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

PyTorch Implementation of the paper Single Image Texture Translation for Data Augmentation

Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations