TransPrompt - Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification

Last update: Dec 21, 2022

Related tags

Overview

TransPrompt

This code is implement for our EMNLP 2021's paper 《TransPrompt：Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification》.

Our proposed TransPrompt is motivated by the join of prompt-tuning and cross-task transfer learning. The aim is to explore and exploit the transferable knowledge from similar tasks in the few-shot scenario, and make the Pre-trained Language Model (PLM) better few-shot transfer learner. Our proposed framework is accepted by the main conference (long paper track) in EMNLP-2021. This code is the default multi-GPU version. We will teach you how to use our code in the following parts.

Ps: We also commit the same code in Alibaba EasyTransfer.

1. Data Preparation

We follow PET to use the same dataset. Please run the scripts to download the data:

sh data/download_data.sh

or manually download the dataset from https://nlp.cs.princeton.edu/projects/lm-bff/datasets.tar.

Then you will obtain a new director data/original

Our work has two kind of scenario, such as single-task and cross-task. Different kind scenario has corresponding splited examples. Defaultly, we generate few-shot learning examples, you can also generate full data by edit the parameter (-scene=full). We only demostrate the few-shot data generation.

1.1 Single-task Few-shot

Please run the scripts to obtain the single-task few-shot examples:

python3 data_utils/generate_k_shot_data.py --scene few-shot --k 16

Then you will obtain a new folder data/k-shot-single

1.2 Cross-task Few-shot

Run the scripts

python3 data_utils/generate_k_shot_cross_task_data.py --scene few-shot --k 16

and you will obtain a new folder data/k-shot-cross

After the generation, the similar tasks will be divided into the same group. We have three groups:

Group1 (Sentiment Analysis): SST-2, MR, CR
Group2 (Natural Language Inference): MNLI, SNLI
Group3 (Paraphrasing): MRPC, QQP

2. Have a Training Games

Please follow our papers, we have mask following experiments:

Single-task few-shot learning: It is the same as LM-BFF and P-tuning, we prompt-tune the PLM only on one task.
Cross-task few-shot learning: We mix up the similar task in group. At first, we prompt-tune the PLM on cross-task data, then we prompt-tune on each task again. For the Cross-task Learning, we have two cross-task method:
(Cross-)Task Adaptation: In one group, we prompt-tune on all the tasks, and then evaluate on each task both in few-shot scenario.
(Cross-)Task Generalization: In one group, we randomly choose one task for few-shot evaluation (do not used for training), others are used for prompt-tuning.

2.1 Single-task few-shot learning

Take MRPC as an example, please run:

CUDA_VISIBLE_DEVICES=0 sh scripts/run_single_task.sh

2.2 Cross-task few-shot Learning (Task Adaptaion)

Take Group1 as an example, please run the scripts:

CUDA_VISIBLE_DEVICES=0 sh scripts/run_cross_task_adaptation.sh

2.3 Cross-task few-shot Learning (Task Generalization)

Also take Group1 as an example, please run the scripts: Ps: the unseen task is SST-2.

CUDA_VISIBLE_DEVICES=0 sh scripts/run_cross_task_generalization.sh

Citation

Our paper citation is:

@inproceedings{DBLP:conf/emnlp/0001WQH021,
  author    = {Chengyu Wang and
               Jianing Wang and
               Minghui Qiu and
               Jun Huang and
               Ming Gao},
  editor    = {Marie{-}Francine Moens and
               Xuanjing Huang and
               Lucia Specia and
               Scott Wen{-}tau Yih},
  title     = {TransPrompt: Towards an Automatic Transferable Prompting Framework
               for Few-shot Text Classification},
  booktitle = {Proceedings of the 2021 Conference on Empirical Methods in Natural
               Language Processing, {EMNLP} 2021, Virtual Event / Punta Cana, Dominican
               Republic, 7-11 November, 2021},
  pages     = {2792--2802},
  publisher = {Association for Computational Linguistics},
  year      = {2021},
  url       = {https://aclanthology.org/2021.emnlp-main.221},
  timestamp = {Tue, 09 Nov 2021 13:51:50 +0100},
  biburl    = {https://dblp.org/rec/conf/emnlp/0001WQH021.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

Acknowledgement

The code is developed based on pet. We appreciate all the authors who made their code public, which greatly facilitates this project. This repository would be continuously updated.

TransPrompt - Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification

Related tags

Overview

TransPrompt

1. Data Preparation

1.1 Single-task Few-shot

1.2 Cross-task Few-shot

2. Have a Training Games

2.1 Single-task few-shot learning

2.2 Cross-task few-shot Learning (Task Adaptaion)

2.3 Cross-task few-shot Learning (Task Generalization)

Citation

Acknowledgement

Owner

WangJianing

Simulation environments for the CrazyFlie quadrotor: Used for Reinforcement Learning and Sim-to-Real Transfer

X-modaler is a versatile and high-performance codebase for cross-modal analytics.

Learning Visual Words for Weakly-Supervised Semantic Segmentation

Accurate Phylogenetic Inference with Symmetry-Preserving Neural Networks

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

PyTorch Implementation for Deep Metric Learning Pipelines

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

Fast, flexible and fun neural networks.

A data-driven maritime port simulator

A model to classify a piece of news as REAL or FAKE

Node for thenewboston digital currency network.

Lunar is a neural network aimbot that uses real-time object detection accelerated with CUDA on Nvidia GPUs.

This is an official repository of CLGo: Learning to Predict 3D Lane Shape and Camera Pose from a Single Image via Geometry Constraints

Consecutive-Subsequence - Simple software to calculate susequence with highest sum

Keras implementation of AdaBound

A rough implementation of the paper "A Steering Algorithm for Redirected Walking Using Reinforcement Learning"

A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.

PyTorch implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation.

Reproduction process of AlexNet