Black-Box-Tuning

Source code for paper "Black-Box Tuning for Language-Model-as-a-Service".

Being busy recently, the code in this repo and this tutorial will be very brief. Please let me know if you find any issues.

Prepare your environment

The implementation of Black-Box Tuning is quite simple, you can check our code and easily implement it in your own environment. Or you can create a new environment to run our implementation, which is based on Nevergrad, Transformers and FastNLP. Optionally, we use fitlog to monitor experimental results. You can uncomment the fitlog-related lines in our code to use it.

conda create --name bbt python=3.8
conda activate bbt
pip install transformers==4.1.1
pip install datasets
pip install fastNLP
pip install nevergrad
pip install sklearn
git clone https://github.com/txsun1997/Black-Box-Tuning
cd Black-Box-Tuning

Optimize your prompt without gradients

Now you can run Black-Box Tuning with run.sh:

bash run.sh

Results will be saved in a directory named results/. In general, you will obtain the following results:

SST-2 split	Best Accuracy
Train	100
Dev	96.87
Test	88.19

To reproduce other experiments in our paper, change the arguments of bbt.py, for example,

python bbt.py --task_name "agnews" --n_prompt_tokens 50 --intrinsic_dim 500 --k_shot 16 --device "cuda:0" --seed 42 --loss_type "hinge" --cat_or_add "add" --budget 8000

Cite

If you find this work helpful, please cite:

@article{sun2022bbt,
  title={Black-Box Tuning for Language-Model-as-as-Service}, 
  author={Tianxiang Sun and Yunfan Shao and Hong Qian and Xuanjing Huang and Xipeng Qiu},
  journal={arXiv preprint arXiv:2201.03514},
  year={2022}
}

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

Related tags

Overview

Black-Box-Tuning

Prepare your environment

Optimize your prompt without gradients

Cite

Owner

Tianxiang Sun

Space robot - (Course Project) Using the space robot to capture the target satellite that is disabled and spinning, then stabilize and fix it up

The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

Weakly-supervised object detection.

An Api for Emotion recognition.

Code Release for Learning to Adapt to Evolving Domains

Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2021)

Proto-RL: Reinforcement Learning with Prototypical Representations

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Code that accompanies the paper Semi-supervised Deep Kernel Learning: Regression with Unlabeled Data by Minimizing Predictive Variance

Code for "Multi-Time Attention Networks for Irregularly Sampled Time Series", ICLR 2021.

A general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform (Approved by OpenAI Gym)

Production First and Production Ready End-to-End Speech Recognition Toolkit

Fiddle is a Python-first configuration library particularly well suited to ML applications.

A pytorch implementation of faster RCNN detection framework (Use detectron2, it's a masterpiece)

Yolov3 pytorch implementation

Automatic differentiation with weighted finite-state transducers.

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

Official Implementation of "LUNAR: Unifying Local Outlier Detection Methods via Graph Neural Networks"

VQGAN+CLIP Colab Notebook with user-friendly interface.