Black-Box-Tuning

Source code for paper "Black-Box Tuning for Language-Model-as-a-Service".

Being busy recently, the code in this repo and this tutorial will be very brief. Please let me know if you find any issues.

Prepare your environment

The implementation of Black-Box Tuning is quite simple, you can check our code and easily implement it in your own environment. Or you can create a new environment to run our implementation, which is based on Nevergrad, Transformers and FastNLP. Optionally, we use fitlog to monitor experimental results. You can uncomment the fitlog-related lines in our code to use it.

conda create --name bbt python=3.8
conda activate bbt
pip install transformers==4.1.1
pip install datasets
pip install fastNLP
pip install nevergrad
pip install sklearn
git clone https://github.com/txsun1997/Black-Box-Tuning
cd Black-Box-Tuning

Optimize your prompt without gradients

Now you can run Black-Box Tuning with run.sh:

bash run.sh

Results will be saved in a directory named results/. In general, you will obtain the following results:

SST-2 split	Best Accuracy
Train	100
Dev	96.87
Test	88.19

To reproduce other experiments in our paper, change the arguments of bbt.py, for example,

python bbt.py --task_name "agnews" --n_prompt_tokens 50 --intrinsic_dim 500 --k_shot 16 --device "cuda:0" --seed 42 --loss_type "hinge" --cat_or_add "add" --budget 8000

Cite

If you find this work helpful, please cite:

@article{sun2022bbt,
  title={Black-Box Tuning for Language-Model-as-as-Service}, 
  author={Tianxiang Sun and Yunfan Shao and Hong Qian and Xuanjing Huang and Xipeng Qiu},
  journal={arXiv preprint arXiv:2201.03514},
  year={2022}
}

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

Related tags

Overview

Black-Box-Tuning

Prepare your environment

Optimize your prompt without gradients

Cite

Owner

Tianxiang Sun

Deploy optimized transformer based models on Nvidia Triton server

Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".

Streamlit component for TensorBoard, TensorFlow's visualization toolkit

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Whisper is a file-based time-series database format for Graphite.

The goal of the exercises below is to evaluate the candidate knowledge and problem solving expertise regarding the main development focuses for the iFood ML Platform team: MLOps and Feature Store development.

Deep-learning X-Ray Micro-CT image enhancement, pore-network modelling and continuum modelling

PyTorch implementation of UNet++ (Nested U-Net).

A repository for storing njxzc final exam review material

Multilingual Image Captioning

TensorFlow port of PyTorch Image Models (timm) - image models with pretrained weights.

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

MetaDrive: Composing Diverse Scenarios for Generalizable Reinforcement Learning

The Official Implementation of Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose [NIPS 2021].

Scientific Computation Methods in C and Python (Open for Hacktoberfest 2021)

NudeNet: Neural Nets for Nudity Classification, Detection and selective censoring

Variational autoencoder for anime face reconstruction

PyTorch Live is an easy to use library of tools for creating on-device ML demos on Android and iOS.

Official code for "On the Frequency Bias of Generative Models", NeurIPS 2021