This is a library for training and applying sparse fine-tunings with torch and transformers.

Last update: Dec 30, 2022

Related tags

Overview

This is a library for training and applying sparse fine-tunings with torch and transformers. Please refer to our paper Composable Sparse Fine-Tuning for Cross Lingual Transfer for background.

Installation

First, install Python 3.9 and PyTorch >= 1.9 (earlier versions may work but haven't been tested), e.g. using conda:

conda create -n sft python=3.9
conda activate sft
conda install pytorch cudatoolkit=11.1 -c pytorch -c conda-forge

Then download and install composable-sft:

git clone https://github.com/cambridgeltl/composable-sft.git
cd composable-sft
pip install -e .

Using pre-trained SFTs

Pre-trained SFTs can be downloaded directly and applied to models as follows:

from transformers import AutoConfig, AutoModelForTokenClassification
from sft import SFT

config = AutoConfig.from_pretrained(
    'bert-base-multilingual-cased',
    num_labels=17,
)

model = AutoModelForTokenClassification.from_pretrained(
    'bert-base-multilingual-cased',
    config=config,
)

language_sft = SFT('cambridgeltl/mbert-lang-sft-bxr-small') # SFT for Buryat
task_sft = SFT('cambridgeltl/mbert-task-sft-pos') # SFT for POS tagging

# Apply SFTs to pre-trained mBERT TokenClassification model
language_sft.apply(model)
task_sft.apply(model)

For a full list of pre-trained SFTs available, see MODELS

Example Scripts

Example scripts are provided in examples/ to show how to train SFTs using LT-SFT and evaluate them.

Citation

If you use this software, please cite the following paper:

@misc{ansell2021composable,
      title={Composable Sparse Fine-Tuning for Cross-Lingual Transfer},
      author={Alan Ansell and Edoardo Maria Ponti and Anna Korhonen and Ivan Vuli\'{c}},
      year={2021},
      eprint={2110.07560},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This is a library for training and applying sparse fine-tunings with torch and transformers.

Related tags

Overview

Installation

Using pre-trained SFTs

Example Scripts

Citation

Owner

Cambridge Language Technology Lab

Individual Tree Crown classification on WorldView-2 Images using Autoencoder -- Group 9 Weak learners - Final Project (Machine Learning 2020 Course)

[WWW 2021] Source code for "Graph Contrastive Learning with Adaptive Augmentation"

A general-purpose encoder-decoder framework for Tensorflow

FindFunc is an IDA PRO plugin to find code functions that contain a certain assembly or byte pattern, reference a certain name or string, or conform to various other constraints.

This is official implementaion of paper "Token Shift Transformer for Video Classification".

Official implementation of NeurIPS 2021 paper "One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective"

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Unofficial Implementation of MLP-Mixer, Image Classification Model

Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"

This is a pytorch implementation of the NeurIPS paper GAN Memory with No Forgetting.

Semantic Image Synthesis with SPADE

Implementation of Geometric Vector Perceptron, a simple circuit for 3d rotation equivariance for learning over large biomolecules, in Pytorch. Idea proposed and accepted at ICLR 2021

deep_image_prior_extension

A Unified Framework and Analysis for Structured Knowledge Grounding

code for ICCV 2021 paper 'Generalized Source-free Domain Adaptation'

Volumetric Correspondence Networks for Optical Flow, NeurIPS 2019.

Similarity-based Gray-box Adversarial Attack Against Deep Face Recognition

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

LSUN Dataset Documentation and Demo Code

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms