RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Marks Lab at Harvard.

Last update: Dec 22, 2022

Overview

RITA: a Study on Scaling Up Generative Protein Sequence Models

RITA is a family of autoregressive protein models, developed by a collaboration of Lighton, the OATML group at Oxford, and the Debbie Marks Lab at Harvard.

Model	#Params	d_model	layers	lm loss uniref-100
Small	85M	768	12	2.31
Medium	300M	1024	24	2.01
Large	680M	1536	24	1.82
XLarge	1.2B	2048	24	1.70

Results

For full results see our preprint: https://arxiv.org/abs/2205.05789

Usage

Instantiate a model like so:

from transformers import AutoModel, AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("lightonai/RITA_s, trust_remote_code=True")
tokenizer = AutoTokenizer.from_pretrained("lightonai/RITA_s")

for generation we support pipelines:

from transformers import pipeline
rita_gen = pipeline('text-generation', model=model, tokenizer=tokenizer)
sequences = rita_gen("MAB", max_length=20, do_sample=True, top_k=950, repetition_penalty=1.2, 
                     num_return_sequences=2, eos_token_id=2)
for seq in sequences:
    print(f"seq: {seq['generated_text'].replace(' ', '')}")

Or see example.py

How to cite

@article{hesslow2022rita,
  title={RITA: a Study on Scaling Up Generative Protein Sequence Models},
  author={Hesslow, Daniel and Zanichelli, Niccol{\'o} and Notin, Pascal and Poli, Iacopo and Marks, Debora},
  journal={arXiv preprint arXiv:2205.05789},
  year={2022}
}

RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Marks Lab at Harvard.

Related tags

Overview

RITA: a Study on Scaling Up Generative Protein Sequence Models

Results

Usage

How to cite

Owner

LightOn

Automatic Number Plate Recognition using Contours and Convolution Neural Networks (CNN)

Official repository of ICCV21 paper "Viewpoint Invariant Dense Matching for Visual Geolocalization"

Official repository for "Orthogonal Projection Loss" (ICCV'21)

Sky Computing: Accelerating Geo-distributed Computing in Federated Learning

COIN the currently largest dataset for comprehensive instruction video analysis.

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

The object detection pipeline is based on Ultralytics YOLOv5

RoadMap and preparation material for Machine Learning and Data Science - From beginner to expert.

Alfred-Restore-Iterm-Arrangement - An Alfred workflow to restore iTerm2 window Arrangements

a short visualisation script for pyvideo data

Python and C++ implementation of "MarkerPose: Robust real-time planar target tracking for accurate stereo pose estimation". Accepted at LXCV @ CVPR 2021.

buildseg is a building extraction plugin of QGIS based on PaddlePaddle.

Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)

Neural-fractal - Create Fractals Using Complex-Valued Neural Networks!

LETR: Line Segment Detection Using Transformers without Edges

PURE: End-to-End Relation Extraction

Proposed n-stage Latent Dirichlet Allocation method - A Novel Approach for LDA

Data Consistency for Magnetic Resonance Imaging

Progressive Coordinate Transforms for Monocular 3D Object Detection

GPT, but made only out of gMLPs