Benchmarking Pipeline for Prediction of Protein-Protein Interactions

Last update: Jun 27, 2022

Related tags

Overview

B4PPI

Benchmarking Pipeline for the Prediction of Protein-Protein Interactions

How this benchmarking pipeline has been built, and how to use it, is detailed in our preprint here (please cite it if you find this work useful!).

A minimal example is available here, and the list of requirements there.

How to use the gold standard

All the data files are in data, most of them are available as csv (sep='|') and pickled pandas DataFrames (sometimes the csv file may be missing due to file size constraints on GitHub).

The gold standard, without pre-processed features, can be loaded using:

goldStandard = pd.read_csv(
    os.path.join('data', 'benchmarkingGS_v1-0.csv'),
    sep='|'
)

Or with the pre-processed features:

goldStandard_with_featuresSeq = pd.read_pickle(
    os.path.join('data', 'benchmarkingGS_v1-0_similarityMeasure_sequence_v3-1.pkl')
)

UniProtIDs are used for both proteins A and B.
isInteraction is the ground truth from the IntAct database (1 = interacting proteins, 0 = non-interacting proteins).
trainTest is the split between training set (train), first testing set T1 (test1) and second testing set T2 (test2).
Pre-processed features are explained in the manuscript.

Training and evaluation can then be done normally. The code from the preprint is in the Training section.

How to cite this work

Lannelongue L., Inouye M., Construction of in silico protein-protein interaction networks across different topologies using machine learning, 2022, BioArxiv

Licence

This work is licensed under a Creative Commons Attribution 4.0 International License.

Credits

The code was written in Python 3.7.
Many libraries were used, in particular Pandas, Numpy, scikit-learn and PyTorch Lightning (full list in the code and in the requirements file).
Plots were drawn using Matplotlib, Seaborn and the MetBrewer colour palettes.
Logs were saved using Weight & Bias.

Benchmarking Pipeline for Prediction of Protein-Protein Interactions

Related tags

Overview

B4PPI

How to use the gold standard

How to cite this work

Licence

Credits

Owner

Loïc Lannelongue

Official implementation of "Synthetic Temporal Anomaly Guided End-to-End Video Anomaly Detection" (ICCV Workshops 2021: RSL-CV).

Example of semantic segmentation in Keras

This repository contains the code for the paper Neural RGB-D Surface Reconstruction

Model-based reinforcement learning in TensorFlow

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

MRI reconstruction (e.g., QSM) using deep learning methods

Keras implementation of the GNM model in paper ’Graph-Based Semi-Supervised Learning with Nonignorable Nonresponses‘

Run PowerShell command without invoking powershell.exe

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Neural models of common sense. 🤖

TensorFlow implementation of the algorithm in the paper "Decoupled Low-light Image Enhancement"

Exploration & Research into cross-domain MEV. Initial focus on ETH/POLYGON.

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

Lenia - Mathematical Life Forms

Code for "Typilus: Neural Type Hints" PLDI 2020

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022

OverFeat is a Convolutional Network-based image classifier and feature extractor.

Code for the paper: Fighting Fake News: Image Splice Detection via Learned Self-Consistency

GraPE is a Rust/Python library for high-performance Graph Processing and Embedding.