SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks (Scientific Reports)

Last update: Oct 15, 2022

Related tags

Overview

SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks

Molecular interaction networks are powerful resources for the discovery. While deep learning on graphs has dramatically advanced the prediction prowess, current graph neural network (GNN) methods are optimized for prediction on the basis of direct similarity between interacting nodes. In biological networks, however, similarity between nodes that do not directly interact has proved incredibly useful in the last decade across a variety of interaction networks.

Here, we present SkipGNN, it predicts molecular interactions by not only aggregating information from direct interactions but also from second-order interactions, which we call skip similarity. In contrast to existing GNNs, SkipGNN receives neural messages from two-hop neighbors as well as immediate neighbors in the interaction network and non-linearly transforms the messages to obtain useful information for prediction.

(Left) Traditionally, an interaction between nodes A and B implies that A and B are similar and vice versa. (Right) In contrast, in molecular interaction networks, directly interacting entities are not necessarily similar, which has been observed in numerous networks, including genetic interaction networks and protein-protein interaction networks.

Install

git clone https://github.com/kexinhuang12345/SkipGNN.git
cd SkipGNN
python setup.py install

Example

python train.py \
    --epochs 15 \
    --lr 5e-4 \
    --batch_size 256 \
    --hidden1 64 \
    --hidden2 16 \
    --hidden_decode1 512 \
    --network_type DTI \
    --data_path '../data/DTI/fold1' \
    --input_type one_hot

You can change the network_type to DTI, DDI, PPI, GDI. Please change the data_path accordingly.

In the paper, we use node2vec to initialize the node attributes. But empirically, we find simple one-hot position encoding is also good for SkipGNN. If you want to reproduce the result, you could put the node2vec embedding generated from this repo under data/DTI/fold1/dti.emb and set --input_type node2vec.

A Jupyter notebook example is provided in DEMO.

Dataset

We provide the dataset in the data folder.

Data	Source	Description	Processing Code
DTI	BIOSNAP	A drug-target interaction network betweeen 5,018 drugs that target 2,325 proteins with 15,139 interactions. The drugs are from the US market.	data_process_DTI.ipynb
DDI	BIOSNAP	A drug-drug interaction network betweeen 1,514 drugs with 48,514 interactions, which are approved by the FDA.	data_process_DDI.ipynb
PPI	HuRI	A protein-protein interaction network from the Human Reference Protein Interactome Mapping Project. We use the HuRI-III version from the L3 paper. It consists of 5,604 proteins with 23,322 interactions.	data_process_PPI.ipynb
GDI	DisGeNET	A disease-gene association network betweeen 9,413 genes and 10,370 diseases with 81,746 associations, which are curated from GWAS studies.	data_process_GDI.ipynb

Skip-Graph Construction

To integrate the power of skip-graph in your own GNN codes, you could simply apply a new GNN on the skip graph, which is generated using two lines. adj is a scipy.sparse adjacency matrix for the original graph.

adj_skip = adj.dot(adj)
adj_skip = adj_skip.sign()

See here for more details.

Cite Us

Cite arxiv for now:

@article{huang2020skipgnn,
  title={SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks},
  author={Huang, Kexin and Xiao, Cao and Glass, Lucas and Zitnik, Marinka and Sun, Jimeng},
  journal={arXiv preprint arXiv:2004.14949},
  year={2020}
}

The code framework is based on pygcn.

Contact

Please send questions to [email protected] or open an issue.

SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks (Scientific Reports)

Related tags

Overview

SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks

Install

Example

Dataset

Skip-Graph Construction

Cite Us

Contact

Owner

Kexin Huang

PPO is a very popular Reinforcement Learning algorithm at present.

Automatic library of congress classification, using word embeddings from book titles and synopses.

null

Keras implementations of Generative Adversarial Networks.

NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns.

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

This repository contains code used to audit the stability of personality predictions made by two algorithmic hiring systems

Accommodating supervised learning algorithms for the historical prices of the world's favorite cryptocurrency and boosting it through LightGBM.

Orthogonal Over-Parameterized Training

ROS support for Velodyne 3D LIDARs

An open-source, low-cost, image-based weed detection device for fallow scenarios.

PyTorch code for Composing Partial Differential Equations with Physics-Aware Neural Networks

High-Resolution Image Synthesis with Latent Diffusion Models

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

AI pipelines for Nvidia Jetson Platform

PyKaldi GOP-DNN on Epa-DB

Hamiltonian Dynamics with Non-Newtonian Momentum for Rapid Sampling

Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

Synthesizing and manipulating 2048x1024 images with conditional GANs

Latte: Cross-framework Python Package for Evaluation of Latent-based Generative Models