(Personalized) Page-Rank computation using PyTorch

Overview

torch-ppr


This package allows calculating page-rank and personalized page-rank via power iteration with PyTorch, which also supports calculation on GPU (or other accelerators).
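As a refresher, the underlying update is the standard (personalized) page-rank power iteration. Below is a minimal dense sketch for illustration only; the package itself operates on sparse adjacency matrices and adds validation, and the teleport probability 0.05 is an illustrative value.

import torch

def power_iteration_sketch(adj: torch.Tensor, alpha: float = 0.05, n_iter: int = 100) -> torch.Tensor:
    # Illustrative dense page-rank update: x <- (1 - alpha) * (A @ x) + alpha * x0,
    # where A is the column-normalized adjacency matrix and x0 is the teleport vector.
    n = adj.shape[0]
    adj = adj / adj.sum(dim=0, keepdim=True)  # column-normalize
    x0 = torch.full((n,), 1.0 / n)
    x = x0.clone()
    for _ in range(n_iter):
        x = (1.0 - alpha) * (adj @ x) + alpha * x0
    return x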

💪 Getting Started

As a simple example, consider the following graph with five nodes, in which node 1 is connected to nodes 0, 2, and 3, and node 2 is additionally connected to node 4.

Its edge list is given as

>>> import torch
>>> edge_index = torch.as_tensor(data=[(0, 1), (1, 2), (1, 3), (2, 4)]).t()

We can use

>>> from torch_ppr import page_rank
>>> page_rank(edge_index=edge_index)
tensor([0.1269, 0.3694, 0.2486, 0.1269, 0.1281])

to calculate the page rank, i.e., a measure of global importance. We observe that the central node 1 receives the largest importance score, while all other nodes have lower scores. Moreover, the two structurally indistinguishable nodes 0 and 3 receive the same page rank.
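If the number of nodes cannot be inferred from the edge index alone, it can also be passed explicitly via the num_nodes parameter (exposed since v0.0.4, see the changelog below); for the example above this would be:

>>> page_rank(edge_index=edge_index, num_nodes=5)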

We can also calculate personalized page rank which measures importance from the perspective of a single node. For instance, for node 2, we have

>>> from torch_ppr import personalized_page_rank
>>> personalized_page_rank(edge_index=edge_index, indices=[2])
tensor([[0.1103, 0.3484, 0.2922, 0.1103, 0.1388]])

Thus, the most important node is still the central node 1. Nodes 0 and 3 receive the same importance value, which is below that of node 4, a direct neighbor of node 2.

By virtue of using PyTorch, the code seamlessly works on GPUs, too, and supports auto-grad differentiation. Moreover, the calculation of personalized page rank supports automatic batch-size optimization via torch_max_mem.
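For example, a minimal sketch of running the computation on a GPU (assuming a CUDA device is available; taking the device from the input tensor is an assumption about the device-resolution behavior, not a documented guarantee):

import torch
from torch_ppr import page_rank

edge_index = torch.as_tensor(data=[(0, 1), (1, 2), (1, 3), (2, 4)]).t()

# Move the edge index to the GPU if one is available; the page-rank computation
# is then expected to run on that device (assumption).
if torch.cuda.is_available():
    edge_index = edge_index.to("cuda")

scores = page_rank(edge_index=edge_index)
print(scores)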

🚀 Installation

The most recent release can be installed from PyPI with:

$ pip install torch_ppr

The most recent code and data can be installed directly from GitHub with:

$ pip install git+https://github.com/mberr/torch-ppr.git

👐 Contributing

Contributions, whether filing an issue, making a pull request, or forking, are appreciated. See CONTRIBUTING.md for more information on getting involved.

👋 Attribution

⚖️ License

The code in this package is licensed under the MIT License.

🍪 Cookiecutter

This package was created with @audreyfeldroy's cookiecutter package using @cthoyt's cookiecutter-snekpack template.

🛠️ For Developers

See developer instructions

The final section of the README is for those who want to get involved by making a code contribution.

Development Installation

To install in development mode, use the following:

$ git clone git+https://github.com/mberr/torch-ppr.git
$ cd torch-ppr
$ pip install -e .

🥼 Testing

After cloning the repository and installing tox with pip install tox, the unit tests in the tests/ folder can be run reproducibly with:

$ tox

Additionally, these tests are automatically re-run with each commit in a GitHub Action.

📖 Building the Documentation

The documentation can be built locally using the following:

$ git clone git+https://github.com/mberr/torch-ppr.git
$ cd torch-ppr
$ tox -e docs
$ open docs/build/html/index.html

Building the documentation automatically installs the package as well as the docs extra specified in setup.cfg. Sphinx plugins like texext can be added there. Additionally, they need to be added to the extensions list in docs/source/conf.py.
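For example, registering texext might look like the following in docs/source/conf.py (a sketch; the real file contains additional settings, and the exact extension list may differ):

# docs/source/conf.py (sketch)
extensions = [
    "sphinx.ext.autodoc",  # assumed to be present already
    "texext",              # newly added plugin; also add it to the docs extra in setup.cfg
]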

📦 Making a Release

After installing the package in development mode and installing tox with pip install tox, the commands for making a new release are contained within the finish environment in tox.ini. Run the following from the shell:

$ tox -e finish

This script does the following:

  1. Uses Bump2Version to remove the -dev suffix from the version number in setup.cfg, src/torch_ppr/version.py, and docs/source/conf.py
  2. Packages the code into both a tar archive and a wheel using build
  3. Uploads to PyPI using twine. Be sure to have a .pypirc file configured to avoid the need for manual input at this step
  4. Pushes to GitHub. You'll need to create a release for the commit where the version was bumped.
  5. Bumps the version to the next patch. If you made big changes and want to bump the minor version instead, you can run tox -e bumpversion minor afterwards.
Comments
  • `torch.sparse.mm` breaking API changes


    Suddenly, everything stopped working 😱, presumably because of the changes to torch.sparse. Specifically, I am on PyTorch 1.10, the master branch of PyKEEN, and torch-ppr 0.0.5.

    Problem 1: the allclose() check does not pass now: https://github.com/mberr/torch-ppr/blob/921898f1a4b7770e6cdd1931e935262e456eb3c9/src/torch_ppr/utils.py#L221-L222

    MWE:

    import torch
    from torch_ppr import page_rank
    
    from pykeen.datasets import FB15k237
    
    dataset = FB15k237(create_inverse_triples=False)
    edges = dataset.training.mapped_triples[:, [0, 2]].t()
    pr = page_rank(edge_index=torch.cat([edges, edges.flip(0)], dim=-1), num_nodes=dataset.num_entities)
    
    >> ValueError: Invalid column sum: tensor([1.0000, 1.0000, 1.0000,  ..., 1.0000, 1.0000, 1.0000]). expected 1.0
    

    Looking into the debugger:

    • adj_sum does sum up to the number of nodes
    • the default tolerance fails the check, but if I loosen it to rtol=1e-4 or atol=1e-4, the check passes

    Problem 2: the signature of torch.sparse.addmm has changed from the one used in power_iteration, so the API call fails with an unknown-keyword-argument error.

    https://github.com/mberr/torch-ppr/blob/921898f1a4b7770e6cdd1931e935262e456eb3c9/src/torch_ppr/utils.py#L310

    In fact, I can't find where the kwargs input, sparse, dense come from, because the current signature uses the less readable mat, mat1, mat2. I traced back as far as Torch 1.3.0 and still can't find where they originated. Where does this signature come from? 😅
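    For reference, a minimal sketch of a version-agnostic call using positional matrix arguments (placeholder values; column normalization omitted, and not necessarily the exact upstream fix):

    import torch

    # One power-iteration step, x_next = alpha * x0 + (1 - alpha) * (adj @ x),
    # expressed via torch.sparse.addmm with positional matrix arguments,
    # which are accepted by both the old and the new signature.
    indices = torch.tensor([[0, 1, 1, 2], [1, 2, 3, 4]])
    adj = torch.sparse_coo_tensor(indices, torch.ones(4), size=(5, 5))
    adj = (adj + adj.t()).coalesce()

    x = torch.full((5, 1), 0.2)   # current page-rank vector (placeholder values)
    x0 = torch.full((5, 1), 0.2)  # teleportation vector (placeholder values)
    alpha = 0.05

    x_next = torch.sparse.addmm(x0, adj, x, beta=alpha, alpha=1.0 - alpha)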

    My test env

    torch                 1.10.0
    torch-ppr             0.0.5
    
    opened by migalkin 7
  • Incorporating edge weights


    Hello,

    Thank you for this great repository; it is a handy package that performs very well! I was wondering, however: is it possible to incorporate edge weights into the personalized page-rank method?

    Best Filip

    opened by Filco306 5
  • RuntimeError torch.sparse.addmm different torch tensor shape


    Dear torch-ppr

    I installed torch-ppr on my Mac with Python 3.9 and ran the example code:

    >>> import torch
    >>> edge_index = torch.as_tensor(data=[(0, 1), (1, 2), (1, 3), (2, 4)]).t()
    >>> from torch_ppr import page_rank
    >>> page_rank(edge_index)
    

    I got a RuntimeError:

    x = torch.sparse.addmm(input=x0, sparse=adj, dense=x, beta=alpha, alpha=beta)
    RuntimeError: mat1 and mat2 shapes cannot be multiplied (2x4 and 2x1)
    

    I printed the shapes of x0, adj, and x:

    torch.Size([2, 1])
    torch.Size([2, 4])
    torch.Size([2, 1])
    

    I believe that the shape of adj should be 2x2, but I might be wrong. I found where adj is defined:

    # convert to sparse matrix, shape: (n, n)
    adj = edge_index_to_sparse_matrix(edge_index=edge_index, num_nodes=num_nodes)
    adj = adj + adj.t()
    

    The adj is symmetric.

    How can I fix this RuntimeError? Any suggestions are welcome. Thanks in advance. meatball1982, 12-May-2022 09:54:50

    opened by meatball1982 4
  • Expose API functions from top-level


    Also update the cookiecutter package in https://github.com/cthoyt/cookiecutter-snekpack/commit/fa032ffc3c718c208d3a03e212aaa299c193de94 so that this is part of the template by default.

    opened by cthoyt 2
  • Formulate page-rank as a torch.nn Layer


    Thank you for this repo!

    The reason to request a 'layer' formulation is to convert the page_rank function to an ONNX graph with torch.onnx (which only accepts models).

    Once I have the ONNX model, I can compile it for different hardware (other than CUDA).

    It may only need the forward pass, with no backward pass, although I think the computation would be differentiable.

    Thanks.
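    A minimal sketch of such a wrapper (hypothetical; not part of the package, and whether torch.onnx can trace through the sparse power iteration is a separate question):

    import torch
    from torch import nn
    from torch_ppr import page_rank

    class PageRankLayer(nn.Module):
        # Hypothetical module exposing page_rank via forward() so that it can be
        # handed to tools that expect an nn.Module, e.g. torch.onnx.export.
        def __init__(self, num_nodes: int):
            super().__init__()
            self.num_nodes = num_nodes

        def forward(self, edge_index: torch.Tensor) -> torch.Tensor:
            return page_rank(edge_index=edge_index, num_nodes=self.num_nodes)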

    opened by LM-AuroTripathy 8
Releases (v0.0.8)
  • v0.0.8(Jul 20, 2022)

    What's Changed

    • Update error message of validate_adjacency by @mberr in https://github.com/mberr/torch-ppr/pull/18
    • Add option to add identity matrix by @mberr in https://github.com/mberr/torch-ppr/pull/20

    Full Changelog: https://github.com/mberr/torch-ppr/compare/v0.0.7...v0.0.8

  • v0.0.7(Jun 29, 2022)

    What's Changed

    • Fix torch 1.12 compat by @mberr in https://github.com/mberr/torch-ppr/pull/17

    Full Changelog: https://github.com/mberr/torch-ppr/compare/v0.0.6...v0.0.7

  • v0.0.6(Jun 29, 2022)

    What's Changed

    • Fix language tag in docs by @cthoyt in https://github.com/mberr/torch-ppr/pull/13
    • Fix torch.sparse.addmm use by @mberr in https://github.com/mberr/torch-ppr/pull/12
    • Enable CI on multiple versions of pytorch by @cthoyt in https://github.com/mberr/torch-ppr/pull/14
    • Improve sparse CSR support by @mberr in https://github.com/mberr/torch-ppr/pull/15
    • Increase numerical tolerance by @mberr in https://github.com/mberr/torch-ppr/pull/16

    Full Changelog: https://github.com/mberr/torch-ppr/compare/v0.0.5...v0.0.6

  • v0.0.5(May 12, 2022)

    What's Changed

    • Improve input validation by @mberr in https://github.com/mberr/torch-ppr/pull/10

    Full Changelog: https://github.com/mberr/torch-ppr/compare/v0.0.4...v0.0.5

  • v0.0.4(May 10, 2022)

    What's Changed

    • Expose num_nodes parameter by @mberr in https://github.com/mberr/torch-ppr/pull/8

    Full Changelog: https://github.com/mberr/torch-ppr/compare/v0.0.3...v0.0.4

  • v0.0.3(May 10, 2022)

    What's Changed

    • Add imports to code examples in README by @cthoyt in https://github.com/mberr/torch-ppr/pull/6
    • Expose API functions from top-level by @cthoyt in https://github.com/mberr/torch-ppr/pull/7

    New Contributors

    • @cthoyt made their first contribution in https://github.com/mberr/torch-ppr/pull/6

    Full Changelog: https://github.com/mberr/torch-ppr/compare/v0.0.2...v0.0.3

  • v0.0.2(May 9, 2022)

    What's Changed

    • Fix device resolution order by @mberr in https://github.com/mberr/torch-ppr/pull/5

    Full Changelog: https://github.com/mberr/torch-ppr/compare/v0.0.1...v0.0.2

  • v0.0.1(May 6, 2022)

Owner
Max Berrendorf