An implementation of the 1. Parallel, 2. Streaming, 3. Randomized SVD using MPI4Py

Last update: Dec 31, 2022

Related tags

Overview

PYPARSVD

This implementation allows for a singular value decomposition which is:

Distributed using MPI4Py
Streaming - data can be shown in batches to update the left singular vectors
Randomized for further acceleration of any serial components of the overall algorithm.

The streaming algorithm used in this implementation is available in: "Sequential Karhunen–Loeve Basis Extraction and its Application to Images" by Avraham Levy and Michael Lindenbaum. IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 9, NO. 8, AUGUST 2000. This algorithm is implemented in Online_SVD_Serial.py.

The distributed computation of the SVD follows the implementation in "Approximate partitioned method of snapshots for POD." by Wang, Zhu, Brian McBee, and Traian Iliescu. Journal of Computational and Applied Mathematics 307 (2016): 374-384. This algorithm is validated in APMOS_Validation/.

The parallel QR algorithm (the TSQR method) required for the streaming feature may be found in "Direct QR factorizations for tall-and-skinny matrices in MapReduce architectures." by Benson, Austin R., David F. Gleich, and James Demmel. 2013 IEEE international conference on big data. IEEE, 2013. This algorithm is validated in Parallel_QR.

The randomized algorithm used to accelerate the computation of the serial SVD in partitioned method of snapshots may be found in "Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions." by Halko, Nathan, Per-Gunnar Martinsson, and Joel A. Tropp. SIAM review 53.2 (2011): 217-288.

To enable this feature set low_rank=True for initializing the online_svd_calculator class object in online_svd_parallel.py

To reproduce results on a shared memory platform (needs atleast 6 available ranks): export OPENBLAS_NUM_THREADS=1 to ensure numpy does not multithread for this experiment.

Run python data_splitter.py to generate exemplar data etc.
Run python online_svd_serial.py for serial deployment of streaming algorithm.
Run mpirun -np 6 python online_svd_parallel.py for parallel/streaming deployment.

Caution: Due to differences in the parallel and serial versions of the algorithm, singular vectors may be "flipped". An orthogonality check is also deployed for an additional sanity check.

Example extractions of left singular vectors and singular values

Even the simple problem demonstrated here (8192 spatial points and 800 snapshots) achieves a dramatic acceleration in time to solution from serial to parallelized-streaming implementations (~25X). Note that the key advantage of the parallelized version is the lack of a data-transfer requirement in case this routine is being called from a simulation.

You might also like...

Streaming over lightweight data transformations

Description Data augmentation libarary for Deep Learning, which supports images, segmentation masks, labels and keypoints. Furthermore, SOLT is fast a

Research Unit of Medical Imaging, Physics and Technology

256 Jan 8, 2023

Music library streaming app written in Flask & VueJS

djtaytay This is a little toy app made to explore Vue, brush up on my Python, and make a remote music collection accessable through a web interface. I

6 May 27, 2022

Scikit-event-correlation - Event Correlation and Forecasting over High Dimensional Streaming Sensor Data algorithms

scikit-event-correlation Event Correlation and Changing Detection Algorithm Theo

5 Oct 30, 2022

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

Secure Tar Secure Tarfile library It's a streaming wrapper around python tarfile

2 Dec 9, 2022

Real-time Object Detection for Streaming Perception, CVPR 2022

StreamYOLO Real-time Object Detection for Streaming Perception Jinrong Yang, Songtao Liu, Zeming Li, Xiaoping Li, Sun Jian Real-time Object Detection

237 Dec 27, 2022

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

English | 简体中文 Welcome to the PaddlePaddle GitHub. PaddlePaddle, as the only independent R&D deep learning platform in China, has been officially open

19.4k Jan 4, 2023

Releases(v1.0)

v1.0(Feb 25, 2021)

A Parallelized, streaming, and randomized implementation of the SVD for Python using mpi4py.

Contact [email protected] (or create issue) for details.

Romit Maulik
Source code(tar.gz)
Source code(zip)

An implementation of the 1. Parallel, 2. Streaming, 3. Randomized SVD using MPI4Py

Related tags

Overview

PYPARSVD

You might also like...

Streaming over lightweight data transformations

Music library streaming app written in Flask & VueJS

Scikit-event-correlation - Event Correlation and Forecasting over High Dimensional Streaming Sensor Data algorithms

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

Real-time Object Detection for Streaming Perception, CVPR 2022

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

Model parallel transformers in Jax and Haiku

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

Symbolic Parallel Adaptive Importance Sampling for Probabilistic Program Analysis in JAX

Releases(v1.0)

v1.0(Feb 25, 2021)

Owner

Romit Maulik

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

This is the official implementation for the paper "Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization" in NeurIPS 2021.

🇰🇷 Text to Image in Korean

Code base for NeurIPS 2021 publication titled Kernel Functional Optimisation (KFO)

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Predictive AI layer for existing databases.

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized

Deep Learning Based Fasion Recommendation System for Ecommerce

Implementation of average- and worst-case robust flatness measures for adversarial training.

Meta Learning for Semi-Supervised Few-Shot Classification

MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.

This project generates news headlines using a Long Short-Term Memory (LSTM) neural network.

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

Exploring whether attention is necessary for vision transformers

Txt2Xml tool will help you convert from txt COCO format to VOC xml format in Object Detection Problem.

[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer

A collection of educational notebooks on multi-view geometry and computer vision.

Learning Lightweight Low-Light Enhancement Network using Pseudo Well-Exposed Images

Machine-in-the-Loop Rewriting for Creative Image Captioning