Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework

Last update: Sep 18, 2022

Overview

VFedPCA+VFedAKPCA

This is the official source code for the Paper: Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework.

Despite enormous research interest and rapid application of federated learning (FL) to various areas, existing studies mostly focus on supervised federated learning under the horizontally partitioned local dataset setting. This paper will study the unsupervised FL under the vertically partitioned dataset setting.

Server-Clients Architecture

Figure: Server-Clients Architecture

Master Branch

VFedPCA+VFedAKPCA                    
└── case                        // Case Studies
    └── figs                    // Save experimental results' figures in '.eps' / '.png' format 
        ├── img_name*.eps              
        └── img_name*.png           
    ├── main.py          
    ├── model.py              
    └── utils.py                 
├── dataset                     // Put downloaded dataset in this folder
└── figs                        // Save experimental results' figures in '.eps' / '.png' format
    ├── img_name*.eps              
    └── img_name*.png           
├── README.md               
├── main.py                     // Experiment on Structured Dataset
├── model.py                   
└── utils.py

Environments

python = 3.8.8
numpy = 1.20.1
pandas = 1.2.4
scikit-learn = 0.24.1
scipy = 1.6.2
imageio = 2.9.0

Prepare Dataset

To demonstrate the superiority of our method, we utilized FIVE types of real-world datasets coming with distinct nature.

structured datasets from different domains;
medical image dataset;
face image dataset;
gait image dataset;
person re-identification image dataset.

Step 1: Download Dataset from the Google Drive URL

Step 2: Specify Dataset Path by Command Argument

$ python main.py --data_path="./dataset/xxx"

Experiments

We conduct extensive experiments on structured datasets to exmaines the effect of feature size, local iterations, warm-start power iterations, and weight scaling method on structed datasets. Furthermore, we investigate some case studies with image dataset to demonstrate the effectiveness of VFedPCA and VFedAKPCA.

A. Experiment on Structured Dataset

First, you need to choose the dataset.

python main.py --data_path './dataset/College.csv' --batch_size 160

Then, you only need to set different flag, p_list, iter_list and sampler_num to exmaines the effect of feature size, local iterations, warm-start power iterations, and weight scaling method on structed datasets. The example is as follows.

flag ='clients'
p_list = [3, 5, 10]         # the number of involved clients
iter_list = [100, 100, 100] # the number of local power iterations
sampler_num = 5

B. Case Studies

python main.py --data_path '../dataset/Image/DeepLesion' /
               --client_num 8 / 
               --iterations 100 / 
               --re_size 512

Citation

@inproceedings{
title = {{Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data}},
author = {Yiu-ming Cheung, Fellow, IEEE, Feng Yu, and Jian Lou},
year = 2021
}

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework

Related tags

Overview

VFedPCA+VFedAKPCA

Server-Clients Architecture

Master Branch

Environments

Prepare Dataset

Experiments

A. Experiment on Structured Dataset

B. Case Studies

Citation

Owner

John

Python Assignments for the Deep Learning lectures by Andrew NG on coursera with complete submission for grading capability.

A 3D Dense mapping backend library of SLAM based on taichi-Lang designed for the aerial swarm.

This Repo is the official CUDA implementation of ICCV 2019 Oral paper for CARAFE: Content-Aware ReAssembly of FEatures

DL & CV-based indicator toolset for the vehicle drivers via live dash-cam footage.

Distributed Evolutionary Algorithms in Python

Collection of generative models in Tensorflow

Reinforcement Learning for Portfolio Management

Ensembling Off-the-shelf Models for GAN Training

This repository contains a CBIR system that uses swin transformer to extract image's feature.

A Physics-based Noise Formation Model for Extreme Low-light Raw Denoising (CVPR 2020 Oral & TPAMI 2021)

Supplementary code for SIGGRAPH 2021 paper: Discovering Diverse Athletic Jumping Strategies

PyTorch implementation of "Dataset Knowledge Transfer for Class-Incremental Learning Without Memory" (WACV2022)

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

A Collection of Papers and Codes for ICCV2021 Low Level Vision and Image Generation

Leveraging OpenAI's Codex to solve cornerstone problems in Music

A mini-course offered to Undergrad chemistry students

A SAT-based sudoku solver

PyTorch Implementation of Small Lesion Segmentation in Brain MRIs with Subpixel Embedding (ORAL, MICCAIW 2021)

Code for "Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency" paper