A collection of differentiable SVD methods and also the official implementation of the ICCV21 paper "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?"

Last update: Dec 25, 2022

Overview

Differentiable SVD

Introduction

This repository contains:

The official Pytorch implementation of ICCV21 paper Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?
A collection of differentiable SVD methods utilized in our paper.

You can also find the presentation of our work via the slides and via the poster.

About the paper

In this paper, we investigate the reason behind why approximate matrix square root calculated via Newton-Schulz iteration outperform the accurate ones computed by SVD from the perspectives of data precision and gradient smoothness. Various remedies for computing smooth SVD gradients are investigated. We also propose a new spectral meta-layer that uses SVD in the forward pass, and Pad'e approximants in the backward propagation to compute the gradients. The results of the so-called SVD-Pad'e achieve state-of-the-art results on ImageNet and FGVC datasets.

Differentiable SVD Methods

As the backward algorithm of SVD is prone to have numerical instability, we implement a variety of end-to-end SVD methods by manipulating the backward algortihms in this repository. They include:

SVD-Pad'e: use Pad'e approximants to closely approximate the gradient. It is proposed in our ICCV21 paper.
SVD-Taylor: use Taylor polynomial to approximate the smooth gradient. It is proposed in our ICCV21 paper and the TPAMI journal.
SVD-PI: use Power Iteration (PI) to approximate the gradients. It is proposed in the NeurIPS19 paper.
SVD-Newton: use the gradient of the Newton-Schulz iteration.
SVD-Trunc: set a upper limit of the gradient and apply truncation.
SVD-TopN: select the Top-N eigenvalues and abandon the rest.
SVD-Original: ordinary SVD with gradient overflow check.

In the task of global covaraince pooling, the SVD-Pad'e achieves the best performances. You are free to try other methods in your research.

Implementation and Usage

The codes is modifed on the basis of iSQRT-COV.

See the requirements.txt for the specific required packages.

To train AlexNet on ImageNet, choose a spectral meta-layer in the script and run:

CUDA_VISIBLE_DEVICES=0,1 bash train_alexnet.sh

The pre-trained models of ResNet-50 with SVD-Pad'e is available via Google Drive. You can load the state dict by:

model.load_state_dict(torch.load('pade_resnet50.pth.tar'))

Citation

If you think the codes is helpful to your research, please consider citing our paper:

@inproceedings{song2021approximate,
  title={Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?},
  author={Song, Yue and Sebe, Nicu and Wang, Wei},
  booktitle={ICCV},
  year={2021}
}

Contact

If you have any questions or suggestions, please feel free to contact me

[email protected]

A collection of differentiable SVD methods and also the official implementation of the ICCV21 paper "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?"

Related tags

Overview

Differentiable SVD

Introduction

About the paper

Differentiable SVD Methods

Implementation and Usage

Citation

Contact

Owner

YueSong

Testing and Estimation of structural breaks in Stata

Face Library is an open source package for accurate and real-time face detection and recognition

TANL: Structured Prediction as Translation between Augmented Natural Languages

LibFewShot: A Comprehensive Library for Few-shot Learning.

This is the official PyTorch implementation of the CVPR 2020 paper "TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting".

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

Raster Vision is an open source Python framework for building computer vision models on satellite, aerial, and other large imagery sets

Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

NeWT: Natural World Tasks

Housing Price Prediction

TorchXRayVision: A library of chest X-ray datasets and models.

Deep Illuminator is a data augmentation tool designed for image relighting. It can be used to easily and efficiently generate a wide range of illumination variants of a single image.

Space-invaders - Simple Game created using Python & PyGame, as my Beginner Python Project

MEDS: Enhancing Memory Error Detection for Large-Scale Applications

potpourri3d - An invigorating blend of 3D geometry tools in Python.

A Framework for Encrypted Machine Learning in TensorFlow

The official implementation of the Hybrid Self-Attention NEAT algorithm

a dnn ai project to classify which food people are eating on audio recordings

code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"