Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Last update: Jan 07, 2023

Related tags

Overview

Trainable multi-codebook quantization

This repository implements a utility for use with PyTorch, and ideally GPUs, for training an efficient quantizer based on multiple single-byte codebooks. The prototypical scenario is that you have some distribution over vectors in some space, say, of dimension 512, that might come from a neural net embedding, and you want a means of encoding a vector into a short sequence of bytes (say, 4 or 8 bytes) that can be used to reconstruct the vector with minimal expected loss, measured as squared distance, i.e. squared l2 loss.

This repository provides Quantizer object that lets you do this quantization, and an associated QuantizerTrainer object that you can use to train the Quantizer. For example, you might invoke the QuantizerTrainer with 20,000 minibatches of vectors.

Usage

Installation

python3 setup.py install

Example

import torch
import quantization

trainer = quantization.QuantizerTrainer(dim=256, bytes_per_frame=4,
                                        device=torch.device('cuda'))
while not trainer.done():
   # let x be some tensor of shape (*, dim), that you will train on
   # (should not be the same on each minibatch)
   trainer.step(x)
quantizer = trainer.get_quantizer()

# let x be some tensor of shape (*, dim)..
encoded = quantizer.encode(x)  # (*, 4), dtype=uint8
x_approx = quantizer.decode(quantizer.encode(x))

To avoid versioning issues and so on, it may be easier to just include quantization.py in your repository directly (and add its requirements to your requirements.txt).

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Related tags

Overview

Trainable multi-codebook quantization

Usage

Installation

Example

Owner

Daniel Povey

Pcos-prediction - Predicts the likelihood of Polycystic Ovary Syndrome based on patient attributes and symptoms

Keras udrl - Keras implementation of Upside Down Reinforcement Learning

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs, ICCV 2021

Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021

This is the source code of the solver used to compete in the International Timetabling Competition 2019.

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Deep Q-network learning to play flappybird.

Code of the paper "Shaping Visual Representations with Attributes for Few-Shot Learning (ASL)".

Code for the paper “The Peril of Popular Deep Learning Uncertainty Estimation Methods”

AdaFocus (ICCV 2021) Adaptive Focus for Efficient Video Recognition

Learned image compression

🕹️ Official Implementation of Conditional Motion In-betweening (CMIB) 🏃

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

Pytorch Implementation of Auto-Compressing Subset Pruning for Semantic Image Segmentation

DeLighT: Very Deep and Light-Weight Transformers

Pytorch implementation for "Adversarial Robustness under Long-Tailed Distribution" (CVPR 2021 Oral)

Listing arxiv - Personalized list of today's articles from ArXiv

Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"

A Fast Knowledge Distillation Framework for Visual Recognition

use tensorflow 2.0 to tell a dog and cat from a specified picture