NeuralCompression is a Python repository dedicated to research of neural networks that compress data

Overview

NeuralCompression

About

NeuralCompression is a Python repository dedicated to research of neural networks that compress data. The repository includes tools such as JAX-based entropy coders, image compression models, video compression models, and metrics for image and video evaluation.

NeuralCompression is alpha software. The project is under active development. The API will change as we make releases, potentially breaking backwards compatibility.

Installation

NeuralCompression is a project currently under development. You can install the repository in development mode.

PyPI Installation

First, install PyTorch according to the directions from the PyTorch website. Then, you should be able to run

pip install neuralcompression

to get the latest version from PyPI.

Development Installation

First, clone the repository and navigate to the NeuralCompression root directory. To match your local environment to the test environment, run

pip install -r dev-requirements.txt

Then, you can install the package in development mode by running

pip install -e .

If you do not need to match the test environment, the second step alone is enough to install the package.

Repository Structure

We use a 2-tier repository structure. The neuralcompression package contains a core set of tools for doing neural compression research. Code committed to the core package requires stricter linting, high code quality, and rigorous review. The projects folder contains code for reproducing papers and training baselines. Code in this folder is not linted aggressively, type annotations are not enforced, and it is okay to omit unit tests.

The 2-tier structure enables rapid iteration and reproduction via code in projects that is built on a backbone of high-quality code in neuralcompression.

neuralcompression

  • neuralcompression - base package
    • data - PyTorch data loaders for various data sets
    • entropy_coders - lossless compression algorithms in JAX
      • craystack - an implementation of the rANS algorithm with the craystack API
    • functional - methods for image warping, information cost, etc.
    • layers - building blocks for compression models
    • metrics - torchmetrics classes for assessing model performance
    • models - complete compression models

projects

Getting Started

For an example of package usage, see the Scale Hyperprior project, which shows how to train an image compression model in PyTorch Lightning. See DVC for a video compression example.
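
If you want a feel for how such a training loop fits together, here is a minimal sketch of a PyTorch Lightning module wrapping one of the package's models. The model and loss class names are taken from the release notes below; their exact constructor arguments, forward outputs, and loss signature are assumptions for illustration, not the package's verified API.

import pytorch_lightning as pl
import torch

from neuralcompression.layers import RateMSEDistortionLoss
from neuralcompression.models import ScaleHyperpriorAutoencoder


class ImageCompressionModule(pl.LightningModule):
    def __init__(self, learning_rate: float = 1e-4):
        super().__init__()
        self.learning_rate = learning_rate
        self.model = ScaleHyperpriorAutoencoder()  # assumed default constructor
        self.loss = RateMSEDistortionLoss()  # rate-distortion objective

    def training_step(self, batch, batch_idx):
        images = batch
        outputs = self.model(images)  # assumed to return reconstructions and likelihoods
        loss = self.loss(outputs, images)  # assumed (output, target) signature
        self.log("train_loss", loss)
        return loss

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=self.learning_rate)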

Contributions

Please read our CONTRIBUTING guide and our CODE_OF_CONDUCT prior to submitting a pull request.

We test all pull requests. We rely on this for reviews, so please make sure any new code is tested. Tests for neuralcompression go in the tests folder in the root of the repository. Tests for individual projects go in those projects' own tests folder.

We use black for formatting, isort for import sorting, flake8 for linting, and mypy for type checking. We enforce these on the neuralcompression package, but not in the projects folder.

License

NeuralCompression is MIT licensed, as found in the LICENSE file.

Cite

If you find NeuralCompression useful in your work, feel free to cite

@misc{muckley2021neuralcompression,
    author={Matthew Muckley and Jordan Juravsky and Daniel Severo and Mannat Singh and Quentin Duval and Karen Ullrich},
    title={NeuralCompression},
    howpublished={\url{https://github.com/facebookresearch/NeuralCompression}},
    year={2021}
}
Comments
  • Dependency and Build Fixes

    Dependency and Build Fixes

    This is a potpourri of fixes.

    Dependencies: Currently, we require very specific package versions for any install. This is too restrictive for a research package, where researchers may have a variety of reasons to carefully tailor their environment. This PR resolves the issue by allowing general installation to use flexible package versions. To maintain CI stability and functionality, the fixed version requirements have been moved to dev.

    Build: We build our C++ extension using torch.utils.cpp_extension.load instead of setup.py. This removes PyTorch as a dependency at install time and removes the need for us to distribute binaries. It required adding ninja as a dependency. Hopefully we can get rid of it eventually, but for now this should fix distribution.
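
    For reference, a rough sketch of what building the extension with load looks like (the module name and source path below are illustrative, not the repository's exact layout):

    import torch.utils.cpp_extension

    _extension = torch.utils.cpp_extension.load(
        name="pmf_to_quantized_cdf",  # hypothetical module name
        sources=["neuralcompression/cpp/pmf_to_quantized_cdf.cpp"],  # hypothetical path
        verbose=False,
    )
    # The compiled ops are then available as attributes of the returned module,
    # e.g. _extension.pmf_to_quantized_cdf(...).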

    CI testing: isort was misconfigured and missed a lot of import changes. I fixed the config and ran it on all files to update import sorting.

    Other: While implementing this I also found a few other minor issues with respect to versioning, tests, and formatting that I resolved.

    This PR makes NeuralCompression require Python 3.8 because of its use of importlib.metadata.

    Testing: Done via CI.

    CLA Signed 
    opened by mmuckley 4
  • Documentation builds on ReadTheDocs are failing

    Documentation builds on ReadTheDocs are failing

    Bug

    Documentation builds on ReadTheDocs are failing.

    Steps

    See here: https://readthedocs.org/projects/neuralcompression/builds/15172473/

    Expected behavior

    Docs should build without error.

    Environment

    ReadTheDocs

    Context

    See here: https://readthedocs.org/projects/neuralcompression/builds/15172473/

    bug 
    opened by mmuckley 4
  • `ContinuousEntropy` layer

    `ContinuousEntropy` layer

    Abstract base class (ABC) for implementing continuous entropy layers.

    The abstract class pre-computes integer probability tables based on a prior distribution, which can be used across different platforms by a range encoder and decoder. The class also provides abstract methods for compression, decompression, quantization, and reconstruction.
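
    As a rough illustration of that design (names and signatures here are assumptions, not the actual class):

    from abc import ABC, abstractmethod

    from torch import Tensor


    class ContinuousEntropyBase(ABC):
        def __init__(self, prior):
            # A real implementation would pre-compute integer probability
            # tables (quantized CDFs) from the prior here, so a range encoder
            # and decoder can reuse them identically on any platform.
            self._prior = prior

        @abstractmethod
        def compress(self, bottleneck: Tensor) -> bytes:
            ...

        @abstractmethod
        def decompress(self, data: bytes, shape) -> Tensor:
            ...

        @abstractmethod
        def quantize(self, bottleneck: Tensor) -> Tensor:
            ...

        @abstractmethod
        def reconstruct(self, quantized: Tensor) -> Tensor:
            ...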

    enhancement CLA Signed 
    opened by 0x00b1 4
  • `NonNegativeParameterization` layer

    `NonNegativeParameterization` layer

    closes #86

    Non-negative parameterization as required by generalized divisive normalization (GDN) activations. The parameter is subjected to an invertible transformation that slows down the learning rate for small values.

    A brief usage example:

    import torch
    from torch.nn import Module, Parameter
    import torch.nn.functional
    
    from torch import Tensor
    
    from ._non_negative_parameterization import NonNegativeParameterization
    
    
    class GeneralizedDivisiveNormalization(Module):
        def __init__(
            self,
            in_channels: int,
            inverse: bool = False,
            beta_min: float = 1e-6,
            gamma_init: float = 0.1,
        ):
            super(GeneralizedDivisiveNormalization, self).__init__()
    
            self._inverse = inverse
    
            self._reparameterized_beta = NonNegativeParameterization(
                torch.ones(in_channels),
                minimum=beta_min,
            )
    
            self._beta = Parameter(
                self._reparameterized_beta.initialized,
            )
    
            self._reparameterized_gamma = NonNegativeParameterization(
                gamma_init * torch.eye(in_channels),
            )
    
            self._gamma = Parameter(
                self._reparameterized_gamma.initialized,
            )
    
        def forward(self, x: Tensor) -> Tensor:
            _, channels, _, _ = x.size()
    
            y = torch.nn.functional.conv2d(
                x ** 2,
                torch.reshape(
                    self._reparameterized_gamma(self._gamma),
                    (channels, channels, 1, 1)
                ),
                self._reparameterized_beta(self._beta),
            )
    
            if self._inverse:
                return x * torch.sqrt(y)
    
            return x * torch.rsqrt(y)
    
    import torch
    import torch.testing
    
    from neuralcompression.layers import GeneralizedDivisiveNormalization
    
    
    class TestGeneralizedDivisiveNormalization:
        def test_backward(self):
            x = torch.rand((1, 32, 16, 16), requires_grad=True)
    
            generalized_divisive_normalization = GeneralizedDivisiveNormalization(32)
    
            y = generalized_divisive_normalization(x)
    
            y.backward(x)
    
            assert y.shape == x.shape
    
            assert x.grad is not None
    
            assert x.grad.shape == x.shape
    
            torch.testing.assert_allclose(
                x / torch.sqrt(1 + 0.1 * (x ** 2)),
                y,
            )
    
            generalized_divisive_normalization = GeneralizedDivisiveNormalization(
                32,
                inverse=True,
            )
    
            y = generalized_divisive_normalization(x)
    
            y.backward(x)
    
            assert y.shape == x.shape
    
            assert x.grad is not None
    
            assert x.grad.shape == x.shape
    
            torch.testing.assert_allclose(
                x * torch.sqrt(1 + 0.1 * (x ** 2)),
                y,
            )
    
    enhancement CLA Signed 
    opened by 0x00b1 4
  • pad image at inference time, remove resize

    pad image at inference time, remove resize

    Changes

    At inference time, pad the image instead of doing an interpolation-based resize, which gives poor results (e.g. on PSNR) when the input image height and/or width is not exactly divisible by the downsampling factor (= 2^{number of downsampling layers}).
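
    A rough sketch of the idea (the helper below is illustrative, not the PR's exact code): reflection-pad the input up to a multiple of the total downsampling factor, run the model, and crop the output back to the original size.

    import torch
    import torch.nn.functional as F


    def pad_run_crop(model, image: torch.Tensor, num_downsampling_layers: int) -> torch.Tensor:
        factor = 2 ** num_downsampling_layers
        _, _, height, width = image.shape
        pad_h = (factor - height % factor) % factor
        pad_w = (factor - width % factor) % factor
        # Reflection padding avoids the hard borders introduced by zero padding.
        padded = F.pad(image, (0, pad_w, 0, pad_h), mode="reflect")
        output = model(padded)
        # Crop the reconstruction back to the original spatial size.
        return output[..., :height, :width]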

    CLA Signed 
    opened by desi-ivanova 3
  • Replace license docstrings with comments

    Replace license docstrings with comments

    The intention was three-fold:

    1. consolidate copyright formatting across .py sources
    2. simplify the implementation of #100
    3. remove copyright headers from module documentation

    Changes

    • [x] replaces license docstrings with comments
    CLA Signed 
    opened by 0x00b1 3
  • survival_function op

    survival_function op

    closes #75

    Survival function of x. Generally defined as 1 - distribution.cdf(x).

    The unit test checks that the returned result matches that of scipy.stats.norm.sf.
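
    A minimal sketch of the op and of the comparison used in the test (illustrative; the actual implementation lives in neuralcompression.functional):

    import scipy.stats
    import torch
    import torch.testing
    from torch.distributions import Normal


    def survival_function(x: torch.Tensor, distribution) -> torch.Tensor:
        # Generally defined as 1 - CDF(x).
        return 1.0 - distribution.cdf(x)


    x = torch.linspace(-3.0, 3.0, steps=7)
    expected = torch.tensor(scipy.stats.norm.sf(x.numpy()), dtype=torch.float32)
    torch.testing.assert_allclose(survival_function(x, Normal(0.0, 1.0)), expected)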

    enhancement CLA Signed 
    opened by 0x00b1 3
  • Update triggers for CI

    Update triggers for CI

    This PR alters the triggers for continuous integration. Previously, we triggered all tests on both pushes and pull requests. This meant that within a PR we would have "duplicate" (but not really duplicate) checks. What we really want on a PR is just the PR check, so we'll keep that.

    Triggering CI on a push to a branch is still occasionally useful, and that is what this PR removes; as a replacement, we add workflow_dispatch, which allows a user to trigger CI from the GitHub UI. So we remove quite a bit of duplicated testing at the cost of making users click a button if they want to test their code before opening a PR.

    Note that this PR only applies to people who push to branches of the repository.

    CLA Signed 
    opened by mmuckley 3
  • HiFiC modules

    HiFiC modules

    Implements the following modules from Mentzer et al. (2020):

    • HiFiCDiscriminator
    • HiFiCEncoder
    • HiFiCGenerator
    @misc{mentzer2020highfidelity,
          title={High-Fidelity Generative Image Compression}, 
          author={Fabian Mentzer and George Toderici and Michael Tschannen and Eirikur Agustsson},
          year={2020},
          eprint={2006.09965},
          archivePrefix={arXiv},
          primaryClass={eess.IV}
    }
    

    Originally implemented in TensorFlow Compression (TFC) by the author (@relational).

    CLA Signed 
    opened by 0x00b1 3
  • remove metadata from __init__.py

    remove metadata from __init__.py

    Changes

    The metadata special variables from neuralcompression.__init__ were removed.

    This metadata was not being used and is now available from setup.cfg.

    CLA Signed 
    opened by 0x00b1 2
  • Update PyTorch to 1.10.0

    Update PyTorch to 1.10.0

    Closes #127.

    Changes

    • [x] Replaced torch.testing.assert_equal with torch.testing.assert_close
    • [x] Updates torch to 1.10.0
    • [x] Updates torchvision to 0.11.1
    enhancement CLA Signed 
    opened by 0x00b1 2
  • Implement PQ-MIM compression paper

    Implement PQ-MIM compression paper

    opened by mmuckley 0
  • Upstream Google autoencoder models to CompressAI

    Upstream Google autoencoder models to CompressAI

    At the moment we have several model implementations that are already implemented in CompressAI (e.g., Scale Hyperprior, Mean-Scale Hyperprior). At this point CompressAI has pretty good adoption, so we should be able to remove these from our repository and depend on the upstream implementations.

    By default CompressAI doesn't handle reflective image padding for users, so if desired we could include wrappers like the one in PR #185 to handle this for users unfamiliar with the functionality of these models.

    enhancement 
    opened by mmuckley 1
Releases(v0.2.1)
  • v0.2.1(Jan 12, 2022)

    This release covers a few small fixes from PRs #171 and #172.

    Dependencies

    • To retrieve versioning information, we now use importlib.metadata, which is included only with Python >= 3.8, so NeuralCompression now requires Python 3.8 or later (#171). A minimal sketch of this usage follows this list.
    • Install requirements are flexible, whereas dev requirements are fixed (#171). This should improve CI stability while still giving researchers the flexibility to tailor their environments when using NeuralCompression.
    • torch has been removed as a build dependency (#172).
    • Other build dependencies have been modified to be flexible (#172).
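
    As a minimal illustration of the importlib.metadata usage mentioned above (standard-library code, not project-specific):

    from importlib.metadata import version

    # Prints the installed package version recorded in its metadata.
    print(version("neuralcompression"))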

    Build System

    • C++ code from _pmf_to_quantized_cdf introduced compilation requirements when running setup.py. Since we didn't configure our build system to handle specific operating systems, this caused a failed release upload to PyPI. The build system has been altered to use torch.utils.cpp_extension.load, which defers compilation to the user after package installation. We would like to improve this further at some point, but the modifications from #171 get the package stable. Note: there is a reasonable chance this could fail on non-Linux OSes such as Windows. Those users will still be able to use other package features that don't rely on _pmf_to_quantized_cdf.

    Other

    • Fixed a linting issue where isort was not checking in CI whether imports were properly sorted (#171).
    • Fixed a random test issue (#171).
  • v0.2.0(Dec 13, 2021)

    NeuralCompression is a PyTorch-based Python package intended to simplify neural network-based compression research. It is similar to (and shares some of the functionality of) fantastic libraries like TensorFlow Compression and CompressAI.

    The major theme of the v0.2.0 release is autoencoders, particularly features useful for implementing existing models from Ballé et al. and features useful for expanding on these models in forthcoming research. In addition, 0.2.0 brings some code organization changes and published documentation. I recommend reading the new “Image Compression” example to see some of these changes.

    API Additions

    Data (neuralcompression.data)

    Distributions (neuralcompression.distributions)

    • NoisyNormal: normal distribution with additive independent and identically distributed (i.i.d.) uniform noise.
    • UniformNoise: adapts a continuous distribution via additive independent and identically distributed (i.i.d.) uniform noise.

    Functional (neuralcompression.functional)

    • estimate_tails: estimates approximate tail quantiles.
    • log_cdf: logarithm of the distribution’s cumulative distribution function (CDF).
    • log_expm1: logarithm of e^{x} - 1.
    • log_ndtr: logarithm of the normal cumulative distribution function (CDF).
    • log_survival_function: logarithm of a distribution’s survival function evaluated at x.
    • lower_bound: torch.maximum with a gradient for x < bound.
    • lower_tail: approximates lower tail quantile for range coding.
    • ndtr: the normal cumulative distribution function (CDF).
    • pmf_to_quantized_cdf: transforms a probability mass function (PMF) into a quantized cumulative distribution function (CDF) for entropy coding.
    • quantization_offset: computes a distribution-dependent quantization offset.
    • soft_round_conditional_mean: conditional mean of x given noisy soft rounded values.
    • soft_round_inverse: inverse of soft_round.
    • soft_round: differentiable approximation of torch.round (see the sketch after this list).
    • survival_function: survival function of x. Generally defined as 1 - distribution.cdf(x).
    • upper_tail: approximates upper tail quantile for range coding.
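
    As a small illustration of soft_round, here is one common formulation, following Agustsson & Theis (2020); the package’s implementation may differ in details such as numerical stabilization:

    import math

    import torch


    def soft_round(x: torch.Tensor, alpha: float) -> torch.Tensor:
        # m is the nearest half-integer grid point; r lies in [-0.5, 0.5).
        m = torch.floor(x) + 0.5
        r = x - m
        # Approaches torch.round as alpha -> infinity and the identity as alpha -> 0.
        return m + torch.tanh(alpha * r) / (2.0 * math.tanh(alpha / 2.0))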

    Layers (neuralcompression.layers)

    • AnalysisTransformation2D: applies the 2D analysis transformation over an input signal.
    • ContinuousEntropy: base class for continuous entropy layers.
    • GeneralizedDivisiveNormalization: applies generalized divisive normalization for each channel across a batch of data.
    • HyperAnalysisTransformation2D: applies the 2D hyper analysis transformation over an input signal.
    • HyperSynthesisTransformation2D: applies the 2D hyper synthesis transformation over an input signal.
    • NonNegativeParameterization: subjects a parameter to an invertible transformation that slows down the learning rate for small values.
    • RateMSEDistortionLoss: rate-distortion loss.
    • SynthesisTransformation2D: applies the 2D synthesis transformation over an input signal.

    Models (neuralcompression.models)

    End-to-end Optimized Image Compression

    End-to-end Optimized Image Compression
    Johannes Ballé, Valero Laparra, Eero P. Simoncelli
    https://arxiv.org/abs/1611.01704
    
    • PriorAutoencoder: base class for implementing prior autoencoder architectures.
    • FactorizedPriorAutoencoder

    High-Fidelity Generative Image Compression

    High-Fidelity Generative Image Compression
    Fabian Mentzer, George Toderici, Michael Tschannen, Eirikur Agustsson
    https://arxiv.org/abs/2006.09965
    
    • HiFiCEncoder
    • HiFiCDiscriminator
    • HiFiCGenerator

    Variational Image Compression with a Scale Hyperprior

    Variational Image Compression with a Scale Hyperprior
    Johannes Ballé, David Minnen, Saurabh Singh, Sung Jin Hwang, Nick Johnston
    https://arxiv.org/abs/1802.01436
    
    • HyperpriorAutoencoder: base class for implementing hyperprior autoencoder architectures.
    • MeanScaleHyperpriorAutoencoder
    • ScaleHyperpriorAutoencoder

    API Changes

    • neuralcompression.functional.hsv2rgb is now neuralcompression.functional.hsv_to_rgb.
    • neuralcompression.functional.learned_perceptual_image_patch_similarity is now neuralcompression.functional.lpips.

    Acknowledgements

    Thank you to the following people for their advice:

  • v0.1.0(Jul 19, 2021)
