Learned image compression

Last update: Dec 04, 2022

Overview

Pytorch code of our recent work A Unified End-to-End Framework for Efficient Deep Image Compression.

We first release the code for Variational image compression with a scale hyperprior, we will update our code to our full implementaion of our paper.

Prerequisites

You should install the libraries of this repo.

pip install -r requirements.txt

Data Preparation

We need to first prepare the training and validation data. The trainging data is from flicker.com. You can obtain the training data according to description of CompressionData.

The validation data is the popular kodak dataset.

bash data/download_kodak.sh

Training

For high bitrate (4096, 6144, 8192), the out_channel_N is 192 and the out_channel_M is 320 in 'config_high.json'. For low bitrate (256, 512, 1024, 2048), the out_channel_N is 128 and the out_channel_M is 192 in 'config_low.json'.

Details

PSNR experiments.

For high bitrate of 8192, we first train from scratch as follows.

CUDA_VISIBLE_DEVICES=0 python train.py --config examples/example/config_high.json -n baseline_8192 --train flicker_path --val kodak_path

For other high bitrate (4096, 6144), we use the converged model of 8192 as pretrain model and set the learning rate as 1e-5. The training iterations are set as 500000.

The low bitrate (256, 512, 1024, 2048) training process follows the same strategy.

MS-SSIM experiments

You should change the distorsion loss to (1-MS_SSIM), and fine-tune the pretrained model optimized by PSNR to accelerate the training process. You can find more details in our released paper. The training strategy is similar.

If your find our code is helpful for your research, please cite our paper. Besides, this code is only for research.

@article{liu2020unified,
  title={A Unified End-to-End Framework for Efficient Deep Image Compression},
  author={Liu, Jiaheng and Lu, Guo and Hu, Zhihao and Xu, Dong},
  journal={arXiv preprint arXiv:2002.03370},
  year={2020}
}

Learned image compression

Related tags

Overview

Overview

Content

Prerequisites

Data Preparation

Training

Details

PSNR experiments.

MS-SSIM experiments

Owner

Jiaheng Liu

Ground truth data for the Optical Character Recognition of Historical Classical Commentaries.

(Personalized) Page-Rank computation using PyTorch

Implementing Graph Convolutional Networks and Information Retrieval Mechanisms using pure Python and NumPy

The official code of Anisotropic Stroke Control for Multiple Artists Style Transfer

Official implementation of VQ-Diffusion

Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019

[ICCV 2021] Group-aware Contrastive Regression for Action Quality Assessment

Unofficial Implementation of Oboe (SIGCOMM'18').

PyTorch implementation of SIFT descriptor

Medical Insurance Cost Prediction using Machine earning

Efficient Training of Visual Transformers with Small Datasets

House3D: A Rich and Realistic 3D Environment

JittorVis - Visual understanding of deep learning models

This is a Image aid classification software based on python TK library development

Convert openmmlab (not only mmdetection) series model to tensorrt

Code for 1st place solution in Sleep AI Challenge SNU Hospital

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

An Open-Source Toolkit for Prompt-Learning.

Open Source Light Field Toolbox for Super-Resolution

Plug and play transformer you can find network structure and official complete code by clicking List