A GridMixup augmentation, inspired by GridMask and CutMix

Last update: Dec 28, 2022

GridMixup

Easy install

pip install git+https://github.com/IlyaDobrynin/GridMixup.git

Overview

This simple augmentation is inspired by the GridMask and CutMix augmentations. Combining these two augmentations yields the proposed method.
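To make the idea concrete, here is a minimal pure-Python sketch of mixing two images through a grid of holes. It is an illustration only and not the library's implementation: the helpers `grid_mask` and `grid_mix` and the `hole_frac` argument are hypothetical names, the holes are square, and cropping and aspect ratios are ignored.

```python
def grid_mask(height, width, n_holes_x, hole_frac):
    """Return a 0/1 mask (list of rows); 1 marks a hole filled from the secondary image."""
    cell = width // n_holes_x              # side of one grid cell, in pixels
    hole = max(1, int(cell * hole_frac))   # side of the square hole inside each cell
    return [
        [1 if (x % cell) < hole and (y % cell) < hole else 0 for x in range(width)]
        for y in range(height)
    ]


def grid_mix(main, secondary, mask):
    """Take secondary pixels where the mask is 1 and main pixels elsewhere."""
    return [
        [s if m else p for p, s, m in zip(row_p, row_s, row_m)]
        for row_p, row_s, row_m in zip(main, secondary, mask)
    ]


# Toy 4x4 single-channel "images": main is all 0, secondary is all 9.
main = [[0] * 4 for _ in range(4)]
secondary = [[9] * 4 for _ in range(4)]
mask = grid_mask(height=4, width=4, n_holes_x=2, hole_frac=0.5)
mixed = grid_mix(main, secondary, mask)

# Fraction of the mixed image that still comes from the main image.
lam = 1 - sum(map(sum, mask)) / (4 * 4)
```

In this toy case four of the sixteen pixels come from the secondary image, so `lam` is 0.75.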

Example

To run the example notebooks, install the requirements first:

pip install -r requirements.txt

Simple examples are here: demo and pipeline demo

TL;DR:

from gridmix import GridMixupLoss

gridmix_cls = GridMixupLoss(
    alpha=(0.4, 0.7),
    hole_aspect_ratio=1.,
    crop_area_ratio=(0.5, 1),
    crop_aspect_ratio=(0.5, 2),
    n_holes_x=(2, 6)
)

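# `batch`, `model` and `criterion` are assumed to be defined by the surrounding training loop.
images, targets = batch['images'], batch['targets']
# Build the mixed images and the matching mixed targets.
images_mixed, targets_mixed = gridmix_cls.get_sample(images=images, targets=targets)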
preds = model(images_mixed)
loss = criterion(preds, targets_mixed)

[Figure: sample images before and after GridMixup]

The GridMixup loss is defined as:

lam * CrossEntropyLoss(preds, trues1) + (1 - lam) * CrossEntropyLoss(preds, trues2)

where:

lam - the fraction of the mixed image's area taken by the main image
(1 - lam) - the fraction of the mixed image's area taken by the secondary image
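The formula can be checked by hand on a single sample. The sketch below is plain Python and does not use the library: `cross_entropy` and `mixed_loss` are hypothetical helpers, and the logits are made-up numbers.

```python
import math


def cross_entropy(logits, target):
    """Cross-entropy of one sample: -log(softmax(logits)[target])."""
    peak = max(logits)  # subtract the max for numerical stability
    log_sum_exp = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return log_sum_exp - logits[target]


def mixed_loss(logits, target_main, target_secondary, lam):
    """lam * CE(preds, trues1) + (1 - lam) * CE(preds, trues2) for one sample."""
    return (lam * cross_entropy(logits, target_main)
            + (1 - lam) * cross_entropy(logits, target_secondary))


logits = [2.0, 0.5, -1.0]  # made-up model outputs for three classes
loss = mixed_loss(logits, target_main=0, target_secondary=1, lam=0.7)
```

With `lam=1` the result reduces to the ordinary cross-entropy against the main target, and with `lam=0` to the cross-entropy against the secondary target.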

Parameters

GridMixupLoss takes the following arguments:

alpha - defines the area of the main image in the mixed image. Can be a float or a Tuple[float, float]:
- if float: lambda is sampled from the beta distribution np.random.beta(alpha, alpha);
- if Tuple[float, float]: lambda is sampled from the uniform distribution np.random.uniform(alpha[0], alpha[1]).
n_holes_x - number of holes in the crop along the X axis.
hole_aspect_ratio - aspect ratio of the holes.
crop_area_ratio - defines the area of the secondary image in the mixed image.
crop_aspect_ratio - aspect ratio of the crop.
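The two sampling modes for alpha can be sketched as follows. This is an illustration and not the library's code: `sample_lam` is a hypothetical helper, and the standard-library `random` module stands in for the NumPy calls named above.

```python
import random


def sample_lam(alpha, rng=random):
    """Draw the main-image area fraction according to the type of alpha."""
    if isinstance(alpha, (tuple, list)):
        low, high = alpha
        return rng.uniform(low, high)     # stand-in for np.random.uniform(alpha[0], alpha[1])
    return rng.betavariate(alpha, alpha)  # stand-in for np.random.beta(alpha, alpha)


rng = random.Random(0)                     # seeded only to make the sketch repeatable
lam_uniform = sample_lam((0.4, 0.7), rng)  # falls within [0.4, 0.7]
lam_beta = sample_lam(0.4, rng)            # falls within [0, 1]
```

A tuple keeps the main image's share inside a fixed range, while a single float lets the beta distribution decide how often the mix is close to one of the two images.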

Owner

IlyaDo
