A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Last update: Nov 01, 2022

Related tags

Deep Learning imagenet-tools

Overview

This is a set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Make TFRecords

To run the script setup a virtualenv with the following libraries installed.

tensorflow: Install with pip install tensorflow

Once you have all the above libraries setup, you should register on the Imagenet website and download the ImageNet .tar files. It should be extracted and provided in the format:

Training images: train/n03062245/n03062245_4620.JPEG
Validation Images: validation/ILSVRC2012_val_00000001.JPEG

To run the script to preprocess the raw dataset as TFRecords, run the following command:

python3 make_tfrecords.py \
  --raw_data_dir="path/to/imagenet" \
  --local_scratch_dir="path/to/output"

Note that the label is from 1 to 1000.

Make index files

To run the script setup a virtualenv with the following libraries installed.

nvidia.dali: See documentation

python3 make_idx.py --tfrecord_root="path/to/tfrecords"

Build subset of Imagenet-1K

This can help you build a subset of Imagenet-1K (TFRecord format):

python3 build_subset.py "path/to/tfrecords" "output_dir" \
  --train_num_shards=128 \
  --valid_num_shards=16 \
  --num_classes=100

Classes are selected randomly.

DALI dataloader

We also provide a DALI dataloader which can read the processed dataset. The dataloader is equipped with Mixup.

Here is an simple example to construct it:

import glob
import os


def build_dali_train(root):
    train_pat = os.path.join(root, 'train/*')
    train_idx_pat = os.path.join(root, 'idx_files/train/*')
    return DaliDataloader(
        sorted(glob.glob(train_pat)),
        sorted(glob.glob(train_idx_pat)),
        batch_size=BATCH_SIZE,
        shard_id=SHARD_ID,
        num_shards=NUM_SHARDS,
        training=True,
        gpu_aug=True,
        cuda=True,
        mixup_alpha=0.0,
        num_threads=16,
    )

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Related tags

Overview

Overview

Make TFRecords

Make index files

Build subset of Imagenet-1K

DALI dataloader

Owner

The UI as a mobile display for OP25

Attention Probe: Vision Transformer Distillation in the Wild

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

A solution to the 2D Ising model of ferromagnetism, implemented using the Metropolis algorithm

RaceBERT -- A transformer based model to predict race and ethnicty from names

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

All the essential resources and template code needed to understand and practice data structures and algorithms in python with few small projects to demonstrate their practical application.

Human4D Dataset tools for processing and visualization

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

Pytorch implemenation of Stochastic Multi-Label Image-to-image Translation (SMIT)

Styled Augmented Translation

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

View model summaries in PyTorch!

Leaderboard, taxonomy, and curated list of few-shot object detection papers.

source code for 'Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge' by A. Shah, K. Shanmugam, K. Ahuja

Repository for self-supervised landmark discovery

Libtorch yolov3 deepsort

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Related tags

Overview

Overview

Make TFRecords

Make index files

Build subset of Imagenet-1K

DALI dataloader

Owner

The UI as a mobile display for OP25

Attention Probe: Vision Transformer Distillation in the Wild

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

A solution to the 2D Ising model of ferromagnetism, implemented using the Metropolis algorithm

RaceBERT -- A transformer based model to predict race and ethnicty from names

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

All the essential resources and template code needed to understand and practice data structures and algorithms in python with few small projects to demonstrate their practical application.

Human4D Dataset tools for processing and visualization

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

Pytorch implemenation of Stochastic Multi-Label Image-to-image Translation (SMIT)

Styled Augmented Translation

THIS IS THE **OLD** PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

View model summaries in PyTorch!

Leaderboard, taxonomy, and curated list of few-shot object detection papers.

source code for 'Finding Valid Adjustments under Non-ignorability with Minimal DAG Knowledge' by A. Shah, K. Shanmugam, K. Ahuja

Repository for self-supervised landmark discovery

Libtorch yolov3 deepsort

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD: