Official PyTorch implementation of Data-free Knowledge Distillation for Object Detection, WACV 2021.

Last update: Jan 05, 2023

Overview

Introduction

This repository is the official PyTorch implementation of Data-free Knowledge Distillation for Object Detection, WACV 2021.

Data-free Knowledge Distillation for Object Detection
Akshay Chawla, Hongxu Yin, Pavlo Molchanov and Jose Alvarez
NVIDIA

Abstract: We present DeepInversion for Object Detection (DIODE) to enable data-free knowledge distillation for neural networks trained on the object detection task. From a data-free perspective, DIODE synthesizes images given only an off-the-shelf pre-trained detection network and without any prior domain knowledge, generator network, or pre-computed activations. DIODE relies on two key components—first, an extensive set of differentiable augmentations to improve image fidelity and distillation effectiveness. Second, a novel automated bounding box and category sampling scheme for image synthesis enabling generating a large number of images with a diverse set of spatial and category objects. The resulting images enable data-free knowledge distillation from a teacher to a student detector, initialized from scratch. In an extensive set of experiments, we demonstrate that DIODE’s ability to match the original training distribution consistently enables more effective knowledge distillation than out-of-distribution proxy datasets, which unavoidably occur in a data-free setup given the absence of the original domain knowledge.

[PDF - OpenAccess CVF]

LICENSE

This work is made available under the Nvidia Source Code License (1-Way Commercial). To view a copy of this license, visit https://github.com/NVlabs/DIODE/blob/master/LICENSE

Setup environment

Install conda [link] python package manager then install the lpr environment and other packages as follows:

$ conda env create -f ./docker_environment/lpr_env.yml
$ conda activate lpr
$ conda install -y -c conda-forge opencv
$ conda install -y tqdm
$ git clone https://github.com/NVIDIA/apex
$ cd apex
$ pip install -v --no-cache-dir ./

Note: You may also generate a docker image based on provided Dockerfile docker_environments/Dockerfile.

How to run?

This repository allows for generating location and category conditioned images from an off-the-shelf Yolo-V3 object detection model.

Download the directory DIODE_data from google cloud storage: gcs-link (234 GB)

Copy pre-trained yolo-v3 checkpoint and pickle files as follows:

$ cp /path/to/DIODE_data/pretrained/names.pkl /pathto/lpr_deep_inversion/models/yolo/
$ cp /path/to/DIODE_data/pretrained/colors.pkl /pathto/lpr_deep_inversion/models/yolo/
$ cp /path/to/DIODE_data/pretrained/yolov3-tiny.pt /pathto/lpr_deep_inversion/models/yolo/
$ cp /path/to/DIODE_data/pretrained/yolov3-spp-ultralytics.pt /pathto/lpr_deep_inversion/models/yolo/

Extract the one-box dataset (single object per image) as follows:
```
$ cd /path/to/DIODE_data
$ tar xzf onebox/onebox.tgz -C /tmp
```
Confirm the folder /tmp/onebox containing the onebox dataset is present and has following directories and text file manifest.txt:
```
$ cd /tmp/onebox
$ ls
images  labels  manifest.txt
```

Generate images from yolo-v3:

$ cd /path/to/lpr_deep_inversion
$ chmod +x scripts/runner_yolo_multiscale.sh
$ scripts/runner_yolo_multiscale.sh

Notes:

For ngc, use the provided bash script scripts/diode_ngc_interactivejob.sh to start an interactive ngc job with environment setup, code and data setup.
To generate large dataset use bash script scripts/LINE_looped_runner_yolo.sh.
Check knowledge_distillation subfolder for code for knowledge distillation using generated datasets.

Citation

@inproceedings{chawla2021diode,
	title = {Data-free Knowledge Distillation for Object Detection},
	author = {Chawla, Akshay and Yin, Hongxu and Molchanov, Pavlo and Alvarez, Jose M.},
	booktitle = {The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
	month = January,
	year = {2021}
}

Official PyTorch implementation of Data-free Knowledge Distillation for Object Detection, WACV 2021.

Related tags

Overview

Introduction

LICENSE

Setup environment

How to run?

Notes:

Citation

Owner

NVIDIA Research Projects

MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.

PyTorch implementation of MulMON

This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020

A Graph Neural Network Tool for Recovering Dense Sub-graphs in Random Dense Graphs.

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

An official implementation of "Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation" (CVPR 2021) in PyTorch.

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search

Mmdet benchmark with python

A transformer model to predict pathogenic mutations

PyTorch Implementation of SSTNs for hyperspectral image classifications from the IEEE T-GRS paper "Spectral-Spatial Transformer Network for Hyperspectral Image Classification: A FAS Framework."

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

Tooling for converting STAC metadata to ODC data model

Devkit for 3D -- Some utils for 3D object detection based on Numpy and Pytorch

Product-based-recommendation-system - A product based recommendation system which uses Machine learning algorithm such as KNN and cosine similarity

Photographic Image Synthesis with Cascaded Refinement Networks - Pytorch Implementation

Cleaned up code for DSTC 10: SIMMC 2.0 track: subtask 2: multimodal coreference resolution

Code & Data for the Paper "Time Masking for Temporal Language Models", WSDM 2022

A very simple baseline to estimate 2D & 3D SMPL-compatible keypoints from a single color image.

UnFlow: Unsupervised Learning of Optical Flow with a Bidirectional Census Loss