Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

Last update: Nov 15, 2022

Overview

Photon-Starved Scene Inference using Single Photon Cameras

ICCV 2021

Bhavya Goyal, Mohit Gupta

University of Wisconsin-Madison

Abstract

Scene understanding under low-light conditions is a challenging problem. This is due to the small number of photons captured by the camera and the resulting low signal-to-noise ratio (SNR). Single-photon cameras (SPCs) are an emerging sensing modality capable of capturing images with high sensitivity. Despite having minimal read noise, images captured by SPCs in photon-starved conditions still suffer from strong shot noise, preventing reliable scene inference. We propose photon scale-space, a collection of high-SNR images spanning a wide range of photons-per-pixel (PPP) levels (but with the same scene content), as guides to train inference models on low photon flux images. We develop training techniques that push images with different illumination levels closer to each other in feature representation space. The key idea is that having a spectrum of different brightness levels during training enables effective guidance and increases robustness to shot noise, even in extreme noise cases. Based on the proposed approach, we demonstrate, via simulations and real experiments with a SPAD camera, high performance on various inference tasks such as image classification and monocular depth estimation under ultra-low light, down to < 1 PPP.
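
To make the two ideas in the abstract concrete, here is a minimal sketch, not the authors' implementation. It assumes the PPP levels of a photon scale-space are spaced geometrically and that "pushing features closer" is measured with a plain mean squared distance; both choices, and the names `ppp_levels` and `feature_consistency_loss`, are hypothetical. The sketch uses only the Python standard library for readability; real training code would use PyTorch tensors.

```python
def ppp_levels(min_ppp, max_ppp, num_levels):
    """Return num_levels geometrically spaced PPP values from min_ppp to max_ppp.

    Geometric spacing is an assumption made for this sketch.
    """
    if num_levels < 2 or min_ppp <= 0 or max_ppp <= min_ppp:
        raise ValueError("need num_levels >= 2 and 0 < min_ppp < max_ppp")
    ratio = (max_ppp / min_ppp) ** (1.0 / (num_levels - 1))
    return [min_ppp * ratio ** i for i in range(num_levels)]


def feature_consistency_loss(features_low, features_high):
    """Mean squared distance between two equal-length feature vectors.

    A generic stand-in (assumption) for a loss that pulls the features of a
    low-PPP image toward those of a higher-PPP image of the same scene.
    """
    if len(features_low) != len(features_high):
        raise ValueError("feature vectors must have the same length")
    squared = [(a - b) ** 2 for a, b in zip(features_low, features_high)]
    return sum(squared) / len(squared)
```

In such a setup, each training image would be rendered at every level returned by `ppp_levels`, and the consistency term would be added to the usual task loss for each pair of levels.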

Code Structure

.
├── classification          # Code for image classification using Photon Net training
├── monodepth               # Code for monocular depth estimation using Photon Net training
├── simulation              # Scripts for simulating noisy SPAD images
├── figures                 # figures used for results
└── README.md

Requirements/Installation

Install PyTorch (pytorch.org)
pip install -r requirements.txt

How to Use

Download the datasets (CUB/CARS/NYUV2/others) from the official sources, then use the scripts in simulation to generate noisy SPAD images.
Use the classification and monodepth code for image classification and monocular depth estimation with Photon Net.
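
For readers unfamiliar with SPAD image simulation, the following is a rough sketch of one common model, not the code in the simulation folder. It assumes the clean image is scaled so its mean equals the target PPP, the exposure is split into binary frames, and a pixel fires in a frame with probability 1 - exp(-flux per frame), which is the chance that a Poisson process delivers at least one photon. The function names are hypothetical, and the pure-Python loops are for clarity only; a practical script would vectorize this.

```python
import math
import random


def scale_to_ppp(image, target_ppp):
    """Scale a clean image (list of rows of non-negative values) so that
    its mean intensity equals target_ppp photons per pixel."""
    pixels = [value for row in image for value in row]
    mean = sum(pixels) / len(pixels)
    if mean <= 0:
        raise ValueError("image must have a positive mean intensity")
    gain = target_ppp / mean
    return [[value * gain for value in row] for row in image]


def simulate_spad_counts(flux, num_frames, rng):
    """Sum num_frames binary frames for each pixel.

    flux holds the expected photons per pixel over the whole exposure.
    In each frame a pixel fires with probability
    1 - exp(-flux / num_frames) (assumed binary-detection model).
    """
    counts = []
    for row in flux:
        out_row = []
        for lam in row:
            p_fire = 1.0 - math.exp(-lam / num_frames)
            fired = sum(1 for _ in range(num_frames) if rng.random() < p_fire)
            out_row.append(fired)
        counts.append(out_row)
    return counts


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    clean = [[0.2, 0.4], [0.6, 0.8]]
    flux = scale_to_ppp(clean, target_ppp=1.0)
    print(simulate_spad_counts(flux, num_frames=64, rng=rng))
```

Lowering `target_ppp` below 1 leaves most pixels with zero detections, which is the photon-starved regime the paper targets.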

Citation

@InProceedings{Goyal_2021_ICCV,
    author    = {Goyal, Bhavya and Gupta, Mohit},
    title     = {Photon-Starved Scene Inference Using Single Photon Cameras},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {2512-2521}
}

Owner

Bhavya Goyal
