PN-Net a neural field-based framework for depth estimation from single-view RGB images.

Last update: Oct 02, 2021

Related tags

Overview

PN-Net

We present a neural field-based framework for depth estimation from single-view RGB images. Rather than representing a 2D depth map as a single channel image, we define it as the iso-surface of a scalar field in an implicit space, which we introduce as the Pseudo 3D Space. We convert a 3D Depth Field into a 2D depth image utilizing an efficient and differentiable sphere tracing rendering algorithm. We introduce two further innovations. First, we present a Field Warping technique that simplifies the depth field estimation as a classification problem, which is far more efficient to learn than a regression task of learning a signed distance function (SDF). Second, we design the 3D Pseudo Normal from the 2D depth map, which is closely related to the actual 3D surface normal and can be computed from the depth field's implicit representation with an uncalibrated camera. Experiments validated our method's performance. Our Pseudo 3D Space simplifies the current implicit field learning and offers a consistent framework for advancing shape reconstruction from multiple cues.

Set up dataset path

Suppose your dataset is placed like this:

/absolute_path/bts_nyu_data/
    sync/
        ...
    official_splits/
        train/
            ...
        test/
            ...

Add in ~/.bashrc the following

export PNNET_NYU2_DATASET=/absolute_path/bts_nyu_data/

Train with

python train_bts_nyu_nd3.py -c configs/train_bts_nyu_nd3_tb_vis.json

This include pseudo normal and total bending loss.

PN-Net a neural field-based framework for depth estimation from single-view RGB images.

Related tags

Overview

PN-Net

Set up dataset path

Train with

Owner

Sign Language Transformers (CVPR'20)

Using Language Model to Bootstrap Human Activity Recognition Ambient Sensors Based in Smart Homes

ParmeSan: Sanitizer-guided Greybox Fuzzing

The final project of "Applying AI to 2D Medical Imaging Data" of "AI for Healthcare" nanodegree - Udacity.

Official implementation of "Motif-based Graph Self-Supervised Learning forMolecular Property Prediction"

Machine learning library for fast and efficient Gaussian mixture models

Official PyTorch implementation of "RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on" (IJCAI-ECAI 2022)

Lung Pattern Classification for Interstitial Lung Diseases Using a Deep Convolutional Neural Network

Transfer Learning Shootout for PyTorch's model zoo (torchvision)

MakeItTalk: Speaker-Aware Talking-Head Animation

Fast and Simple Neural Vocoder, the Multiband RNNMS

This is the research repository for Vid2Doppler: Synthesizing Doppler Radar Data from Videos for Training Privacy-Preserving Activity Recognition.

A collection of pre-trained StyleGAN2 models trained on different datasets at different resolution.

Final Project for the CS238: Decision Making Under Uncertainty course at Stanford University in Autumn '21.

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

A Learning-based Camera Calibration Toolbox

sssegmentation is a general framework for our research on strongly supervised semantic segmentation.

A rule-based log analyzer & filter

Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images

Jupyter Dock is a set of Jupyter Notebooks for performing molecular docking protocols interactively, as well as visualizing, converting file formats and analyzing the results.