HV-plane reconstruction from a single 360 image

Code for our paper in CVPR 2021: Indoor Panorama Planar 3D Reconstruction via Divide and Conquer (paper, video)

Pretrained models

Download our pretrained models from Google Drive or Dropbox.

Inference on 360 data

Resize your images to 512 x 1024.
Follow the preprocessing step here to ensure Manhattan alignment of your 360 images.
Run our inference script. For example:

python inference.py --pth ckpt/mp3d.pth --glob static/demo.png --outdir static/mp3d_model_results

To run on a batch of images, you can use --glob "AWESOME_360_IMAGES_DIR/*png"
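The resizing step above could be scripted before running inference. The following is a minimal sketch, not part of this repository: it assumes Pillow is installed, that "512 x 1024" means height x width (the usual 2:1 equirectangular layout), and that your panoramas are PNG files. The function names `resize_panorama` and `resize_dir` are hypothetical helpers introduced for illustration only.

```python
from pathlib import Path

from PIL import Image

# Assumption: 512 x 1024 is height x width, i.e. a 2:1 equirectangular panorama.
TARGET_W, TARGET_H = 1024, 512


def resize_panorama(src, dst):
    """Resize one panorama to TARGET_W x TARGET_H and save it to dst."""
    with Image.open(src) as img:
        # Pillow's resize takes (width, height).
        resized = img.convert("RGB").resize((TARGET_W, TARGET_H), Image.BICUBIC)
    resized.save(dst)
    return resized.size


def resize_dir(in_dir, out_dir, pattern="*.png"):
    """Resize every image in in_dir matching pattern; return the written paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for src in sorted(Path(in_dir).glob(pattern)):
        dst = out_dir / src.name
        resize_panorama(src, dst)
        written.append(dst)
    return written
```

For example, `resize_dir("raw_panoramas", "AWESOME_360_IMAGES_DIR")` would fill the directory used in the batch command above. The directory name `raw_panoramas` is a placeholder. Resizing does not replace the Manhattan alignment step, which still has to be done separately.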

Visualize the results

Here is a visualization example on held-out data:

python vis_planes.py --img static/demo.png --h_planes static/mp3d_model_results/demo.h_planes.exr --v_planes static/mp3d_model_results/demo.v_planes.exr --mesh

To always visualize all the planes, add --mesh_show_back_face.
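After batch inference, the visualization command has to be assembled once per image. The sketch below builds that command from an image path and a results directory. It assumes that inference writes `<image stem>.h_planes.exr` and `<image stem>.v_planes.exr` into the output directory, which is inferred only from the demo example above. `vis_command` is a hypothetical helper, not part of this repository.

```python
import shlex
from pathlib import Path


def vis_command(img_path, results_dir, mesh=True, show_back_face=False):
    """Build the vis_planes.py argument list for one image.

    Assumption: results are named <stem>.h_planes.exr and <stem>.v_planes.exr,
    as in the demo example; check your own output directory before relying on it.
    """
    stem = Path(img_path).stem
    results_dir = Path(results_dir)
    cmd = [
        "python", "vis_planes.py",
        "--img", Path(img_path).as_posix(),
        "--h_planes", (results_dir / f"{stem}.h_planes.exr").as_posix(),
        "--v_planes", (results_dir / f"{stem}.v_planes.exr").as_posix(),
    ]
    if mesh:
        cmd.append("--mesh")
    if show_back_face:
        cmd.append("--mesh_show_back_face")
    return cmd


if __name__ == "__main__":
    # Print the command for the demo image; pass the list to subprocess.run to execute it.
    print(shlex.join(vis_command("static/demo.png", "static/mp3d_model_results")))
```

Returning a list rather than a string lets the command be passed to `subprocess.run` without shell quoting issues.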

Citation

@inproceedings{SunHWSC21,
  author    = {Cheng Sun and
               Chi{-}Wei Hsiao and
               Ning{-}Hsu Wang and
               Min Sun and
               Hwann{-}Tzong Chen},
  title     = {Indoor Panorama Planar 3D Reconstruction via Divide and Conquer},
  booktitle = {CVPR},
  year      = {2021},
}

