VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

This repo provides code for QB-Norm (Cross Modal Retrieval with Querybank Normalisation)

Crowd-Kit is a powerful Python library that implements commonly-used aggregation methods for crowdsourced annotation and offers the relevant metrics and datasets

Deep Learning pipeline for motor-imagery classification.

Explaining Deep Neural Networks - A comparison of different CAM methods based on an insect data set

Building Ellee — A GPT-3 and Computer Vision Powered Talking Robotic Teddy Bear With Human Level Conversation Intelligence

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

A quick recipe to learn all about Transformers

Social Distancing Detector

Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"

IMBENS: class-imbalanced ensemble learning in Python.

nanodet_plus,yolov5_v6.0

Totally Versatile Miscellanea for Pytorch

The official github repository for Towards Continual Knowledge Learning of Language Models

Adaptive Graph Convolution for Point Cloud Analysis

[CVPR 2021] Generative Hierarchical Features from Synthesizing Images

We evaluate our method on different datasets (including ShapeNet, CUB-200-2011, and Pascal3D+) and achieve state-of-the-art results, outperforming all the other supervised and unsupervised methods and 3D representations, all in terms of performance, accuracy, and training time.

Randomized Correspondence Algorithm for Structural Image Editing

Converting CPT to bert form for use

Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier

Build Low Code Automated Tensorflow, What-IF explainable models in just 3 lines of code.