VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

Deep-learning X-Ray Micro-CT image enhancement, pore-network modelling and continuum modelling

PyTorch implementation of GLOM

Cross-platform-profile-pic-changer - Script to change profile pictures across multiple platforms

Adversarial Color Enhancement: Generating Unrestricted Adversarial Images by Optimizing a Color Filter

Adaptation through prediction: multisensory active inference torque control

MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition

The second project in Python course on FCC

Adaptable tools to make reinforcement learning and evolutionary computation algorithms.

Official PyTorch implementation of "Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks" (AAAI 2022)

Hierarchical probabilistic 3D U-Net, with attention mechanisms (—𝘈𝘵𝘵𝘦𝘯𝘵𝘪𝘰𝘯 𝘜-𝘕𝘦𝘵, 𝘚𝘌𝘙𝘦𝘴𝘕𝘦𝘵) and a nested decoder structure with deep supervision (—𝘜𝘕𝘦𝘵++).

Julia package for multiway (inverse) covariance estimation.

My implementation of DeepMind's Perceiver

This is the official implementation of "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval".

Open AI's Python library

PenguinSpeciesPredictionML - Basic model to predict Penguin species based on beak size and sex.

Jetson Nano-based smart camera system that measures crowd face mask usage in real-time.

ContourletNet: A Generalized Rain Removal Architecture Using Multi-Direction Hierarchical Representation

Invasive Plant Species Identification

ObsPy: A Python Toolbox for seismology/seismological observatories.

Domain Generalization with MixStyle, ICLR'21.