duralava

duralava is a neural network which can simulate a lava lamp in an infinite loop.

Example

This is not a real lava lamp but a "fake" one generated by duralava.

Novelty

duralava can

learn a physical process (a lava lamp).
generate an arbitrarily long sequence of output without diverging, even after hours (tens of thousands of frames).

How it works

Generative Adversarial Networks (GANs) can learn to generate new samples of data. For example, a GAN can be trained to output images of a lava lamp which look as real as possible. To accomplish this, the GAN gets an input vector with normally distributed noise. For duralava this vector is of length 64. Based on this random noise vector it generates a lava lamp image. The random vector thus encodes the state of the lava lamp.

During training, the GAN is shown real images of a lava lamp as well as its own generated (fake) ones, and it learns to make the fake ones look as real as possible.

For a lava lamp, a sequence of images has to be created. This sequence should in fact be infinite since a lava lamp can run forever. Thus the GAN should learn to output an arbitrarily long sequence of lava lamp images as a video. This is achieved by using a recurrent neural network (RNN). The RNN gets the 64-element noise vector of time step t and outputs the 64-element noise vector for time step t+1.
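The recurrence described above can be sketched in a few lines of plain Python. This is only an illustration of the loop structure: the function names make_toy_rnn and rollout, the tanh nonlinearity and the untrained random weights are assumptions made for this sketch and are not taken from duralava's code. In the real model, each state would additionally be passed to the GAN generator to render one frame.

```python
import math
import random

STATE_SIZE = 64  # length of the noise/state vector, as stated in the text


def make_toy_rnn(seed=0):
    """Return a stand-in recurrent step with random, untrained weights (hypothetical)."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(STATE_SIZE)
    weights = [[rng.gauss(0.0, scale) for _ in range(STATE_SIZE)]
               for _ in range(STATE_SIZE)]

    def step(state):
        # One recurrent update: state at time t -> state at time t+1.
        return [math.tanh(sum(w * s for w, s in zip(row, state)))
                for row in weights]

    return step


def rollout(step, initial_state, num_frames):
    """Yield num_frames successive states, starting with initial_state."""
    state = initial_state
    for _ in range(num_frames):
        yield state
        state = step(state)


rng = random.Random(1)
z0 = [rng.gauss(0.0, 1.0) for _ in range(STATE_SIZE)]  # initial noise vector
step = make_toy_rnn()
states = list(rollout(step, z0, 5))  # five consecutive lamp states
```

Because rollout is a generator, num_frames can be made as large as desired without storing the whole sequence, which mirrors the idea of an arbitrarily long video.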

The tricky part is to make sure that the state of the lava lamp (the 64-element random noise vector) remains stable. It could, for example, happen that over time the distribution of the values in the vector drifts away from a normal distribution, so that the mean becomes 10 and the standard deviation 52. In this case, the output images of the lava lamp wouldn't be correct anymore, as the GAN was trained to expect the input vector to be normally distributed. To solve this problem, I make sure that during training the output of the RNN stays normally distributed. This is accomplished by adding penalization terms to the training loss which discourage the noise from diverging from the normal distribution.
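The text does not spell out the exact form of the penalization terms. One plausible form, shown here purely as an assumption, penalizes the squared deviation of the state's mean from 0 and of its standard deviation from 1. The function name normality_penalty is hypothetical.

```python
import random
import statistics


def normality_penalty(state):
    """Assumed penalty: squared drift of the mean from 0 and of the std from 1."""
    mean = statistics.fmean(state)
    std = statistics.pstdev(state)
    return mean ** 2 + (std - 1.0) ** 2


random.seed(0)
# A state that still looks like standard normal noise.
healthy = [random.gauss(0.0, 1.0) for _ in range(64)]
# A state that has drifted as in the example above (mean 10, std 52).
drifted = [random.gauss(10.0, 52.0) for _ in range(64)]

penalty_healthy = normality_penalty(healthy)
penalty_drifted = normality_penalty(drifted)
print(penalty_healthy, penalty_drifted)
```

The drifted state receives a far larger penalty than the healthy one, so adding such a term to the training loss would push the RNN output back towards the distribution the generator expects.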

Low-hanging fruit

I trained on a MacBook Air with an M1 SoC with 16 GB of shared memory for CPU and GPU. Thus, memory was the limiting factor in my experiments.

With more memory, one could

Increase the resolution (currently 64x64 pixels)
Increase the training sequence length (currently 20)
Increase the batch size (currently 32)
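To get a feeling for how these three settings interact with memory, the following back-of-the-envelope sketch computes the size of one batch of raw training frames. The 3 color channels and 4 bytes per value (32-bit floats) are assumptions, and the helper frame_batch_bytes is hypothetical. Activations, gradients and optimizer state are not counted.

```python
def frame_batch_bytes(batch_size, seq_len, height, width,
                      channels=3, bytes_per_value=4):
    """Bytes needed to hold one batch of frame sequences (raw frames only)."""
    return batch_size * seq_len * height * width * channels * bytes_per_value


# Current settings from the list above: batch 32, sequence length 20, 64x64 pixels.
current = frame_batch_bytes(32, 20, 64, 64)
# Doubling the resolution to 128x128 quadruples the number of pixels.
doubled_resolution = frame_batch_bytes(32, 20, 128, 128)

print(current / 2**20, "MiB")             # 30.0 MiB
print(doubled_resolution / 2**20, "MiB")  # 120.0 MiB
```

Batch size and sequence length scale this figure linearly, while resolution scales it quadratically.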

Owner

Maximilian Bachl
