Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

Last update: Dec 23, 2022

Overview

StyleCLIPDraw

Peter Schaldenbrand, Zhixuan Liu, Jean Oh September 2021

To be featured in the 2021 NeurIPS Workshop on Machine Learning and Design

StyleCLIPDraw adds a style loss to the CLIPDraw (Frans et al. 2021) (code) text-to-drawing synthesis model to allow artistic control of the synthesized drawings in addition to control of the content via text. Whereas performing decoupled style transfer on a generated image only affects the texture, our proposed coupled approach is able to capture a style in both texture and shape, suggesting that the style of the drawing is coupled with the drawing process itself.

Checkout our code on Colab

Method

Unlike most other image generation models, CLIPDraw produces drawings consisting of a series of Bezier curves defined by a list of coordinates, a color, and an opacity. The drawing begins as randomized Bezier curves on a canvas and is optimized to fit the given style and text. The StyleCLIPDraw model architecture is shown above. The brush strokes are rendered into a raster image via differentiable model. There are two losses for StyleCLIPDraw that correspond to each input. The text input and the augmented raster drawing are fed the the CLIP model and the difference in embeddings are compared using cosine distance to compute a loss that encourages the drawing to fit the text input. The image is augmented to avoid finding shallow solutions to optimizing through the CLIP model. The raster image and the style image are fed through early layers of the VGG-16 model and the difference in extracted features form the loss that encourages the drawings to fit the style of the style image.

Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

Related tags

Overview

StyleCLIPDraw

Peter Schaldenbrand, Zhixuan Liu, Jean Oh September 2021

Method

Results

StyleCLIPDraw vs. CLIPDraw then Style Transfer

Owner

Peter Schaldenbrand

Bayesian inference for Permuton-induced Chinese Restaurant Process (NeurIPS2021).

Pytorch implementation code for [Neural Architecture Search for Spiking Neural Networks]

Official PyTorch Implementation of Embedding Transfer with Label Relaxation for Improved Metric Learning, CVPR 2021

An AutoML Library made with Optuna and PyTorch Lightning

The Unsupervised Reinforcement Learning Benchmark (URLB)

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations

Article Reranking by Memory-enhanced Key Sentence Matching for Detecting Previously Fact-checked Claims.

Pytorch implementation of four neural network based domain adaptation techniques: DeepCORAL, DDC, CDAN and CDAN+E. Evaluated on benchmark dataset Office31.

Project page for our ICCV 2021 paper "The Way to my Heart is through Contrastive Learning"

A tiny, pedagogical neural network library with a pytorch-like API.

PyTorch implementation of the paper Ultra Fast Structure-aware Deep Lane Detection

Face uncertainty quantification or estimation using PyTorch.

SFD implement with pytorch

Jax/Flax implementation of Variational-DiffWave.

The 1st place solution of track2 (Vehicle Re-Identification) in the NVIDIA AI City Challenge at CVPR 2021 Workshop.

A Runtime method overload decorator which should behave like a compiled language

Reference PyTorch implementation of "End-to-end optimized image compression with competition of prior distributions"

Self-Supervised Learning with Kernel Dependence Maximization

Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.