Skip to content

woctezuma/glide-text2im-colab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

GLIDE text2im on Colab

This repository provides a Colab notebook to produce images conditioned on text prompts with GLIDE [1].

Usage

Tip: press <Ctrl+F9> to run everything.

Results

The process is based on the small, filtered-data GLIDE model, with classifier-free guidance.

Results consist of 64x64 images, and the corresponding 256x256 upsampled versions.

Expected run-time: 2m30s (for the one-time set-up), 1 min (64x64 sampling), 30 sec (256x256 upsampling).


SampleSampleSample

Several uncurated samples obtained with the same prompt: "a magnificent French rooster singing".

Safety considerations

The small model has 300 million parameters, compared to the unreleased 3.5 billion parameter model.

As described in Appendix F.1, the training dataset was filtered so that it would not contain:

  • images of humans and human-like objects,
  • images of violent objects,
  • two prevalent hate symbols in America (swastika and confederate flag).

References

[1] Alex Nichol, Prafulla Dhariwal, Aditya Ramesh, et al. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models. arXiv preprint 2112.10741. 2021.