Augmented CLIP - Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

Last update: Sep 13, 2022

Related tags

Overview

Train aug_clip against laion400m-embeddings found here: https://laion.ai/laion-400-open-dataset/ - note that this used the base ViT-B/32 CLIP model.

Sample notebook adapted from Sadnow's 360Diffusion repo, thanks to all involved!

Latest revision: Beta 1.52 (10/11/21): https://colab.research.google.com/github/sadnow/360Diffusion/blob/main/360Diffusion_Public.ipynb

Latest highlights: Full compatibility for both 256 and 512 model for upscaling to 256,512,1024,2048, and 4096px.

Note that 4096 files aren’t quite as pretty as 2048, and they’re massive in file size. 2048 is appealing in most cases. If you intend on upscaling to anything higher than 1024, I recommend using the 512 diffusion model found in the settings-

Credits & Acknowledgements

Katherine Crowson (https://github.com/crowsonkb, https://twitter.com/RiversHaveWings)
Founder of OG Diffusion Notebook Original notebook founder; [I think] has a large involvement in both VQGAN and Diffusion!
Daniel Russell (https://github.com/russelldc, https://twitter.com/danielrussruss) Fast Diffusion Fork Founder Made the OG Fast Diffusion notebook.
Dango233 and nsheppard Contributed to Daniel’s Fast Diffusion Notebook
Sadnow (twitter.com/sadly_existent) 360Diffusion Fork Founder Forked Daniel Russel’s Fast Diffusion Notebook to include Real-ESRGAN integration-
airguitararchon (steven) Init Research
Everyone else on the VQLIPSE Discord (https://www.patreon.com/sportsracer48); Support & Research

Prior release(s): Implemented Daniel Russ’s Perlin revisions, fixed init_bug, 4096 double-pass, VRAM fixes, practical debug_mode (set to higher skip_timestep)

All edits & additions are welcome and appreciated~

Augmented CLIP - Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

Related tags

Overview

Train aug_clip against laion400m-embeddings found here: https://laion.ai/laion-400-open-dataset/ - note that this used the base ViT-B/32 CLIP model.

Sample notebook adapted from Sadnow's 360Diffusion repo, thanks to all involved!

Owner

Peter Baylies

Sparse-dense operators implementation for Paddle

UT-Sarulab MOS prediction system using SSL models

Large scale embeddings on a single machine.

[Link]mareteutral - pars tradg wth M []

Code for our EMNLP 2021 paper "Learning Kernel-Smoothed Machine Translation with Retrieved Examples"

Official pytorch implementation of Rainbow Memory (CVPR 2021)

A modern pure-Python library for reading PDF files

Code repository for the paper: Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild (ICCV 2021)

Cross-Document Coreference Resolution

An efficient toolkit for Face Stylization based on the paper "AgileGAN: Stylizing Portraits by Inversion-Consistent Transfer Learning"

Code repository accompanying the paper "On Adversarial Robustness: A Neural Architecture Search perspective"

GraphLily: A Graph Linear Algebra Overlay on HBM-Equipped FPGAs

Rapid experimentation and scaling of deep learning models on molecular and crystal graphs.

Drone Task1 - Drone Task1 With Python

This repository contains a pytorch implementation of "HeadNeRF: A Real-time NeRF-based Parametric Head Model (CVPR 2022)".

Patch SVDD for Image anomaly detection

Open-source implementation of Google Vizier for hyper parameters tuning

《Improving Unsupervised Image Clustering With Robust Learning》(2020)

This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification".

Some experiments with tennis player aging curves using Hilbert space GPs in PyMC. Only experimental for now.