A simple library that implements CLIP guided loss in PyTorch.

Last update: Dec 26, 2022

Overview

pytorch_clip_guided_loss: Pytorch implementation of the CLIP guided loss for Text-To-Image, Image-To-Image, or Image-To-Text generation.

A simple library that implements CLIP guided loss in PyTorch.

Install package

pip install pytorch_clip_guided_loss

Install the latest version

pip install --upgrade git+https://github.com/bes-dev/pytorch_clip_guided_loss.git

Features

The library supports multiple prompts (images or texts) as targets for optimization.
The library automatically detects the language of the input text, and multilingual translate it via google translate.
The library supports the original CLIP model by OpenAI and ruCLIP model by SberAI.

Usage

Simple code

import torch
from pytorch_clip_guided_loss import get_clip_guided_loss

loss_fn = get_clip_guided_loss(clip_type="ruclip", input_range = (-1, 1)).eval().requires_grad_(False)
# text prompt
loss_fn.add_prompt(text="text description of the what we would like to generate")
# image prompt
loss_fn.add_prompt(image=torch.randn(1, 3, 224, 224))

# variable
var = torch.randn(1, 3, 224, 224).requires_grad_(True)
loss = loss_fn(image=var)["loss"]
loss.backward()
print(var.grad)

VQGAN-CLIP

We provide our tiny implementation of the VQGAN-CLIP pipeline for image generation as an example of the usage of our library. To start using our implementation of the VQGAN-CLIP please follow by documentation.

A simple library that implements CLIP guided loss in PyTorch.

Related tags

Overview

pytorch_clip_guided_loss: Pytorch implementation of the CLIP guided loss for Text-To-Image, Image-To-Image, or Image-To-Text generation.

Install package

Install the latest version

Features

Usage

Simple code

VQGAN-CLIP

Owner

Sergei Belousov

Python based Advanced AI Assistant

deep learning model with only python and numpy with test accuracy 99 % on mnist dataset and different optimization choices

GAT - Graph Attention Network (PyTorch) 💻 + graphs + 📣 = ❤️

Reproducing code of hair style replacement method from Barbershorp.

YOLOX-RMPOLY

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

[ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction

Geometric Vector Perceptrons --- a rotation-equivariant GNN for learning from biomolecular structure

Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset

Self-labelling via simultaneous clustering and representation learning. (ICLR 2020)

RODD: A Self-Supervised Approach for Robust Out-of-Distribution Detection

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

Experimental code for paper: Generative Adversarial Networks as Variational Training of Energy Based Models

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

The code for Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation

fastgradio is a python library to quickly build and share gradio interfaces of your trained fastai models.

Code I use to automatically update my videos' metadata on YouTube

A repo with study material, exercises, examples, etc for Devnet SPAUTO

PyTorch implementations of deep reinforcement learning algorithms and environments

Repository for the paper "From global to local MDI variable importances for random forests and when they are Shapley values"