A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for text-guided image generation.

Last update: Dec 30, 2022

Overview

StyleGAN3 CLIP-based guidance

StyleGAN3 + CLIP

StyleGAN3 + inversion + CLIP

This repo is a collection of Jupyter notebooks that make it easy to play with StyleGAN3¹ and CLIP² for text-guided image generation.
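To give a feel for what "text-guided" means here, the following is a minimal sketch of the general idea, not the code used in the notebooks. It assumes a toy setup: a fixed linear map (`toy_embed`, hypothetical) stands in for "generator, then CLIP image encoder", and a hand-written vector stands in for the CLIP text embedding. The latent is nudged by gradient descent so that the image embedding points in the same direction as the text embedding. All names and numbers below are illustrative assumptions; the real notebooks rely on StyleGAN3, CLIP, and automatic differentiation instead of the finite differences used here.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors given as lists of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def toy_embed(latent, weights):
    """Hypothetical stand-in for generator + image encoder: a fixed linear map."""
    return [sum(w * z for w, z in zip(row, latent)) for row in weights]


def guidance_loss(latent, weights, text_embedding):
    """Loss is low when the 'image' embedding aligns with the 'text' embedding."""
    return 1.0 - cosine_similarity(toy_embed(latent, weights), text_embedding)


def numerical_grad(f, latent, eps=1e-5):
    """Central finite differences; a real setup would use autograd instead."""
    grad = []
    for i in range(len(latent)):
        up = latent[:i] + [latent[i] + eps] + latent[i + 1:]
        down = latent[:i] + [latent[i] - eps] + latent[i + 1:]
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad


def optimize_latent(latent, weights, text_embedding, steps=200, lr=0.1):
    """Plain gradient descent on the latent; returns the latent and loss history."""
    def f(z):
        return guidance_loss(z, weights, text_embedding)

    history = [f(latent)]
    for _ in range(steps):
        grad = numerical_grad(f, latent)
        latent = [z - lr * g for z, g in zip(latent, grad)]
        history.append(f(latent))
    return latent, history


# Illustrative values only: a 3x2 "encoder", a 3-d "text embedding", a 2-d latent.
weights = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
text_embedding = [1.0, 2.0, 3.0]
start_latent = [1.0, -0.5]

final_latent, history = optimize_latent(start_latent, weights, text_embedding)
print(f"loss: {history[0]:.4f} -> {history[-1]:.4f}")
```

In this toy, only the direction of the embedding matters, which mirrors how CLIP similarity is usually measured on normalized embeddings.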

Both notebooks are heavily based on this notebook, created by nshepperd (thank you!).

Special thanks also to Katherine Crowson for coming up with many improved sampling tricks, as well as for some of the code.

Feel free to suggest any changes! If anyone has an idea of what license this repo should use, please let me know.

¹ StyleGAN3 was created by NVIDIA. Here is the original repo.
² CLIP (Contrastive Language-Image Pre-Training) is a multimodal model made by OpenAI. For more information, head over here.

Owner

Eugenio Herrera

Data Scientist, Full-Stack Engineer, and aspiring Researcher.

GitHub Repository

Simple and understandable swin-transformer OCR project

swin-transformer-ocr ocr with swin-transformer Overview Simple and understandable swin-transformer OCR project. The model in this repository heavily r

67 Dec 31, 2022

MetaDrive: Composing Diverse Scenarios for Generalizable Reinforcement Learning

MetaDrive: Composing Diverse Driving Scenarios for Generalizable RL [ Documentation | Demo Video ] MetaDrive is a driving simulator with the following

276 Jan 04, 2023

This is the codebase for the ICLR 2021 paper Trajectory Prediction using Equivariant Continuous Convolution

Trajectory Prediction using Equivariant Continuous Convolution (ECCO) This is the codebase for the ICLR 2021 paper Trajectory Prediction using Equivar

45 Jul 22, 2022

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data.

13 Oct 07, 2022

This repository contains source code for the Situated Interactive Language Grounding (SILG) benchmark

SILG This repository contains source code for the Situated Interactive Language Grounding (SILG) benchmark. If you find this work helpful, please cons

17 Nov 27, 2022

Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss （ATVGnet）

Hierarchical Cross-modal Talking Face Generation with Dynamic Pixel-wise Loss （ATVGnet） By Lele Chen , Ross K Maddox, Zhiyao Duan, Chenliang Xu. Unive

218 Dec 27, 2022

A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swar.

Omni-swarm A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swarm Introduction Omni-swarm is a decentralized omn

99 Dec 23, 2022

Source code for EquiDock: Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking (ICLR 2022)

Source code for EquiDock: Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking (ICLR 2022) Please cite "Independent SE(3)-Equivar

154 Jan 02, 2023

keyframes-CNN-RNN(action recognition)

keyframes-CNN-RNN(action recognition) Environment: python=3.7 pytorch=1.2 Datasets: Following the format of UCF101 action recognition. Run steps: Mo

4 Feb 09, 2022

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

SwinTextSpotter This is the pytorch implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text R

183 Jan 03, 2023

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

CSAW-M This repository contains code for CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer. Source code for tr

7 Oct 11, 2022

An intuitive library to extract features from time series

Time Series Feature Extraction Library Intuitive time series feature extraction This repository hosts the TSFEL - Time Series Feature Extraction Libra

589 Jan 04, 2023

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

A310 Computational Neuroscience - Okinawa Institute of Science and Technology, 2022 This repository contains modeling practice materials and homework

1 Jan 24, 2022

GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles

GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles This repository contains a method to generate 3D conformer ensembles direct

127 Dec 20, 2022

bio_inspired_min_nets_improve_the_performance_and_robustness_of_deep_networks

Code Submission for: Bio-inspired Min-Nets Improve the Performance and Robustness of Deep Networks Run with docker To build a docker environment, chan

0 Dec 09, 2021

This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

PFD：Pose-guided Feature Disentangling for Occluded Person Re-identification based on Transformer This repo is the official implementation of "Pose-gui

93 Dec 18, 2022

BrainGNN - A deep learning model for data-driven discovery of functional connectivity

Code for "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds" @ICRA2021

Semantic code search implementation using Tensorflow framework and the source code data from the CodeSearchNet project

Air Quality Prediction Using LSTM
