Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

Last update: Dec 19, 2022

Abstract: We introduce a method that automatically segments images into semantically meaningful regions without human supervision. The derived regions are consistent across different images and coincide with human-defined semantic classes on some datasets. In cases where semantic regions might be hard for humans to define and consistently label, our method is still able to find meaningful and consistent semantic classes. In our work, we use a pretrained StyleGAN2 generative model: clustering in the feature space of the generative model allows us to discover semantic classes. Once classes are discovered, a synthetic dataset of generated images and corresponding segmentation masks can be created. A segmentation model is then trained on the synthetic dataset and is able to generalize to real images. Additionally, by using CLIP, we are able to use natural-language prompts to discover desired semantic classes. We test our method on publicly available datasets and show state-of-the-art results.
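
The clustering step described in the abstract can be illustrated with a small, self-contained sketch. This is not the repository's code. It assumes plain k-means as the clustering algorithm (the abstract does not name one) and uses synthetic arrays in place of real StyleGAN2 feature maps. The function names `kmeans`, `cluster_feature_maps`, and `make_toy_features` are hypothetical and exist only for illustration.

```python
import numpy as np


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on an (N, C) array; returns (labels, centroids)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points (fancy indexing copies).
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        # Squared Euclidean distance of every point to every centroid: (N, k).
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # leave an empty cluster's centroid unchanged
                centroids[j] = members.mean(axis=0)
    # Final assignment so the labels match the returned centroids.
    dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1), centroids


def cluster_feature_maps(features, k):
    """Cluster per-pixel features of shape (N, C, H, W) into (N, H, W) masks.

    All images are clustered jointly, so a given label refers to the same
    cluster in every image.
    """
    n, c, h, w = features.shape
    pixels = features.transpose(0, 2, 3, 1).reshape(-1, c)
    labels, _ = kmeans(pixels, k)
    return labels.reshape(n, h, w)


def make_toy_features(n=4, c=16, h=8, w=8, seed=1):
    """Synthetic stand-in for generator features (an assumption, not real data):
    the left half of each map is near 0 and the right half is near 5."""
    rng = np.random.default_rng(seed)
    feats = rng.normal(0.0, 0.1, size=(n, c, h, w))
    feats[:, :, :, w // 2:] += 5.0
    return feats


masks = cluster_feature_maps(make_toy_features(), k=2)
print(masks.shape)
```

Because the pixels of all images are clustered together, each cluster index is shared across images, which is the property that would let such masks serve as labels for a synthetic training set. With a real generator, the random arrays would be replaced by intermediate activations captured while sampling images.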

This repository contains the official PyTorch implementation of the following paper:

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP
Daniil Pakhomov, Sanchit Hira, Narayani Wagle, Kemar E. Green, Nassir Navab
https://arxiv.org/abs/2107.12518

Owner

Daniil Pakhomov
