HairCLIP: Design Your Hair by Text and Reference Image

Last update: Jan 06, 2023

Overview

This repository hosts the official PyTorch implementation of the paper: "HairCLIP: Design Your Hair by Text and Reference Image".

Our single framework supports hairstyle and hair color editing, individually or jointly, and the conditional inputs can come from either the image domain or the text domain.

Tianyi Wei¹, Dongdong Chen², Wenbo Zhou¹, Jing Liao³, Zhentao Tan¹, Lu Yuan², Weiming Zhang¹, Nenghai Yu¹
¹University of Science and Technology of China, ²Microsoft Cloud AI, ³City University of Hong Kong

Abstract

Hair editing is an interesting and challenging problem in computer vision and graphics. Many existing methods require well-drawn sketches or masks as conditional inputs for editing; however, these interactions are neither straightforward nor efficient. In order to free users from the tedious interaction process, this paper proposes a new hair editing interaction mode, which enables manipulating hair attributes individually or jointly based on the texts or reference images provided by users. For this purpose, we encode the image and text conditions in a shared embedding space and propose a unified hair editing framework by leveraging the powerful image-text representation capability of the Contrastive Language-Image Pre-Training (CLIP) model. With the carefully designed network structures and loss functions, our framework can perform high-quality hair editing in a disentangled manner. Extensive experiments demonstrate the superiority of our approach in terms of manipulation accuracy, visual realism of editing results, and irrelevant attribute preservation.
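
The idea of a shared embedding space for text and image conditions can be illustrated with a small, self-contained sketch. This is not the repository's code: the vectors below are made-up toy values standing in for CLIP text and image embeddings, and the helper names (l2_normalize, cosine_similarity, select_condition) are hypothetical. It only shows why a condition from either modality can be treated interchangeably once both live in the same space.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so that dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimensionality")
    a_unit, b_unit = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_unit, b_unit))


def select_condition(text_emb=None, image_emb=None):
    """Return a unit-length condition vector from whichever modality was given.

    Assumption for this sketch: exactly one of the two inputs is provided,
    and both kinds of embedding share the same dimensionality.
    """
    if (text_emb is None) == (image_emb is None):
        raise ValueError("provide exactly one of text_emb or image_emb")
    return l2_normalize(text_emb if text_emb is not None else image_emb)


# Toy stand-ins for embeddings (hypothetical values, not real CLIP outputs).
text_hairstyle = [0.9, 0.1, 0.0, 0.2]   # a text description of a hairstyle
image_hairstyle = [0.8, 0.2, 0.1, 0.1]  # a reference image showing a similar hairstyle
image_unrelated = [0.0, 0.1, 0.9, 0.3]  # a reference image with a different hairstyle

# Matching text/image pairs should score higher than mismatched ones.
sim_match = cosine_similarity(text_hairstyle, image_hairstyle)
sim_mismatch = cosine_similarity(text_hairstyle, image_unrelated)

# Either modality yields a condition vector of the same shape.
cond_from_text = select_condition(text_emb=text_hairstyle)
cond_from_image = select_condition(image_emb=image_hairstyle)
```

In a real setup the toy vectors would be replaced by the outputs of CLIP's text and image encoders; everything downstream of select_condition would stay agnostic to which modality supplied the condition.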

Comparison

Comparison to Text-Driven Image Manipulation Methods

Comparison to Hair Transfer Methods

Application

Hair Interpolation

Generalization Ability to Unseen Descriptions

Cross-Modal Conditional Inputs

To Do
