Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Last update: Jan 06, 2023

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Splice is a method for semantic appearance transfer, as described in Splicing ViT Features for Semantic Appearance Transfer (link to paper).

Given two input images—a source structure image and a target appearance image–our method generates a new image in which the structure of the source image is preserved, while the visual appearance of the target image is transferred in a semantically aware manner. That is, objects in the structure image are “painted” with the visual appearance of semantically related objects in the appearance image. Our method leverages a self-supervised, pre-trained ViT model as an external semantic prior. This allows us to train our generator only on a single input image pair, without any additional information (e.g., segmentation/correspondences), and without adversarial training. Thus, our framework can work across a variety of objects and scenes, and can generate high quality results in high resolution (e.g., HD).

Getting Started

Installation

git clone https://github.com/omerbt/Splice.git
pip install -r requirements.txt

Run examples

Run the following command to start training

python train.py --dataroot datasets/cows

Intermediate results will be saved to /out/output.png during optimization. The frequency of saving intermediate results is indicated in the save_epoch_freq flag of the configuration.

Sample Results

Citation

@article{Splice2022,
    author = {Tumanyan, Narek
              and Bar-Tal, Omer
              and Bagon, Shai
              and Dekel, Tali
              },
    title = {Splicing ViT Features for Semantic Appearance Transfer}, 
    journal = {arXiv preprint arXiv:2201.00424},
    year  = {2022}
}

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Related tags

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Getting Started

Installation

Run examples

Sample Results

Citation

Owner

Omer Bar Tal

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

A library for finding knowledge neurons in pretrained transformer models.

Mask-invariant Face Recognition through Template-level Knowledge Distillation

Code related to the manuscript "Averting A Crisis In Simulation-Based Inference"

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

OpenVisionAPI server

Code of paper: "DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks"

.NET bindings for the Pytorch engine

An implementation of Deep Graph Infomax (DGI) in PyTorch

Code for Low-Cost Algorithmic Recourse for Users With Uncertain Cost Functions

Trains an agent with stochastic policy gradient ascent to solve the Lunar Lander challenge from OpenAI

Self-Learning - Books Papers, Courses & more I have to learn soon

A Temporal Extension Library for PyTorch Geometric

Data augmentation for NLP, accepted at EMNLP 2021 Findings

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

A computer vision pipeline to identify the "icons" in Christian paintings

Why Are You Weird? Infusing Interpretability in Isolation Forest for Anomaly Detection

an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch

Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'