ArcaneGAN by Alex Spirin

Last update: Dec 28, 2022

Overview

Changelog

2021-12-12 ArcaneGAN v0.3 is live
2021-12-09 Thanks to ak92501, we now have a Hugging Face demo

ArcaneGAN v0.3

Videos processed by the Hugging Face video inference colab.

obama2.mp4

ryan2.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

ArcaneGAN v0.2

The release is here

Implementation Details

It does something, but not much at the moment.

The model is a PyTorch *.jit of a fastai v1 flavored U-Net trained on a paired dataset generated via a blended StyleGAN2. You can see the blending colab I've used here.

Comments

How to convert the FastAI model to Pytorch JIT

Hi,

I trained a model with unet_learner but I can't convert it to jit.

I run the following code: torch.jit.save(torch.jit.script(learn.model), 'jit.pt')

Here is the error:

UnsupportedNodeError: GeneratorExp aren't supported:
  File "/usr/local/lib/python3.7/dist-packages/fastai/callbacks/hooks.py", line 21
    "Applies `hook_func` to `module`, `input`, `output`."
    if self.detach:
        input  = (o.detach() for o in input ) if is_listy(input ) else input.detach()
                 ~ <--- HERE
        output = (o.detach() for o in output) if is_listy(output) else output.detach()
    self.stored = self.hook_func(module, input, output)

May I know how you convert it to a jit model? Thanks
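
A workaround often suggested for this kind of error is to trace the model instead of scripting it. `torch.jit.trace` runs the model once on an example input and records the tensor operations, so Python-only constructs such as the generator expression in the hook are executed rather than compiled. The sketch below is not the author's confirmed export procedure: `TinyNet` and `detach_hook` are hypothetical stand-ins for `learn.model` and the fastai hook, and the 64x64 input size is arbitrary.

```python
import os
import tempfile

import torch
from torch import nn


class TinyNet(nn.Module):
    """Hypothetical stand-in for a trained image-to-image model."""

    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 3, kernel_size=3, padding=1)

    def forward(self, x):
        return torch.tanh(self.conv(x))


stored = []


def detach_hook(module, inputs, output):
    # Generator expression, similar to the one in the fastai hook from the traceback.
    stored.append(tuple(o.detach() for o in inputs))


model = TinyNet().eval()
model.conv.register_forward_hook(detach_hook)

# Tracing needs an example input; its shape here is an arbitrary assumption.
example = torch.rand(1, 3, 64, 64)
with torch.no_grad():
    traced = torch.jit.trace(model, example)

# Save and reload the traced module, then run it like a regular function.
path = os.path.join(tempfile.mkdtemp(), "tiny.jit")
torch.jit.save(traced, path)
loaded = torch.jit.load(path, map_location="cpu")

with torch.no_grad():
    out = loaded(example)
print(out.shape)
```

With a real learner, one would presumably pass `learn.model.eval()` and an example tensor at the training resolution. Note that tracing bakes in any control flow that depends on the input, so the result should be checked against the original model on a few images.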

opened by ramtiin 2
Error

Good evening. The ArcaneGAN colab for videos gives this error:

RuntimeError: CUDA out of memory. Tried to allocate 2.80 GiB (GPU 0; 11.17 GiB total capacity; 5.74 GiB already allocated; 2.21 GiB free; 8.44 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

Please help!
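
The error message names one knob (`max_split_size_mb` via `PYTORCH_CUDA_ALLOC_CONF`), and a second common remedy is to shrink the frames before inference. Here is a minimal sketch of both, assuming they apply to this notebook. The 128 MB split size, the pixel budget, the multiple of 32 and the helper name `downscale_to_budget` are illustrative assumptions, not values taken from the colab.

```python
import math
import os

# Allocator option named in the error message. It has to be set before the
# first CUDA allocation (simplest: before importing torch). 128 is an
# arbitrary example value.
os.environ.setdefault("PYTORCH_CUDA_ALLOC_CONF", "max_split_size_mb:128")


def downscale_to_budget(width, height, max_pixels=1280 * 720, multiple=32):
    """Shrink (never enlarge) a frame size so it has at most `max_pixels`
    pixels, roughly keeping the aspect ratio and rounding both sides down
    to a multiple of `multiple`."""
    scale = min(1.0, math.sqrt(max_pixels / (width * height)))
    new_w = max(multiple, int(width * scale) // multiple * multiple)
    new_h = max(multiple, int(height * scale) // multiple * multiple)
    return new_w, new_h


print(downscale_to_budget(640, 480))  # (640, 480): already within the budget
```

Frames would then be resized to the returned size before being passed to the model. A smaller pixel budget lowers peak memory at the cost of output detail.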

opened by Zzip7 2
How do you change the style of the whole image

Nice work! My only confusion is how you change the style of the whole image instead of just the face. Usually, StyleGAN generates aligned face images by fine-tuning the FFHQ checkpoint. How does the pix2pix model trained with these face image pairs work with the full image or frame?

opened by zhanglonghao1992 2
Architecture for video

Hi, what does the architecture look like? Is it similar to Pix2Pix? And for processing of the video, are you doing anything extra to make sure the frames are consistent?

opened by unography 2
How to prevent eyes from appearing in the nose?

Hello, I tried your model and it's amazing, but I found that in some pictures, if the nose is too big, eyes appear in the nose. Lowering 'target_face' helps, but details such as the light in the eyes and the background are also lost when I lower it. Is there a way to prevent eyes from appearing in the nose while keeping the details?

opened by Folkfive 1
support arbitrary image size?

Great work!

The U-Net prediction is cropped to the same size as the training input, e.g. 256 or 512. For an arbitrary image size (e.g. 1280×720), how can I configure the model to output the same size as the input image, as your colab does? Thank you.
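
One common way to feed arbitrary sizes to a U-Net-style network is to pad the image so each side is divisible by the network's total downsampling factor, run the model, and crop the padding off again. This assumes the exported model is fully convolutional and is called directly on a tensor rather than through a data pipeline that crops. Whether the colab does this is not stated here; the factor 32 and the helper names below are assumptions.

```python
def pad_to_multiple(width, height, multiple=32):
    """Return the padded size and the (right, bottom) padding needed so that
    both sides are divisible by `multiple`."""
    pad_w = (-width) % multiple
    pad_h = (-height) % multiple
    return (width + pad_w, height + pad_h), (pad_w, pad_h)


def crop_box(width, height):
    """Box (left, top, right, bottom) that removes right/bottom padding
    and restores the original size."""
    return (0, 0, width, height)


padded, padding = pad_to_multiple(1280, 720)
print(padded, padding)       # (1280, 736) (0, 16)
print(crop_box(1280, 720))   # (0, 0, 1280, 720)
```

For the 1280×720 example, only the height needs 16 extra rows; after inference the output is cropped back with the box above.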

opened by foobarhe 1
RuntimeError: CUDA out of memory

Good evening. Sorry, it's me again. This error has appeared again. Can I fix this error myself, or can only you fix it? Please explain in detail.

opened by Zzip7 1
About the paired datasets generated by StyleGAN

How do you ensure that the background and expression stay similar between the generated input (face) and the target (styled face)? I find that the style is too weak with less fine-tuning and the similarity is too weak with more fine-tuning. How do you solve this? Would you be willing to share the code that generates the paired dataset? Thanks a lot!

opened by Leocien 1
Any news for training code?

Interesting topic... I wonder how you trained the model, especially the augmentation part. The fixed-crop limitation is a well-known problem, and I would like to know how you handle it. :)

opened by dongyun-kim-arch 0
tuple issue

I was trying the ArcaneGAN video colab, but I am running into a tuple issue. Can you please help? I am really excited to try it on video.

opened by mau021 0
What GPU is used for training?

Hi,

I want to train the Fastai u-net model. However, when I try to train the critic (learn_critic.fit_one_cycle(6, 1e-3)), I get the following error:

CUDA out of memory. Tried to allocate 4.00 GiB (GPU 0; 14.76 GiB total capacity; 9.78 GiB already allocated; 891.75 MiB free; 12.57 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

The GPU is a Tesla T4 with 16 GB of VRAM. My batch size is 4 and the training image size is 512×512. I also tried lower values, but I still get the same error.

opened by ramtiin 2
How to make the style stronger?

The following are the input image, my training output from paired-label supervision, and the output from your test model. I trained my model (a super-resolution model) on the outputs of your model, but I find it difficult to change the facial features. In your results the eyes and face texture are changed; how can I achieve that? I use L1Loss (weight 1) + PerceptualLoss (weight 1) + GANLoss (weight 0.1).
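
To see how much each term in such a weighted sum contributes, the arithmetic can be sketched as below. The weights are the ones quoted in the post, while the raw loss values are made-up placeholders, so the printed shares only illustrate the calculation and say nothing about the actual training run.

```python
def weighted_loss(terms, weights):
    """Combine named loss terms into a total and report each term's share."""
    weighted = {name: weights[name] * value for name, value in terms.items()}
    total = sum(weighted.values())
    shares = {name: value / total for name, value in weighted.items()}
    return total, shares


# Weights from the post; the raw loss values are hypothetical placeholders.
weights = {"l1": 1.0, "perceptual": 1.0, "gan": 0.1}
terms = {"l1": 0.05, "perceptual": 0.80, "gan": 0.70}

total, shares = weighted_loss(terms, weights)
for name, share in shares.items():
    print(f"{name}: {share:.1%} of the total")
```

If the adversarial share turns out to be small in a real run, raising its weight is one thing to experiment with, though that is a guess rather than a confirmed fix.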

opened by xuanandsix 1

Releases(v0.4)

v0.4(Dec 25, 2021)
ArcaneGAN v0.4

The main differences are:

lighter styling (closer to original input)

sharper result

happier faces

reduced childish eyes effect

reduced stubble on feminine faces

increased temporal stability on videos

reduced mouth/teeth artifacts

Image samples

v0.3 vs v0.4

Video samples

https://user-images.githubusercontent.com/11751592/146966428-f4e27929-19dd-423f-a772-8aee709d2116.mp4

https://user-images.githubusercontent.com/11751592/146966462-6511998e-77f5-4fd2-8ad9-5709bf0cd172.mp4
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.4.jit(59.75 MB)
v0.3(Dec 12, 2021)

ArcaneGAN v0.3

Video samples

This is a stronger-styled version. It performs okay on videos, though visible flickering is present. Here are some video examples.

https://user-images.githubusercontent.com/11751592/145702737-c02b8b00-ad30-4358-98bf-97c8ad7fefdf.mp4

https://user-images.githubusercontent.com/11751592/145702740-afd3377d-d117-467d-96ca-045e25d85ac6.mp4

Image samples

Faces were enhanced via GPEN before applying the ArcaneGAN v0.3 filter.

The model is a PyTorch *.jit of a fastai v1 flavored U-Net trained on a paired dataset generated via a blended StyleGAN2. You can see the blending colab I've used here.
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.3.jit(79.40 MB)
v0.2(Dec 7, 2021)

ArcaneGAN v0.2

This version is a bit better at doing something other than making images darker :D

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

The model is a PyTorch *.jit of a fastai v1 flavored U-Net trained on a paired dataset generated via a blended StyleGAN2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.2.jit(79.52 MB)
v0.1(Dec 6, 2021)

ArcaneGAN v0.1

This is a proof of concept release. The model is in beta (which means it's beta than nothin')

Here are some image pairs. I've specifically picked various images to see how the model performs in the wild, not on aligned and cropped faces.

It does something, but not much at the moment.

The model is a PyTorch *.jit of a fastai v1 flavored U-Net trained on a paired dataset generated via a blended StyleGAN2. You can see the blending colab I've used here.

Inference notebook is here
Source code(tar.gz)
Source code(zip)
ArcaneGANv0.1.jit(79.53 MB)
