Styled Handwritten Text Generation with Transformers (ICCV 21)

Last update: Dec 22, 2022

Overview

⚡ Handwriting Transformers [PDF]

Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan & Mubarak Shah

Abstract: We propose a novel transformer-based styled handwritten text image generation approach, HWT, that strives to learn both style-content entanglement as well as global and local writing style patterns. The proposed HWT captures the long and short range relationships within the style examples through a self-attention mechanism, thereby encoding both global and local style patterns. Further, the proposed transformer-based HWT comprises an encoder-decoder attention that enables style-content entanglement by gathering the style representation of each query character. To the best of our knowledge, we are the first to introduce a transformer-based generative network for styled handwritten text generation. Our proposed HWT generates realistic styled handwritten text images and significantly outperforms the state-of-the-art demonstrated through extensive qualitative, quantitative and human-based evaluations. The proposed HWT can handle arbitrary length of text and any desired writing style in a few-shot setting. Further, our HWT generalizes well to the challenging scenario where both words and writing style are unseen during training, generating realistic styled handwritten text images.

Software environment

Python 3.7
PyTorch >=1.4

Setup & Training

Please see INSTALL.md for installing required libraries. You can change the content in the file mytext.txt to visualize generated handwriting while training.

Citation

If you use the code for your research, please cite our paper:

@InProceedings{Bhunia_2021_ICCV,
    author    = {Bhunia, Ankan Kumar and Khan, Salman and Cholakkal, Hisham and Anwer, Rao Muhammad and Khan, Fahad Shahbaz and Shah, Mubarak},
    title     = {Handwriting Transformers},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {1086-1094}
}

Styled Handwritten Text Generation with Transformers (ICCV 21)

Related tags

Overview

⚡ Handwriting Transformers [PDF]

Software environment

Setup & Training

Citation

Owner

Ankan Kumar Bhunia

A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.

LightNet++: Boosted Light-weighted Networks for Real-time Semantic Segmentation

Python module providing a framework to trace individual edges in an image using Gaussian process regression.

PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)

The codebase for Data-driven general-purpose voice activity detection.

Contenido del curso Bases de datos del DCC PUC versión 2021-2

Pytorch Implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation"

Art Project "Schrödinger's Game of Life"

Deep Learning Based EDM Subgenre Classification using Mel-Spectrogram and Tempogram Features"

A PyTorch implementation of the architecture of Mask RCNN

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

Sketch-Based 3D Exploration with Stacked Generative Adversarial Networks

TJU Deep Learning & Neural Network

An Easy-to-use, Modular and Prolongable package of deep-learning based Named Entity Recognition Models.

Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)

Code for our paper "Interactive Analysis of CNN Robustness"

Code to reproduce the experiments in the paper "Transformer Based Multi-Source Domain Adaptation" (EMNLP 2020)

Public implementation of the Convolutional Motif Kernel Network (CMKN) architecture

Sarus implementation of classical ML models. The models are implemented using the Keras API of tensorflow 2. Vizualization are implemented and can be seen in tensorboard.

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch