STRIVE: Scene Text Replacement In Videos

Dataset Types:

RoboText
SynthText
RealWorld videos

RoboText : Videos of texts collected using navigation robot in indoor environment. The overall duration of these videos is 10hrs+ Each text's background can be extracted from the bottom rectangle of its text rectangle. The orginial unprocessed data is stored as RoboText-OriginalZip.7z. Around 200 preprocessed videos are stored as RoboTextZip1.7z

SynthText : Using unity, we have created paired videos from synthetic scenes. These videos are stored with similar naming convention in drive. File name : SynthText7Zip.7z

Note: Unity bbox are recorded as mirror values, hence the bbox extraction process will be different than other two video types.

Real World videos: We have collected videos using high resolution mobile camera to capture texts in different lighting conditions and motion blur. File name: RealWorld.7z

Preparing data

We have extracted text bounding box from RoboText and Real world videos using AWS Rekognition API. The code available as runAWS.py file. Synthetic videos bbox is recorded in unity environment

Data Preprocessing

Refer to the preprocessing python file for each dataset type to get crop images of text.

Data download

Data can be downloaded from here

Please contact Jeyasri Subramanian( [email protected] ) for any data queries

STRIVE: Scene Text Replacement In Videos

Related tags

Overview

STRIVE: Scene Text Replacement In Videos

Dataset Types:

Preparing data

Data Preprocessing

Data download

Owner

Sequence-to-Sequence learning using PyTorch

A library for hidden semi-Markov models with explicit durations

Pytorch implementation of

PyTorch code of paper "LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering"

Multiview 3D object detection on MultiviewC dataset through moft3d.

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)

Listing arxiv - Personalized list of today's articles from ArXiv

Implementation of the GBST block from the Charformer paper, in Pytorch

A Unified Generative Framework for Various NER Subtasks.

PyTorch implementation of the YOLO (You Only Look Once) v2

Efficient training of deep recommenders on cloud.

CoReNet is a technique for joint multi-object 3D reconstruction from a single RGB image.

Public repository containing materials used for Feed Forward (FF) Neural Networks article.

"Inductive Entity Representations from Text via Link Prediction" @ The Web Conference 2021

FusionNet: A deep fully residual convolutional neural network for image segmentation in connectomics

Pytorch implementation of the paper: "A Unified Framework for Separating Superimposed Images", in CVPR 2020.

Fairness Metrics: All you need to know

Repo for flood prediction using LSTMs and HAND

Implementation of "Semi-supervised Domain Adaptive Structure Learning"