IMGUR5K handwriting set. It is a handwritten in-the-wild dataset, which contains challenging real world handwritten samples from different writers.The dataset is shared as a set of image urls with annotations. This code downloads the images and verifies the hash to the image to avoid data contamination.

Last update: Dec 26, 2022

Overview

IMGUR5K Handwriting Dataset

To run the code for downloading the urls and generate corresponding annotations :

Usage: python download_imgur5k.py --dataset_info_dir <dir_with_annotaion_and_hashes> --output_dir <path_to_store_images>

Requirements

IMGUR5K download code works with

Python3

Downloading images of IMGUR5K

Run the command and set <path_to_store_images> to the target image directory

How IMGUR5K download works

The code checks the validity of urls by checking the hash of the url with the groundtruth md5 hash. If the image is pristine, the annotations are added to the generated annotations file and the respective splits.

Full documentation

IMGUR5K is shared as a set of image urls with annotations. This code downloads the images and verifies the hash to the image to avoid data contamination.

REQUIRED FILES:

download_imgur5k.py : Code to download the URLs for the dataset building.
<dataset_info_dir>/imgur5k_data.lst : File containing URLs with annotations and bounding box
<dataset_info_dir>/imgur5k_hashes.lst : File containins URL indexes with groundtruth md5 hash.
<dataset_info_dir>/train_index_ids.lst : File containins URL indexes belonging to train split.
<dataset_info_dir>/val_index_ids.lst : File containins URL indexes belonging to val split.
<dataset_info_dir>/test_index_ids.lst : File containins URL indexes belonging to test split.

Output:

<path_to_store_images>/.jpg :
- Images dowloaded to output_dir
imgur5k_annotations.json :
- json file with image annotation mappings -> dowloaded to dataset_info_dir
  - Format: { "index_id" : {indexes}, "index_to_annotation_map" : { annotations ids for an index}, "annotation_id": { each annotation's info } }
  - Annotation ID: bounding_box in xywha format
  - Bounding boxes with '.' mean the annotations were not done for various reasons
imgur5k_annotations_train.json :
- json file with image annotation mappings of TRAIN split only -> dowloaded to dataset_info_dir
imgur5k_annotations_val.json :
- json file with image annotation mappings of VAL split only -> dowloaded to dataset_info_dir
imgur5k_annotations_test.json :
- json file with image annotation mappings of TEST split only -> dowloaded to dataset_info_dir

[All imgur5k_annotations_*.json's format is similar to the format of imgur5k_annotations.json]

NOTE: Apart from the ~5K images employed in TextStyleBrush paper, ~4K more images are added to the dataset to foster the research in Handwritten Recognition.

Contribution

See the CONTRIBUTING file for how to help out.

License

IMGUR5K is Creative Commons Attribution-NonCommercial 4.0 International Public licensed, as found in the LICENSE file.

Related tags

Overview

IMGUR5K Handwriting Dataset

Requirements

Downloading images of IMGUR5K

How IMGUR5K download works

Full documentation

Contribution

License

Owner

Facebook Research

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

Neural search engine for AI papers

A python program to block out your face

Python Computer Vision application that allows users to draw/erase on the screen using their webcam.

Introduction to image processing, most used and popular functions of OpenCV

Histogram specification using openCV in python .

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Ddddocr - 通用验证码识别OCR pypi版

governance proposal to make fei redeemable for eth

Autonomous Driving project for Euro Truck Simulator 2

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

OpenCVを用いたカメラキャリブレーションのサンプルです。2021/06/21時点でPython実装のある3種類(通常カメラ向け、魚眼レンズ向け(fisheyeモジュール)、全方位カメラ向け(omnidirモジュール))について用意しています。

A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

Optical character recognition for Japanese text, with the main focus being Japanese manga

A curated list of papers, code and resources pertaining to image composition

Controlling Volume by Hand Gestures

DouZero is a reinforcement learning framework for DouDizhu - 斗地主AI

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)

Sort By Face

A set of workflows for corpus building through OCR, post-correction and normalisation