TextBoxes-TensorFlow

TextBoxes re-implementation using tensorflow. This project is greatly inspired by slim project And many functions are modified based on SSD-tensorflow project Later, we will overwrite this project so make it more flexiable and modularized.

Author: Daitao Xing : [email protected] Jin Huang : [email protected]

Progress

2017/ 03/14

data_processing phase finished Test：

1. Download the dataset， put 1/ folder and gt.mat uner ddata/sythtext/ folder（will wirte script）   
2. python datasets/data2record.py    
3. python image_processing.py

output： batch_size * 300 * 300 * 3 image

2017/ 03/17

Finish the design of training(can start training)

python train.py \
--train_dir=${TRAIN_DIR} \
--dataset_dir=${DATASET_DIR} \
--save_summaries_secs=60 \
--save_interval_secs=600 \
--weight_decay=0.0005 \
--optimizer=adam \
--learning_rate=0.001 \
--batch_size=32

Problems to be solved：

1. Need to redesign visualization		
2. image_processing can be improved

Next steps:

traing on other datasets
fine tunes
test
automatic downloading datasets and so on

TextBoxes re-implement using tensorflow

Related tags

Overview

TextBoxes-TensorFlow

Progress

Problems to be solved：

Next steps:

Owner

Gu Xiaodong

textspotter - An End-to-End TextSpotter with Explicit Alignment and Attention

The Open Source Framework for Machine Vision

Characterizing possible failure modes in physics-informed neural networks.

code for our ICCV 2021 paper "DeepCAD: A Deep Generative Network for Computer-Aided Design Models"

A list of hyperspectral image super-solution resources collected by Junjun Jiang

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car

Python library to extract tabular data from images and scanned PDFs

This project proposes a camera vision based cursor control system, using hand moment captured from a webcam through a landmarks of hand by using Mideapipe module

A Vietnamese personal card OCR website built with Django.

STEFANN: Scene Text Editor using Font Adaptive Neural Network

Drowsiness Detection and Alert System

Virtual Zoom Gesture using OpenCV

virtual mouse which can copy files, close tabs and many other features !

Natural language detection

A machine learning software for extracting information from scholarly documents

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

Write-ups for the SwissHackingChallenge2021 CTF.

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.