TextBoxes++-TensorFlow

TextBoxes++ re-implementation using tensorflow. This project is greatly inspired by slim project And many functions are modified based on SSD-tensorflow project

Author: Zhisheng Zou [email protected]

pretrained model

Google drive

environment

python2.7/python3.5

tensorflow-gpu 1.8.0

at least one gpu

how to use

Getting the xml file like this example xml and put the image together because we need the format like this standard xml
1. picture format: *.png or *.PNG
Getting the xml and flags ensure the XML file is under the same directory as the corresponding image.execute the code: convert_xml_format.py
1. python tools/convert_xml_format.py -i in_dir -s split_flag -l save_logs -o output_dir
2. in_dir means the absolute directory which contains the pic and xml
3. split_flag means whether or not to split the datasets
4. save_logs means whether to save train_xml.txt
5. output_dir means where to save xmls
Getting the tfrecords
1. python gene_tfrecords.py --xml_img_txt_path=./logs/train_xml.txt --output_dir=tfrecords
2. xml_img_txt_path like this train xml
3. output_dir means where to save tfrecords
Training
1. python train.py --train_dir =some_path --dataset_dir=some_path --checkpoint_path=some_path
2. train_dir store the checkpoints when training
3. dataset_dir store the tfrecords for training
4. checkpoint_path store the model which needs to be fine tuned
Testing
1. python test.py -m /home/model.ckpt-858 -o test
2. -m which means the model
3. -o which means output_result_dir
4. -i which means the test img dir
5. -c which means use which device to run the test
6. -n which means the nms threshold
7. -s which means the score threshold

Note:

when you are training the model, you can run the eval_result.py to eval your model and save the result

Textboxes_plusplus implementation with Tensorflow (python)

Related tags

Overview

TextBoxes++-TensorFlow

pretrained model

environment

how to use

Note:

Owner

This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"

Web interface for browsing arXiv papers

Learn computer graphics by writing GPU shaders!

Text-to-Image generation

A curated list of promising OCR resources

Fast style transfer

TextBoxes: A Fast Text Detector with a Single Deep Neural Network https://github.com/MhLiao/TextBoxes 基于SSD改进的文本检测算法，textBoxes_note记录了之前整理的笔记。

When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework (CVPR 2021 oral)

Brief idea about our project is mentioned in project presentation file.

Page to PAGE Layout Analysis Tool

STEFANN: Scene Text Editor using Font Adaptive Neural Network

A tensorflow implementation of EAST text detector

This is a repository to learn and get more computer vision skills, make robotics projects integrating the computer vision as a perception tool and create a lot of awesome advanced controllers for the robots of the future.

One Metrics Library to Rule Them All!

Table recognition inside douments using neural networks

Isearch (OSINT) 🔎 Face recognition reverse image search on Instagram profile feed photos.

governance proposal to make fei redeemable for eth

Random maze generator and solver

Corner-based Region Proposal Network

Automatically resolve RidderMaster based on TensorFlow & OpenCV