TF-deeplab

This is a Tensorflow implementation of DeepLab, compatible with Tensorflow 1.2.1.

Currently it supports both training and testing the ResNet 101 version by converting the caffemodel provided by Jay.

Note that the current version is not multi-scale, i.e. only uses the original resolution branch and discarding all layers of 0.5 and 0.75 resolution.

The caffemodel2npy.py is modified from here, and the deeplab_model.py is modified from here.

Example Usage

Download the prototxt and caffemodel provided by Jay
Convert caffemodel to npy file

python caffemodel2npy.py deploy.prototxt ../deeplab/ResNet101/init.caffemodel ./model/ResNet101_init.npy
python caffemodel2npy.py deploy.prototxt ../deeplab/ResNet101/train_iter_20000.caffemodel ./model/ResNet101_train.npy
python caffemodel2npy.py deploy.prototxt ../deeplab/ResNet101/train2_iter_20000.caffemodel ./model/ResNet101_train2.npy

Convert npy file to tfmodel

python npy2tfmodel.py 0 ./model/ResNet101_init.npy ./model/ResNet101_init.tfmodel
python npy2tfmodel.py 0 ./model/ResNet101_train.npy ./model/ResNet101_train.tfmodel
python npy2tfmodel.py 0 ./model/ResNet101_train2.npy ./model/ResNet101_train2.tfmodel

Test on a single image

python deeplab_main.py 0 single

Test on the PASCAL VOC2012 validation set (you will also want to look at the matlab folder and run EvalSegResults.m after you run the following command)

python deeplab_main.py 0 test

To train on the PASCAL VOC2012 train_aug, run

python deeplab_main.py 0 train

Performance

The converted DeepLab ResNet 101 model achieves mean IOU of 73.296% on the validation set of PASCAL VOC2012. Again, this is only with the original resolution branch, which is likely to be the reason for the performance gap (according to the paper this number should be around 75%).

TODO

Incorporating 0.5 and 0.75 resolution

Tensorflow implementation of DeepLabv2

Related tags

Overview

TF-deeplab

Example Usage

Performance

TODO

Owner

Chenxi Liu

MANO hand model porting for the GraspIt simulator

Trajectory Prediction with Graph-based Dual-scale Context Fusion

This repository contains the code used for Predicting Patient Outcomes with Graph Representation Learning (https://arxiv.org/abs/2101.03940).

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

(Python, R, C/C++) Isolation Forest and variations such as SCiForest and EIF, with some additions (outlier detection + similarity + NA imputation)

[2021][ICCV][FSNet] Full-Duplex Strategy for Video Object Segmentation

Unofficial Tensorflow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxiv.org/abs/2108.09084).

Face Detection & Age Gender & Expression & Recognition

Moer Grounded Image Captioning by Distilling Image-Text Matching Model

PyTorch implementation of our Adam-NSCL algorithm from our CVPR2021 (oral) paper "Training Networks in Null Space for Continual Learning"

Custom studies about block sparse attention.

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

Hybrid Neural Fusion for Full-frame Video Stabilization

MakeItTalk: Speaker-Aware Talking-Head Animation

Object tracking implemented with YOLOv4, DeepSort, and TensorFlow.

This is the code used in the paper "Entity Embeddings of Categorical Variables".

Lightweight tool to perform MITM attack on local network

Code for Graph-to-Tree Learning for Solving Math Word Problems (ACL 2020)

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading