Modification of convolutional neural net "UNET" for image segmentation in Keras framework

Last update: Nov 02, 2022

Overview

ZF_UNET_224 Pretrained Model

Modification of convolutional neural net "UNET" for image segmentation in Keras framework

Requirements

Python 3.*, Keras 2.1, Tensorflow 1.4

Usage

from zf_unet_224_model import ZF_UNET_224, dice_coef_loss, dice_coef
from keras.optimizers import Adam

model = ZF_UNET_224(weights='generator')
optim = Adam()
model.compile(optimizer=optim, loss=dice_coef_loss, metrics=[dice_coef])

model.fit(...)

Notes

"ZF_UNET_224" Model based on UNET code from following paper: https://arxiv.org/abs/1505.04597
This model used to get 2nd place in DSTL competition: https://www.kaggle.com/c/dstl-satellite-imagery-feature-detection
For training used DICE coefficient: https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient
Input shape for model is 224x224 (the same as for other popular CNNs like VGG or ResNet)
It has 3 input channels (to process standard RGB (BGR) images). You can change it with variable "INPUT_CHANNELS"
In most cases model ZF_UNET_224 is ok to be used without pretrained weights.
This code should work fine on both Theano and Tensorflow backends. Code prepared for Keras 2.1, if you need code for Keras 1.2 then use this link:

Pretrained weights

Download: Weights for Tensorflow backend ~123 MB (Keras 2.1, Dice coef: 0.998)

Weights were obtained with random image generator (generator code available here: train_infinite_generator.py). See example of images from generator below.

Dice coefficient for pretrained weights: ~0.998. See history of learning below:

Comments

Extended example
Hi, I have created extended example based on your repository: https://github.com/mrgloom/keras-semantic-segmentation-example

It also use random colors for foreground and background (not like lighter and darker like here https://github.com/ZFTurbo/ZF_UNET_224_Pretrained_Model/blob/master/train_infinite_generator.py#L24 ), one idea behind it is that in that case network can learn 'shape of object' not just 'thresholding and separating background and foreground', also looks like using random colors make problem harder and network converges slower.

Also I have experienced some problems:

Netwoks not always converges on second run with fixed params even for this toy problem, looks like it depens on random seed.

Dice loss and jaccard loss are harder to train than binary crossentropy, any ideas why? Network architecture is the same just loss differs, I even tried to load trained weights from binary crossentropy loss network and use them in dice loss network which show high dice coef.
opened by mrgloom 8
Deeper network

I know this is not an issue, but I wanted to contact you to know how did you make the network deeper in keras for the DSTL competition using this model?

opened by nassarofficial 6

Tensorflow problem

When I use tensorflow-1.3.0 as backend, I get this kind of error:

builtins.ValueError: Dimension 2 in both shapes must be equal, but are 3 and 32 for 'Assign' (op: 'Assign') with input shapes: [3,3,3,32], [3,3,32,3].

opened by lawlite19 5

preprocess_batch for real data
Here is preprocessing for the batch (looks like 256 should be 255 ;) ) https://github.com/ZFTurbo/ZF_UNET_224_Pretrained_Model/blob/master/zf_unet_224_model.py#L27

Is it ok for real images to use code like this or it should be calculated for entire dataset?

batch=batch-np.mean(batch) batch=batch/np.std(batch)

Also how crucial is impact of data normalization for U-net? In my tests even on this simple synthetic data network doesn't converges if input is not normalized.
opened by mrgloom 2
Applying pretrained weights to 128*128 size image

You have generated pretrained weights for 224224 input size, but I have 128128. How can we use such weights in this situation, but without padding/upsampling 128*128 images. Sorry for silly question - is it worth trying in kaggle salt competition?

opened by Diyago 1
Attribute Error

Traceback (most recent call last): File "train.py", line 11, in import segmentation_models as sm File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/segmentation_models/init.py", line 98, in set_framework(_framework) File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/segmentation_models/init.py", line 68, in set_framework import efficientnet.keras # init custom objects File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/efficientnet/keras.py", line 17, in init_keras_custom_objects() File "/home/melih/anaconda3/envs/ai/lib/python3.6/site-packages/efficientnet/init.py", line 71, in init_keras_custom_objects keras.utils.generic_utils.get_custom_objects().update(custom_objects) AttributeError: module 'keras.utils' has no attribute 'generic_utils'

when I run the code, I got the result below but don't know why there is no generic_utils attribute in the library since there is in the keras.

opened by melih1996 0
How to run the model for 6 input channels?

Is it possible to run the model for 6 input channels? Three inputs in that are RGB values and the other three are metrics I want to pass on into the architecture for my use case.

opened by ShreyaPandita01 2
dice and jaccard metrics

Thanks for the repo. I am wondering why do you use a smoothing factor of 1.0 in both dice and jaccard coefficients? Where does this value comes from? And what about using another smaller value close to zero, e.g. K.epsilon()

opened by tinalegre 3
model.fit step

Hi! I would like to know how I should perform the model.fit instruction. model.fit(trainSet, mask_trainSet, batch_size=20, nb_epoch=1, verbose=1,validation_split=0.2, shuffle=True, callbacks=[model_checkpoint])¿? What I write in callback??

And how should I use the weights if I wan't to use pretained weights??

Thank you very much and sorry for the inconvenience!

opened by AmericaBG 7
How to generate img and mask correctly

I run your code and then find that the img batch has a shape(16,224,224,3),but mask batch has a shape(16,1,224,224). I don't understand it.Can you explain it to me?I use my dataset to train unet and then the dice coef is high，but the real effect is bad.

opened by wong-way 6

Releases(v1.0)

v1.0(Mar 19, 2018)

Weights for Tensorflow backend ~123 MB (Keras 2.1, Dice coef: 0.998)
Source code(tar.gz)
Source code(zip)
zf_unet_224.h5(120.21 MB)

Owner

GitHub Repository

Unicorn can be used for performance analyses of highly configurable systems with causal reasoning

Unicorn can be used for performance analyses of highly configurable systems with causal reasoning. Users or developers can query Unicorn for a performance task.

27 Jan 05, 2023

Generative Adversarial Text-to-Image Synthesis

###Generative Adversarial Text-to-Image Synthesis Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee This is the

883 Dec 31, 2022

NNR conformation conditional and global probabilities estimation and analysis in peptides or proteins fragments

NNR and global probabilities estimation and analysis in peptides or protein fragments This module calculates global and NNR conformation dependent pro

0 Jul 15, 2021

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Energy-based Conditional Generative Adversarial Network (ECGAN) This is the code for the NeurIPS 2021 paper "A Unified View of cGANs with and without

22 May 28, 2022

Python Implementation of algorithms in Graph Mining, e.g., Recommendation, Collaborative Filtering, Community Detection, Spectral Clustering, Modularity Maximization, co-authorship networks.

Graph Mining Author: Jiayi Chen Time: April 2021 Implemented Algorithms: Network: Scrabing Data, Network Construbtion and Network Measurement (e.g., P

3 Mar 03, 2022

Tensorflow AffordanceNet and AffContext implementations

AffordanceNet and AffContext This is tensorflow AffordanceNet and AffContext implementations. Both are implemented and tested with tensorflow 2.3. The

6 Dec 01, 2022

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data.

13 Oct 07, 2022

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking [Paper Link] Abstract In this work, we contribute a new million-scale Un

25 Jan 01, 2023

Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"

Dataset Distillation by Matching Training Trajectories Project Page | Paper This repo contains code for training expert trajectories and distilling sy

256 Jan 05, 2023

The code is for the paper "A Self-Distillation Embedded Supervised Affinity Attention Model for Few-Shot Segmentation"

SD-AANet The code is for the paper "A Self-Distillation Embedded Supervised Affinity Attention Model for Few-Shot Segmentation" [arxiv] Overview confi

9 Nov 07, 2022

PHOTONAI is a high level python API for designing and optimizing machine learning pipelines.

PHOTONAI is a high level python API for designing and optimizing machine learning pipelines. We've created a system in which you can easily select and

57 Nov 12, 2022

COVID-VIT: Classification of Covid-19 from CT chest images based on vision transformer models

COVID-ViT COVID-VIT: Classification of Covid-19 from CT chest images based on vision transformer models This code is to response to te MIA-COV19 compe

17 Dec 30, 2022

Automatic tool focused on deriving metallicities of open clusters

metalcode Automatic tool focused on deriving metallicities of open clusters. Based on the method described in Pöhnl & Paunzen (2010, https://ui.adsabs

2 Dec 13, 2021

tmm_fast is a lightweight package to speed up optical planar multilayer thin-film device computation.

tmm_fast tmm_fast or transfer-matrix-method_fast is a lightweight package to speed up optical planar multilayer thin-film device computation. It is es

26 Dec 11, 2022

Implementation of the CVPR 2021 paper "Online Multiple Object Tracking with Cross-Task Synergy"

Online Multiple Object Tracking with Cross-Task Synergy This repository is the implementation of the CVPR 2021 paper "Online Multiple Object Tracking

54 Oct 15, 2022

BridgeGAN - Tensorflow implementation of Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation.

Bridging the Gap between Label- and Reference based Synthesis(ICCV 2021) Tensorflow implementation of Bridging the Gap between Label- and Reference-ba

8 Jul 13, 2022

Modification of convolutional neural net "UNET" for image segmentation in Keras framework

Related tags

Overview

ZF_UNET_224 Pretrained Model

Requirements

Usage

Notes

Pretrained weights

Comments

Releases(v1.0)

v1.0(Mar 19, 2018)

Owner

Unicorn can be used for performance analyses of highly configurable systems with causal reasoning

Generative Adversarial Text-to-Image Synthesis

NNR conformation conditional and global probabilities estimation and analysis in peptides or proteins fragments

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Python Implementation of algorithms in Graph Mining, e.g., Recommendation, Collaborative Filtering, Community Detection, Spectral Clustering, Modularity Maximization, co-authorship networks.

Tensorflow AffordanceNet and AffContext implementations

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data.

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"

The code is for the paper "A Self-Distillation Embedded Supervised Affinity Attention Model for Few-Shot Segmentation"

PHOTONAI is a high level python API for designing and optimizing machine learning pipelines.

COVID-VIT: Classification of Covid-19 from CT chest images based on vision transformer models

Automatic tool focused on deriving metallicities of open clusters

tmm_fast is a lightweight package to speed up optical planar multilayer thin-film device computation.

Implementation of the CVPR 2021 paper "Online Multiple Object Tracking with Cross-Task Synergy"

BridgeGAN - Tensorflow implementation of Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation.

3D mesh stylization driven by a text input in PyTorch

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

End-to-end Temporal Action Detection with Transformer. [Under review]

The official codes of "Semi-supervised Models are Strong Unsupervised Domain Adaptation Learners".