A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Last update: Jan 17, 2022

This repository accompanies our paper contribution to WI2022 in Nürnberg.

Abstract

In recent years, large pre-trained deep neural networks (DNNs) have revolutionized the field of computer vision (CV). Although these DNNs have been shown to be very well suited for general image recognition tasks, application in industry is often precluded for three reasons:

1. large pre-trained DNNs are built on hundreds of millions of parameters, making deployment on many devices impossible,
2. the underlying dataset for pre-training consists of general objects, while industrial cases often consist of very specific objects, such as structures on solar wafers,
3. potentially biased pre-trained DNNs raise legal issues for companies.

As a remedy, we study neural networks for CV that we train from scratch. For this purpose, we use a real-world case from a solar wafer manufacturer. We find that our neural networks achieve performance similar to that of pre-trained DNNs, even though they consist of far fewer parameters and do not rely on third-party datasets.
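
To make the parameter-count argument concrete, the following is a minimal sketch that counts the trainable parameters of a small convolutional network using the standard formulas for convolutional and fully connected layers. The layer list `SMALL_CNN` is a hypothetical example chosen for illustration (grayscale input, three 3x3 convolutions, global average pooling, two output classes); it is not the architecture used in the paper, and the helper names are assumptions of this sketch.

```python
# Hypothetical layer specification for illustration only.
# This is NOT the architecture from the paper.


def conv2d_params(in_channels, out_channels, kernel_size, bias=True):
    """Trainable parameters of a 2D convolution with a square kernel."""
    weights = in_channels * out_channels * kernel_size * kernel_size
    return weights + (out_channels if bias else 0)


def linear_params(in_features, out_features, bias=True):
    """Trainable parameters of a fully connected layer."""
    weights = in_features * out_features
    return weights + (out_features if bias else 0)


# Assumed toy network: grayscale input, three 3x3 convolutions,
# global average pooling (no parameters), then a 2-class linear head.
SMALL_CNN = [
    ("conv", 1, 16, 3),
    ("conv", 16, 32, 3),
    ("conv", 32, 64, 3),
    ("linear", 64, 2),
]


def count_params(layers):
    """Sum the trainable parameters over a list of layer specifications."""
    total = 0
    for kind, *dims in layers:
        if kind == "conv":
            total += conv2d_params(*dims)
        elif kind == "linear":
            total += linear_params(*dims)
        else:
            raise ValueError(f"unknown layer kind: {kind}")
    return total


if __name__ == "__main__":
    print(f"toy CNN parameters: {count_params(SMALL_CNN):,}")
```

A toy network of this kind has on the order of tens of thousands of parameters, several orders of magnitude below the hundreds of millions mentioned above for large pre-trained DNNs.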

Structure of this repository

+-- ImageClassification   | Runner notebook and scripts for the experiments
+-- ReadMe.md             | ReadMe
+-- Results.xlsx          | Results reported in the paper
+-- RunResults            | Detailed logs of the experiment results reported in the paper (IDs correspond to old IDs in the .xlsx file due to procedure)

Releases (v1.0)

v1.0 (Jan 5, 2022)

Owner

Maximilian Harl
