Delve is a Python package for analyzing the inference dynamics of your PyTorch model.

Last update: Dec 12, 2022

Overview

Delve: Deep Live Visualization and Evaluation

Delve is a Python package for analyzing the inference dynamics of your model.

Use Delve if you need a lightweight PyTorch extension that:

Gives you insight into the inference dynamics of your architecture
Allows you to optimize and adjust neural network models to your dataset without much trial and error
Allows you to analyze the eigenspaces of your data at different stages of inference
Provides you with basic tooling for experiment logging

Motivation

Designing a deep neural network is a trial-and-error-heavy process that mostly revolves around comparing the performance metrics of different runs. One of the key issues with this development process is that metric results do not easily propagate back to concrete design improvements. Delve provides you with spectral analysis tools that allow you to investigate the inference dynamics evolving in the model during training. This allows you to spot underutilized and unused layers, mismatches between object size and neural architecture, and other inefficiencies. These observations can be propagated back directly to design changes in the architecture even before the model has fully converged, allowing for a quicker and more guided design process.

Installation

pip install delve

Using Layer Saturation to improve model performance

The saturation metric is the core feature of Delve. By default, saturation is a value between 0 and 1.0 computed for any convolutional, LSTM, or dense layer in the network. Saturation describes the fraction of eigendirections required to explain 99% of the variance of a layer's output. Simply speaking, it tells you how much your data is "filling up" the individual layers inside your model.
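The definition above can be illustrated with a minimal sketch. It assumes the eigenvalues of the covariance matrix of one layer's output are already available; the function name, the clipping of negative values and the example spectra are hypothetical and are not taken from Delve's implementation.

```python
def saturation(eigenvalues, threshold=0.99):
    """Fraction of eigendirections needed to explain `threshold` of the variance.

    `eigenvalues` are assumed to be the eigenvalues of the covariance matrix
    of one layer's output; each one is the variance along one eigendirection.
    """
    # Tiny negative values can appear through rounding errors; clip them to zero.
    variances = sorted((max(float(v), 0.0) for v in eigenvalues), reverse=True)
    total = sum(variances)
    if total == 0.0:
        return 0.0
    explained = 0.0
    # Add up the largest variances until the threshold is reached.
    for count, variance in enumerate(variances, start=1):
        explained += variance
        if explained >= threshold * total:
            return count / len(variances)
    return 1.0


# Hypothetical eigenvalue spectra for two layers of width 8.
well_used_layer = [1.0] * 8
tail_layer = [5.0, 0.02, 0.01, 0.0, 0.0, 0.0, 0.0, 0.0]

print(saturation(well_used_layer))  # 1.0
print(saturation(tail_layer))       # 0.125
```

A layer whose variance is spread evenly over all directions is fully saturated, while a layer whose variance sits in a single direction has a saturation close to zero.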

In the image below you can see how saturation reveals inefficiencies in your neural network. The depicted model is ResNet18 trained on 32-pixel images, which is far too small for a model with a receptive field exceeding 400 pixels in the final layers.

To visualize what this poorly chosen input resolution does to the inference, we trained logistic regressions on the output of every layer to solve the same task as the model. You can clearly see that only the first half of the model (at best) improves the intermediate solutions of our logistic regression "probes". The layers that follow contribute nothing to the quality of the prediction! You can also see that saturation is extremely low for these layers!

We call this a tail, and it can be removed either by increasing the input resolution or (more economically) by reducing the receptive field size to match the object size of your dataset.

We can do this by removing the first two downsampling layers, which quarters the growth of the receptive field of your network. This not only reduces the number of parameters but also makes better use of the available parameters, because more layers contribute effectively!
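The effect on the receptive field can be checked with a few lines of arithmetic. The sketch below uses the standard recurrence for a sequential stack without dilation (each layer enlarges the receptive field by (kernel size - 1) times the product of all previous strides). The layer stack is hypothetical, and "removing a downsampling layer" is modelled here as setting its stride to 1, which is an assumption of this example.

```python
def receptive_field(layers):
    """Receptive field size after each (kernel_size, stride) layer of a sequential stack."""
    size = 1  # a single input pixel
    jump = 1  # distance in input pixels between neighbouring outputs
    sizes = []
    for kernel_size, stride in layers:
        size += (kernel_size - 1) * jump
        jump *= stride
        sizes.append(size)
    return sizes


# Hypothetical stack: two downsampling layers followed by four 3x3 convolutions.
with_downsampling = [(3, 2), (3, 2)] + [(3, 1)] * 4
# The same stack with the stride of the first two layers set to 1.
without_downsampling = [(3, 1), (3, 1)] + [(3, 1)] * 4

print(receptive_field(with_downsampling))     # [3, 7, 15, 23, 31, 39]
print(receptive_field(without_downsampling))  # [3, 5, 7, 9, 11, 13]
```

After the first two layers, every 3x3 convolution adds 8 pixels to the receptive field in the first stack but only 2 pixels in the second, which is the quartered growth described above.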

For more details, check our publications on these topics:

Spectral Analysis of Latent Representations
Feature Space Saturation during Training
(Input) Size Matters for CNN Classifiers
Should you go deeper? Optimizing Convolutional Neural Networks without training
Go with the Flow: the distribution of information processing in multi-path networks (soon)

Demo

import torch
from delve import CheckLayerSat
from torch.cuda import is_available
from torch.nn import CrossEntropyLoss
from torchvision.datasets import CIFAR10
from torchvision.transforms import ToTensor, Compose
from torch.utils.data.dataloader import DataLoader
from torch.optim import Adam
from torchvision.models.vgg import vgg16

from tqdm import tqdm

if __name__ == "__main__":

    # setup compute device
    device = "cuda:0" if is_available() else "cpu"

    # Get some data
    train_data = CIFAR10(root="./tmp", train=True,
                         download=True, transform=Compose([ToTensor()]))
    test_data = CIFAR10(root="./tmp", train=False, download=True, transform=Compose([ToTensor()]))

    train_loader = DataLoader(train_data, batch_size=1024,
                              shuffle=True, num_workers=6,
                              pin_memory=True)
    test_loader = DataLoader(test_data, batch_size=1024,
                             shuffle=False, num_workers=6,
                             pin_memory=True)

    # instantiate model
    model = vgg16(num_classes=10).to(device)

    # instantiate optimizer and loss
    optimizer = Adam(params=model.parameters())
    criterion = CrossEntropyLoss().to(device)

    # initialize delve
    tracker = CheckLayerSat("my_experiment", save_to="plotcsv", modules=model, device=device)

    # begin training
    for epoch in range(10):
        model.train()
        for (images, labels) in tqdm(train_loader):
            images, labels = images.to(device), labels.to(device)
            optimizer.zero_grad(set_to_none=True)
            # one forward pass per batch (the earlier duplicate call was redundant)
            with torch.cuda.amp.autocast():
                outputs = model(images)
                loss = criterion(outputs, labels)
            loss.backward()
            optimizer.step()

        total = 0
        test_loss = 0
        correct = 0
        model.eval()
        # no gradients are needed for evaluation
        with torch.no_grad():
            for (images, labels) in tqdm(test_loader):
                images, labels = images.to(device), labels.to(device)
                outputs = model(images)
                loss = criterion(outputs, labels)
                _, predicted = torch.max(outputs, 1)

                total += labels.size(0)
                correct += torch.sum((predicted == labels)).item()
                # criterion returns the batch mean, so weight it by the batch size
                test_loss += loss.item() * labels.size(0)

        # add some additional metrics we want to keep track of
        tracker.add_scalar("accuracy", correct / total)
        tracker.add_scalar("loss", test_loss / total)

        # add saturation to the mix
        tracker.add_saturations()

    # close the tracker to finish training
    tracker.close()

Why this name, Delve?

delve (verb):

reach inside a receptacle and search for something
to carry on intensive and thorough research for data, information, or the like

Comments

Refactor covariance matrix calculations:

use only the most current activation

change default sampling rate from B

flatten B, H, W for Conv layers instead of median

Fix calculation of number of eigval by argmax.

Change the CIFAR10 training schedule for faster convergence (the loss barely dropped during training in example_deep.py):

normalization function

batch size

learning rate

h2 sizes (due to different saturation calculations)

bug enhancement
opened by mmarcinkiewicz 11
Idiomatic 1.0 code

I see that there are quite a few constructions which are obsolete in PyTorch 0.4+.

For example: Variable. Plus, it seems that .to(device) is the preferred way to handle the transfer to the GPU (or not, if none is available).

opened by stared 5
Fully convolutional AutoEncoder

Hello,

I have developed a fully convolutional autoencoder and wanted to check the utilization of its convolutional layers (it has no dense layers), but I am not able to do so with this module, even though the documentation says that conv layers are supported.

opened by dawrym 4

TypeError with pytest

Running py.test,

https://github.com/delve-team/delve/blob/6a2b594eb9ce43c38c9f94be8781ea8c57610de2/delve/writers.py#L469 returns a TypeError:

TypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''.

Contents of df.values[0]:

array([list([(tensor(-1.5482e-08, dtype=torch.float64), tensor(-1.3955e-09, dtype=torch.float64)), (tensor(-2.5362e-08, dtype=torch.float64), tensor(-2.5179e-09, dtype=torch.float64)), (tensor(-3.1511e-08, dtype=torch.float64), tensor(-3.6894e-09, dtype=torch.float64)), (tensor(-3.5553e-08, dtype=torch.float64), tensor(-4.1750e-09, dtype=torch.float64)), (tensor(-3.8271e-08, dtype=torch.float64), tensor(-4.4061e-09, dtype=torch.float64)), (tensor(-3.7972e-08, dtype=torch.float64), tensor(-2.7664e-09, dtype=torch.float64)), (tensor(-3.7489e-08, dtype=torch.float64), tensor(-1.7852e-09, dtype=torch.float64)), (tensor(-3.7178e-08, dtype=torch.float64), tensor(-1.3027e-09, dtype=torch.float64))])],
      dtype=object)

bug

opened by justinshenk 2

delve outdated examples

Traceback (most recent call last):
  File "example.py", line 39, in <module>
    "regression/h{}".format(h), "csv", model, device=device, reset_covariance=True,
  File "Z:\delve\delve\torchcallback.py", line 193, in __init__
    self.timeseries_method = timeseries_method
NameError: name 'timeseries_method' is not defined

opened by Saran-nns 2

Does it work with submodules?

Typically I use modules within nn.Sequential or custom-defined modules.

class TwoLayerNet(torch.nn.Module):
    def __init__(self, D_in, H, D_out):
        super(TwoLayerNet, self).__init__()
        self.fc = torch.nn.Sequential(
            torch.nn.Linear(D_in, H),
            torch.nn.Linear(H, D_out)
        )

    def forward(self, x):
        return self.fc(x)

and then layers = model.parameters(). However, I get an error:

Traceback (most recent call last):
  File "example_submodule.py", line 43, in <module>
    stats = CheckLayerSat('regression/h{}'.format(h), layers)
  File "/Users/pmigdal/not_my_repos/delve/delve/main.py", line 50, in __init__
    self.layers = self._get_layers(modules)
  File "/Users/pmigdal/not_my_repos/delve/delve/main.py", line 167, in _get_layers
    for name in modules.state_dict().keys():
AttributeError: 'generator' object has no attribute 'state_dict'

(for full code example, see: https://gist.github.com/stared/b598c03ade397baf3fa03c52bd79e90d)

Does it work with submodules?

opened by stared 2

[JOSS review] Doc nitpicks
Things I noticed while reading the docs:

Spurious indices and tables link on the saturation page.

If I understand correctly then CheckLayerSat is the only way your users should interact with the library. In this case there's no need to include anything else in the API reference. Just focus on the essential API and exclude internal objects.

Broken link on top of Reference page.

I think the home page of your documentation fills a similar role as the GitHub README, in being the first point of interaction for new users where you should put your best foot forward. Right now your README is a lot more polished, so why not just include the README in the documentation home page and save yourself the hassle of maintaining both separately? (E.g. by converting the README to rst, see mpi4jax where we use this pattern.)

I would link more prominently to the integration with the tensorflow playground, which really does a great job of introducing the library! Love the gif.

Links under "dependencies" are broken (and the whole section is unnecessary IMO).

Emphasize more clearly what I should read to understand the theory behind Delve. You mention several papers but I think highlighting a specific one could be helpful.

(This is a part of the ongoing review at openjournals/joss-reviews#3992)
opened by dionhaefner 1
[JOSS review] API

I wonder if CheckLayerSat is really the best name for your main tracker object. The imperative sounds more like a function name to me, and Sat is so overloaded that it's not obvious what it stands for. I would probably use something like SaturationTracker or so.

But I understand that changing names in the public API can be a pain, so if you insist on keeping it, that's fine with me.

(This is a part of the ongoing review at openjournals/joss-reviews#3992)

opened by dionhaefner 1
[JOSS review] Test coverage

I suggest adding a service like codecov to see how much is actually covered by tests, and adding a badge to the README. There's no shame in not reaching 100% coverage, but if you don't measure it you won't know whether your tests work as intended.

(This is a part of the ongoing review at openjournals/joss-reviews#3992)

opened by dionhaefner 1
[JOSS review] Incorrect qualifiers
Qualifiers in setup.py:

'Programming Language :: Python :: 3.4', 'Programming Language :: Python :: 3.5', 'Programming Language :: Python :: 3.6',

But since you have python_requires='>=3.6', this should probably be something like 3.6 through 3.10.
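A minimal sketch of what the suggested fix could look like, assuming the classifier list is assembled in setup.py. The variable names are hypothetical and are not taken from Delve's actual setup.py.

```python
# Hypothetical excerpt for setup.py; the names are illustrative only.
PYTHON_REQUIRES = ">=3.6"
SUPPORTED_MINORS = range(6, 11)  # Python 3.6 through 3.10

# One classifier per supported minor version, consistent with PYTHON_REQUIRES.
classifiers = [
    f"Programming Language :: Python :: 3.{minor}" for minor in SUPPORTED_MINORS
]

print(classifiers[0])   # Programming Language :: Python :: 3.6
print(classifiers[-1])  # Programming Language :: Python :: 3.10
```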

(This is a part of the ongoing review at openjournals/joss-reviews#3992)
opened by dionhaefner 1
[JOSS review] Pinning Pytorch

Is it really necessary to pin Pytorch to ==1.9.0? Seems quite restrictive to me, and makes the package harder to install (because if you e.g. do pip install delve and then pip install torchvision it gets overwritten again).

(This is a part of the ongoing review at https://github.com/openjournals/joss-reviews/issues/3992)

opened by dionhaefner 1

ConvTranspose2d layers not being tracked

class simple(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 32, 3)
        self.deconv1 = nn.ConvTranspose2d(32, 3, 3)

simple_model = simple()
tracker2 = CheckLayerSat("my_experiment", save_to="plotcsv", modules=simple_model, device=image.device)

output:

added layer conv1
Skipping deconv1

This is an awesome tool, but I'd love to see how well the decoder part of my autoencoder works.

opened by marthinwurer 6

Releases(v0.1.49)

v0.1.49(Jan 17, 2022)

Rename CheckLayerSat

v0.1.48(Oct 12, 2021)

Minor fixes

v0.1.45(Aug 22, 2021)

0.1.45(Aug 22, 2021)

0.1.44(Mar 7, 2021)

Plotting is now executed on stats computed at training and evaluation time.

0.1.43(Jan 24, 2021)

Fixed a breaking bug that kept delve from plotting. Updated the documentation and example.py to be more consistent with the current state and usage of the framework (still much to do, but it's progress).

0.1.42(Nov 20, 2020)

Fixed a bug that required pandas and matplotlib to be preinstalled in order to install delve without a crash. Dependencies are now resolved correctly.

0.1.41c(Apr 13, 2020)

0.1.41b(Apr 12, 2020)

Added multiple optional metrics for measuring the variance of the latent representation.

0.1.40(Mar 15, 2020)

Plots no longer shrink with each epoch when recording layer saturation.

The torchcallback now features stop() and resume() functions, which stop and resume the recording of stats and the aggregation of saturation values.

0.1.39(Mar 11, 2020)

0.1.38(Mar 6, 2020)

0.1.37(Mar 6, 2020)

It is now possible to downsample feature maps to a minimum size in order to keep computation costs as low as possible.

0.1.36(Mar 5, 2020)

It is now possible to resume the training by setting the initial_epoch parameter.

0.1.35(Mar 1, 2020)

Tripled the performance by removing redundant covariance computation.

0.1.34(Feb 28, 2020)

It is now possible to provide a list of writers or corresponding string keys for the "save_to" parameter.

It is now possible to save the covariance matrix; however, only the npy-writer supports saving the covariance matrix.

Added a utility function that allows reconstructing result csvs with saturation and intrinsic dimensionality on arbitrary thresholds. Works only if the npy save strategy is used for the run.

Adjusted look and configuration of all plots.

Intrinsic dimensionality can be computed.

Cleaned up the different computation strategies for intrinsic dimensionality so that they all work exactly the same now.

Saturation computation is now fully implemented in double precision in order to avoid rounding errors.

0.1.32b(Dec 8, 2019)

1.31.0(Nov 9, 2019)

0.1.30(Oct 29, 2019)

Added channelwise saturation computation, which should be more accurate than mean-saturation.

0.1.29b(Oct 25, 2019)

Reworked the interface of saturation:
It is now possible to pass kwargs over to the writer object
It is now possible to pass writer objects directly instead of a string key
Added another writer, which saves saturation plots in addition to the csv file.

0.1.28(Oct 20, 2019)

Minor changes that enabled saturation to be computed on multiple GPUs.

0.1.27(Sep 28, 2019)

Enabled logging with a limited number of samples. This new feature is off by default and can be enabled using the "max_samples" parameter in the CheckLayerSat constructor. This is useful to gain performance if batch processing is fast on its own.

0.1.26b(Sep 28, 2019)

Enabled logging with a limited number of samples. This new feature is off by default and can be enabled using the "max_samples" parameter in the CheckLayerSat constructor.

0.1.26(Sep 20, 2019)

Added a switch in the constructor for resetting the covariance matrix after each add_saturation() call.

1.25(Aug 26, 2019)

Minor hotfix

v1.23(Jul 15, 2019)

Fixed a bug where add_saturations() would sometimes cause a crash.

1(Jul 15, 2019)

v0.1.23(Jul 15, 2019)

Fixed a bug where add_saturations would sometimes crash.

v0.1.22(Jul 14, 2019)

Removed most of the unused functionality
Added different types of logging (csv and console-out are now alternatives to tensorboard)
Fixed a bug where the system would not search properly through arbitrarily nested modules
Changed the constructor interface of CheckLayerSat() to be more ergonomic in use
Changed the general interface of the functions to be more closely related to Sacred- and MLflow-type interfaces

Owner

Delve

Delve is a library for visualizing layer saturation during neural network training

GitHub Repository https://delve-docs.readthedocs.io

