Repository for "Improving evidential deep learning via multi-task learning," published in AAAI2022

Last update: Nov 19, 2022

Overview

Improving evidential deep learning via multi task learning

This is the repository for the AAAI 2022 paper “Improving evidential deep learning via multi-task learning” by Dongpin Oh and Bonggun Shin.

This repository contains the code to reproduce the multi-task evidential neural network (MT-ENet), which adds the Lipschitz MSE loss function to the loss of the evidential regression network (ENet). The Lipschitz MSE loss function can improve the accuracy of the ENet while preserving its uncertainty estimation capability, because it avoids gradient conflict with the NLL loss function, the original loss function of the ENet.
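As a rough illustration of the NLL loss mentioned above, the sketch below evaluates the negative log marginal likelihood used in standard evidential regression (Amini et al., 2020) for a single target. It is a plain-Python sketch under that assumption, not the repository's `EvidentialMarginalLikelihood`; the function name `nig_nll` is hypothetical.

```python
import math


def nig_nll(y, gamma, nu, alpha, beta):
    """Negative log marginal likelihood of y under a Normal-Inverse-Gamma prior.

    Assumes the standard evidential regression form (Amini et al., 2020),
    with nu > 0, alpha > 0 and beta > 0. Hypothetical helper for illustration.
    """
    omega = 2.0 * beta * (1.0 + nu)
    return (
        0.5 * math.log(math.pi / nu)
        - alpha * math.log(omega)
        + (alpha + 0.5) * math.log((y - gamma) ** 2 * nu + omega)
        + math.lgamma(alpha)
        - math.lgamma(alpha + 0.5)
    )


# The loss grows as the prediction gamma moves away from the target y.
close = nig_nll(y=0.0, gamma=0.1, nu=1.0, alpha=2.0, beta=1.0)
far = nig_nll(y=0.0, gamma=3.0, nu=1.0, alpha=2.0, beta=1.0)
```

Only the third term depends on the error `y - gamma`, so a large error can also be absorbed by changing `nu`, `alpha` and `beta`, that is, by raising the predicted uncertainty instead of correcting the prediction.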

Setup

Please refer to "requirements.txt" for the packages required by this repo.

pip install -r requirements.txt

Training the ENet with the Lipschitz-MSE loss: example

from mtevi.mtevi import EvidentialMarginalLikelihood, EvidenceRegularizer, modified_mse
...
net = EvidentialNetwork()  # evidential regression network
nll_loss = EvidentialMarginalLikelihood()  # original ENet loss (NLL)
reg = EvidenceRegularizer()  # evidential regularizer
mmse_loss = modified_mse  # Lipschitz MSE loss
...
for inputs, labels in dataloader:
    # `optimizer` is assumed to be created in the elided setup above
    optimizer.zero_grad()
    gamma, nu, alpha, beta = net(inputs)
    loss = nll_loss(gamma, nu, alpha, beta, labels)
    loss += reg(gamma, nu, alpha, beta, labels)
    loss += mmse_loss(gamma, nu, alpha, beta, labels)
    loss.backward()
    optimizer.step()
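The four network outputs in the loop above can be turned into a prediction and two uncertainty estimates. The sketch below assumes the standard Normal-Inverse-Gamma interpretation from evidential regression (Amini et al., 2020) and works on plain floats; the helper name `nig_uncertainty` is hypothetical and is not part of `mtevi`.

```python
def nig_uncertainty(gamma, nu, alpha, beta):
    """Return (prediction, aleatoric, epistemic) for one set of NIG parameters.

    Assumes the standard evidential regression interpretation:
      prediction = gamma
      aleatoric  = E[sigma^2] = beta / (alpha - 1)
      epistemic  = Var[mu]    = beta / (nu * (alpha - 1))
    Both variances are only defined for alpha > 1.
    """
    if alpha <= 1.0:
        raise ValueError("alpha must be greater than 1 for finite variances")
    if nu <= 0.0 or beta <= 0.0:
        raise ValueError("nu and beta must be positive")
    aleatoric = beta / (alpha - 1.0)
    epistemic = beta / (nu * (alpha - 1.0))
    return gamma, aleatoric, epistemic


prediction, aleatoric, epistemic = nig_uncertainty(0.5, 2.0, 3.0, 4.0)
# -> (0.5, 2.0, 1.0)
```

A larger `nu` (more evidence) shrinks the epistemic term while leaving the aleatoric term unchanged.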

Quick start

Synthetic data experiment.

python synthetic_exp.py

UCI regression benchmark experiments.

python uci_exp_norm.py -p energy

Drug target affinity (DTA) regression task on KIBA and Davis datasets.

python train_evinet.py -o test --type davis -f 0 --evi # ENet
python train_evinet.py -o test --type davis -f 0  # MT-ENet

Gradient conflict experiment on the DTA benchmarks

python check_conflict.py --type davis -f 0 # Conflict between the Lipschitz MSE (proposed) and NLL loss. 
python check_conflict.py --type davis -f 0 --abl # Conflict between the simple MSE loss and NLL loss.
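Gradient conflict between two losses is commonly quantified by the cosine similarity of their gradients with respect to the shared parameters, with a negative value meaning the two losses pull in opposing directions. The sketch below shows that measure on flattened gradient vectors given as plain lists. Whether `check_conflict.py` computes exactly this quantity is an assumption, and the function names are hypothetical.

```python
import math


def cosine_similarity(grad_a, grad_b):
    """Cosine similarity between two flattened gradient vectors."""
    if len(grad_a) != len(grad_b):
        raise ValueError("gradients must have the same length")
    dot = sum(a * b for a, b in zip(grad_a, grad_b))
    norm_a = math.sqrt(sum(a * a for a in grad_a))
    norm_b = math.sqrt(sum(b * b for b in grad_b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero gradient cannot conflict with anything
    return dot / (norm_a * norm_b)


def is_conflicting(grad_a, grad_b):
    """Two gradients conflict when they point in opposing directions."""
    return cosine_similarity(grad_a, grad_b) < 0.0


# Hypothetical gradients of an MSE-type loss and the NLL loss.
grad_mse = [1.0, 2.0]
grad_nll = [-2.0, 0.5]
conflict = is_conflicting(grad_mse, grad_nll)
```

With a framework such as PyTorch, the two vectors would come from differentiating each loss separately with respect to the same parameters and flattening the results.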

Characteristic of the Lipschitz MSE loss

The Lipschitz MSE loss function helps the ENet predict target values more accurately.
It limits the magnitude of its own gradient so that it does not conflict with the NLL loss (the original loss function) when the NLL loss increases the predictive uncertainty of the ENet.
Please check our paper for details.
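The gradient-limiting idea can be sketched as a Huber-style loss: quadratic while the squared error stays below a threshold, linear beyond it, so the gradient with respect to the prediction never exceeds `2 * sqrt(threshold)` in magnitude. This is a simplified illustration, not the repository's `modified_mse`. In the paper the threshold is derived from the evidential parameters, whereas here it is a plain argument, and both function names are hypothetical.

```python
import math


def lipschitz_mse(y, gamma, threshold):
    """Squared error below `threshold`, linear continuation above it."""
    if threshold < 0.0:
        raise ValueError("threshold must be non-negative")
    squared_error = (y - gamma) ** 2
    if squared_error < threshold:
        return squared_error
    # Linear branch; equals `threshold` where the two branches meet.
    return 2.0 * math.sqrt(threshold) * abs(y - gamma) - threshold


def lipschitz_mse_grad(y, gamma, threshold):
    """Derivative of lipschitz_mse with respect to the prediction gamma."""
    error = y - gamma
    if error ** 2 < threshold:
        return -2.0 * error
    return -2.0 * math.sqrt(threshold) * math.copysign(1.0, error)


small = lipschitz_mse(0.0, 1.0, 4.0)        # quadratic branch
large = lipschitz_mse(0.0, 3.0, 4.0)        # linear branch
bound = lipschitz_mse_grad(0.0, 100.0, 4.0)  # gradient stays bounded
```

A plain MSE gradient grows without limit as the error grows, which is what lets it overpower the NLL gradient; the linear branch removes that growth.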

Owner

deargen
