Relative Uncertainty Learning for Facial Expression Recognition

Last update: Dec 28, 2022

Overview

Relative Uncertainty Learning for Facial Expression Recognition

The official implementation of the following paper at NeurIPS2021:
Title: Relative Uncertainty Learning for Facial Expression Recognition
Authors: Yuhang Zhang, Chengrui Wang, Weihong Deng
Institute: BUPT

Abstract

In facial expression recognition (FER), the uncertainties introduced by inherent noises like ambiguous facial expressions and inconsistent labels raise concerns about the credibility of recognition results. To quantify these uncertainties and achieve good performance under noisy data, we regard uncertainty as a relative concept and propose an innovative uncertainty learning method called Relative Uncertainty Learning (RUL). Rather than assuming Gaussian uncertainty distributions for all datasets, RUL builds an extra branch to learn uncertainty from the relative difficulty of samples by feature mixup. Specifically, we use uncertainties as weights to mix facial features and design an add-up loss to encourage uncertainty learning. It is easy to implement and adds little or no extra computation overhead. Extensive experiments show that RUL outperforms state-of-the-art FER uncertainty learning methods in both real-world and synthetic noisy FER datasets. Besides, RUL also works well on other datasets such as CIFAR and Tiny ImageNet.

Pipeline

Feature Visualization

The feature distribution figure shows that RUL encourages intra-class compactness and inter-class seperability of the learned features. (0:Surprise, 1:Fear, 2:Disgust, 3:Happy, 4:Sad, 5:Angry, 6:Neutral)

Train

Torch

We train RUL with Torch 1.8.0 and torchvision 0.9.0.

Dataset

Download RAF-DB, put it into the dataset folder, and make sure that it has the same structure as bellow:

- dataset/raf-basic/
         EmoLabel/
             list_patition_label.txt
         Image/aligned/
	     train_00001_aligned.jpg
             test_0001_aligned.jpg
             ...

Pretrained backbone model

Download the pretrained ResNet18 from this github repository, and then put it into the pretrained_model directory. We thank the authors for providing their pretrained ResNet model.

Train the RUL model

cd src
python main.py --raf_path '../dataset/raf-basic' --label_path '../dataset/raf-basic/EmoLabel/list_patition_label.txt' --pretrained_backbone_path '../pretrained_model/resnet18_msceleb.pth'

Accuracy

Acknowledgments

Our work is based on the following works, thanks for their code and pretrained model:

https://github.com/kaiwang960112/Self-Cure-Network

https://github.com/Ontheway361/dul-pytorch

https://github.com/amirhfarzaneh/dacl

Relative Uncertainty Learning for Facial Expression Recognition

Related tags

Overview

Relative Uncertainty Learning for Facial Expression Recognition

Abstract

Pipeline

Feature Visualization

Train

Owner

Implicit Graph Neural Networks

The Submission for SIMMC 2.0 Challenge 2021

Here is the implementation of our paper S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations.

Vision Transformer and MLP-Mixer Architectures

NLMpy - A Python package to create neutral landscape models

Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds (CVPR 2022)

This repo is to be freely used by ML devs to check the GAN performances without coding from scratch.

Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"

Code for "Multi-Compound Transformer for Accurate Biomedical Image Segmentation"

Style-based Neural Drum Synthesis with GAN inversion

ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton (AAAI 2022)

Using multidimensional LSTM neural networks to create a forecast for Bitcoin price

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

Automatic detection and classification of Covid severity degree in LUS (lung ultrasound) scans

Official Code for "Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning"

code for Fast Point Cloud Registration with Optimal Transport

Algorithmic trading using machine learning.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Implementation of BI-RADS-BERT & The Advantages of Section Tokenization.

Code for the paper "M2m: Imbalanced Classification via Major-to-minor Translation" (CVPR 2020)