Code of PureGaze: Purifying Gaze Feature for Generalizable Gaze Estimation, AAAI 2022.

Last update: Dec 05, 2022


Overview

PureGaze: Purifying Gaze Feature for Generalizable Gaze Estimation

Description

Our work has been accepted by AAAI 2022.

Picture: We propose a domain-generalization framework for gaze estimation. Our method is trained only on the source domain and improves performance in all unknown target domains. The key idea is to purify the gaze feature with a self-adversarial framework.

Picture: Overview of the gaze feature purification. Our goal is to preserve gaze-relevant features and eliminate gaze-irrelevant features. We define two tasks: preserving gaze information and removing general facial image information. The two tasks are adversarial rather than cooperative. By optimizing them simultaneously, we implicitly purify the gaze feature without having to define gaze-irrelevant features explicitly.
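
The adversarial relationship between the two tasks can be illustrated with a toy sketch. This is not the repository's implementation: the function names, the L1 distances, the `1 - error` form of the adversarial term, and the weight `alpha` are all assumptions chosen for illustration. The sketch only shows that a feature extractor trained with such an objective is rewarded when the image becomes harder to reconstruct from its feature, while gaze accuracy is still enforced.

```python
# Toy illustration of two adversarial objectives (not the repository's code).
# All names and the exact loss forms are assumptions for illustration.

def l1(a, b):
    """Mean absolute difference between two equal-length sequences."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

def gaze_loss(pred_gaze, true_gaze):
    # Task 1: preserve gaze information (regress gaze accurately).
    return l1(pred_gaze, true_gaze)

def reconstruction_loss(recon, image):
    # The reconstruction branch tries to rebuild the image from the feature.
    return l1(recon, image)

def adversarial_loss(recon, image):
    # Task 2: the feature extractor wants reconstruction to fail,
    # so its loss shrinks as the reconstruction error grows.
    # Pixel values are assumed to lie in [0, 1], so the error is at most 1.
    return 1.0 - l1(recon, image)

def extractor_loss(pred_gaze, true_gaze, recon, image, alpha=1.0):
    # Objective of the feature extractor: accurate gaze, poor reconstruction.
    return gaze_loss(pred_gaze, true_gaze) + alpha * adversarial_loss(recon, image)

image = [0.2, 0.8, 0.5, 0.1]
good_recon = [0.2, 0.8, 0.5, 0.1]   # feature still encodes the image
bad_recon = [0.5, 0.5, 0.5, 0.5]    # feature carries little image information
pred, true = [0.10, -0.20], [0.12, -0.25]

loss_leaky = extractor_loss(pred, true, good_recon, image)
loss_pure = extractor_loss(pred, true, bad_recon, image)
print(loss_leaky > loss_pure)  # the extractor prefers the less reconstructable feature
```

In this toy setup the gaze term is identical in both cases, so the difference between the two losses comes entirely from how well the image can be rebuilt from the feature.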

Performance: PureGaze shows the best performance among typical gaze estimation methods (w/o adaptation) and achieves competitive results compared with domain adaptation methods. Note that PureGaze learns one model for all four tasks, while domain adaptation methods need to learn four models in total. This is an advantage of PureGaze.

Feature visualization: The result illustrates the purification. Our purified feature contains fewer gaze-irrelevant features, which naturally improves the cross-domain performance.

Usage

This is a re-implementation in PyTorch 1.7.1 (the original version used PyTorch 1.0.1).

We provide a Res50 version of PureGaze. If you want to change the backbone to Res18, use the file in Model/Res18.

Resources

Model/: Implemented code.
Masker/: The masker used for training.

Get Started

You can find the data processing code from this link.
Modify the files in the config/ folder, then run commands like:

Training: python trainer/total.py -c config/train/config-eth.yaml

Test: python tester/total.py -s config/train/config-eth.yaml -t config/test/config-mpii.yaml

Visual: python tester/visual.py -s config/train/config-eth.yaml -t config/test/config-mpii.yaml
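
When evaluating one source model on several target domains, the commands above can be assembled programmatically. The following is a minimal sketch, not part of the repository: the helper `build_commands` is hypothetical, and `config/test/config-other.yaml` is a placeholder name for a second target config that you would have to provide yourself. Only the script paths and the `-c`, `-s`, and `-t` flags come from the commands listed here.

```python
# Build the train/test command lines for one source config and several
# target configs. The helper and the second target path are hypothetical.
import shlex

def build_commands(source_cfg, target_cfgs):
    """Return argv lists: one training run, then one test run per target."""
    commands = [["python", "trainer/total.py", "-c", source_cfg]]
    for target in target_cfgs:
        commands.append(
            ["python", "tester/total.py", "-s", source_cfg, "-t", target]
        )
    return commands

source = "config/train/config-eth.yaml"
targets = [
    "config/test/config-mpii.yaml",
    "config/test/config-other.yaml",  # hypothetical second target config
]

commands = build_commands(source, targets)
for argv in commands:
    # shlex.join quotes arguments safely for display (Python 3.8+).
    print(shlex.join(argv))
    # To actually run: subprocess.run(argv, check=True) from the repository root.
```

Keeping the commands as argument lists rather than shell strings avoids quoting problems if a config path ever contains spaces.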

Pre-trained model

We provide a pre-trained model of Res50-version PureGaze. You can find it from this link.

Citation

@article{cheng2022puregaze,
  title={PureGaze: Purifying Gaze Feature for Generalizable Gaze Estimation},
  author={Yihua Cheng and Yiwei Bao and Feng Lu},
  journal={Proceedings of the AAAI Conference on Artificial Intelligence},
  year={2022}
}

Contact

Please email [email protected].
