PyTorch implementation of Face Attention Network

Last update: Dec 09, 2022

Face Attention Network

PyTorch implementation of the face attention network described in Face Attention Network: An Effective Face Detector for the Occluded Faces. The baseline is RetinaNet, following this repo.

Requirements

Python 3
PyTorch 0.4
torchvision
tensorboardX

Installation

Install packages.

sudo apt-get install tk-dev python-tk
pip install cffi
pip install cython
pip install pandas
pip install tensorboardX

Build NMS.

cd Face_Attention_Network/lib
sh build.sh

Create folders.

cd Face_Attention_Network/
mkdir ckpt mAP_txt summary weight

Datasets

Prepare three CSV or TXT files: a training annotations file, a validation annotations file, and a label encoding file.

Annotations format

Two examples are as follows:

$image_path/img_1.jpg x1 y1 x2 y2 label
$image_path/img_2.jpg . . . . .

Images with more than one bounding box should use one row per box. When an image contains no bounding box, set the coordinate and label fields to '.'.
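The row format above can be read with a few lines of Python. This is a minimal sketch, not the repository's own loader: it assumes fields are separated by whitespace (as in the two examples), that image paths contain no spaces, and that coordinates are numeric. The function name parse_annotations is hypothetical.

```python
def parse_annotations(lines):
    """Group boxes by image path (hypothetical helper, not part of the repo).

    Each row is assumed to be: image_path x1 y1 x2 y2 label,
    separated by whitespace. A row of five '.' marks an image with no boxes.
    """
    boxes = {}
    for line_no, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 6:
            raise ValueError(f"line {line_no}: expected 6 fields, got {len(fields)}")
        path, x1, y1, x2, y2, label = fields
        image_boxes = boxes.setdefault(path, [])
        if (x1, y1, x2, y2, label) == (".",) * 5:
            continue  # image is registered, but has no bounding box
        x1, y1, x2, y2 = (float(v) for v in (x1, y1, x2, y2))
        if x2 <= x1 or y2 <= y1:
            raise ValueError(f"line {line_no}: x2 and y2 must exceed x1 and y1")
        image_boxes.append((x1, y1, x2, y2, label))
    return boxes
```

Passing an open file object works as well as a list of strings, since both yield one row per iteration. Checking the files this way before training can surface malformed rows early.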

Label encoding file

A TXT file (classes.txt) is needed to map each label to an ID. Each line contains one label name and its ID. For example:

face 0
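Reading the label encoding file into a dictionary could look like the sketch below. It assumes the name and ID are separated by whitespace, as in the example line, and that IDs are integers. The function name parse_classes is hypothetical and is not taken from the repository.

```python
def parse_classes(lines):
    """Map label name to integer ID (hypothetical helper, not part of the repo).

    Each row is assumed to be: label_name id, separated by whitespace.
    """
    classes = {}
    for line_no, line in enumerate(lines, start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 2:
            raise ValueError(f"line {line_no}: expected 'name id', got {line!r}")
        name, class_id = fields[0], int(fields[1])
        if name in classes:
            raise ValueError(f"line {line_no}: duplicate label {name!r}")
        classes[name] = class_id
    return classes
```

For the single-class file shown above, the result is a dictionary with one entry mapping face to 0.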

Pretrained Model

We use ResNet-18, 34, 50, 101, and 152 as backbones. Download the weights you need and put them in /weight.

resnet18: https://download.pytorch.org/models/resnet18-5c106cde.pth
resnet34: https://download.pytorch.org/models/resnet34-333f7ec4.pth
resnet50: https://download.pytorch.org/models/resnet50-19c8e357.pth
resnet101: https://download.pytorch.org/models/resnet101-5d3b4d8f.pth
resnet152: https://download.pytorch.org/models/resnet152-b121ed2d.pth
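The download step can also be scripted. The sketch below uses only the Python standard library and the URLs listed above. The names RESNET_URLS, weight_path, and fetch_weights are hypothetical, and the default target directory assumes the weight folder created during installation, relative to the current working directory.

```python
import os
import urllib.request

# URLs copied from the list above, keyed by backbone depth.
RESNET_URLS = {
    18: "https://download.pytorch.org/models/resnet18-5c106cde.pth",
    34: "https://download.pytorch.org/models/resnet34-333f7ec4.pth",
    50: "https://download.pytorch.org/models/resnet50-19c8e357.pth",
    101: "https://download.pytorch.org/models/resnet101-5d3b4d8f.pth",
    152: "https://download.pytorch.org/models/resnet152-b121ed2d.pth",
}


def weight_path(depth, weight_dir="weight"):
    """Return the local path where the weights for this depth would be stored."""
    if depth not in RESNET_URLS:
        raise ValueError(f"unsupported depth {depth}; choose from {sorted(RESNET_URLS)}")
    filename = RESNET_URLS[depth].rsplit("/", 1)[-1]
    return os.path.join(weight_dir, filename)


def fetch_weights(depth, weight_dir="weight"):
    """Download the weights for this depth unless the file already exists."""
    target = weight_path(depth, weight_dir)
    if not os.path.exists(target):
        os.makedirs(weight_dir, exist_ok=True)
        urllib.request.urlretrieve(RESNET_URLS[depth], target)
    return target
```

Calling fetch_weights(50) from the repository root would place resnet50-19c8e357.pth in the weight folder, which matches the file name passed to --pretrained in the training command.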

Training

python train.py --csv_train <$path/train.txt> --csv_val <$path/val.txt> --csv_classes <$path/classes.txt> --depth <50> --pretrained resnet50-19c8e357.pth --model_name <model name to save>

Visualization Result

Detection result

Attention maps at different levels (P3~P7)
