Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

Last update: Dec 12, 2022

Comments

About the MTCNN face detections and preprocessing
Hi,

It would be great if you could clarify a few questions regarding this dataset please.

Is it possible for you to provide the MTCNN output face detections (bounding boxes and facial landmarks) for the face samples in BFW?

Am I right in assuming MTCNN takes as input the images in "face-samples" folder of the dataset? If yes, what settings do we use with MTCNN in order for us to detect a face correctly on all the facial images provided in face-samples? If not, can you help us reproduce your face detection results by providing us with the original images on which the MTCNN was run to obtain the results in face-samples?

Are the images in facial-samples actually crops which are aligned?

Thanks in advance for your help.
opened by manisoftwartist 2
Create a wrapper function to unify pipeline that produces the 3 figures (detailed below) from embedding data
3 Figures based on the paper "Face Recognition: Too Bias, or Not Too Bias" are

DET curves: FPR versus FNR by moving threshold

Score distributions for genuine and imposter using violin plots

Confusion matrix for Rank 1 and any Rank.
opened by suchanv 2
Devel
Develop branch-- prepare for next version release.

Aim for the following for version 0.1.1:

[ ] Notebook is updated to use interface recently modulated (#21)

[ ] Update Documentation to explain steps to run (#21)

[ ] add to README in root

[x] move results from README in root to the README in results/

[x] move data section from README in root to the README in data/

[x] save curve data along with PDF (i.e., in results/)

[ ] Add simple (brief) docstring where missing)

[ ] A sample (toy) set is run end-to-end (demonstrate in README)

[ ] if small enough, add to repo (i.e., < 40 MB or so)

[ ] Finish script to generate Tar @ Far table

[ ] Improve annotation in notebooks; more description, i.e., tutorial-like.

[ ] create pdf versions of notebooks and add to project in notebooks/pdfs (or create nbviewer and point to it)

[ ] add assertions (and tests) where appropriate-- at least critical cases, such a specific type is expected.

[ ] Consider moving some of the analysis functions to visualizations.

[ ] modulate the handling of plt.axes objects

[ ] add optional input arguments for the title and other figure cosmetics or settings

[x] Add benchmarks for sphereface features. Make these the results showcased throughout.

documentation enhancement Benchmark Project-level
opened by visionjo 1
Questions on verification_RFW and training procedure

Hi, Thanks for your great work and sharing of the code on these two papers ! It takes me days to read the paper and go through the repository and I have a few questions:

(2) Do you have the code for training the features (asian_females, asian_males, black_females, black_males, indian_females, indian_males,...). Since I have a hard time finding something like train.py (e.g. the loss function and training process). (I suppose the released code is mainly on image pre-processing and result analysis) (Since BFW dataset is not as large as other face dataset and it may possible for me to train it from scratch on one GPU)

(3) I am little confused about how the BFW is used in two papers, as I understand:

in paper Face Recognition: Too Bias, or Not Too Bias? , the train and test model are as follows: train: CASIA_webface trained using Sphereface loss test: LFW where does BFW dataset not used in training in this set of experiments?

in paper Balancing Biases and Preserving Privacy on Balanced Faces in the Wild the train, test model are as follows: tain: (1) MS1M trained using Arcface loss --> to get 512-dim embedding (f_in in Fig.6) (2) BFW dataset is used to train the encoder and two classifiers in Fig 6 test: 4-folds used for training and 1-fold used for testing (using the best threshold chosen)

is that right?

(4) There are some difference from "bfw-v0.1.5-datatable.csv" and the TABLE-2 in paper 2: for example: there are 921379 records in TABLE-2 while ther are 923898 records from the csv file? and there is no "{dir_meta}thresholds.pkl" file.

Thanks for your time and any help would be appreciated !

opened by lizhenstat 7
Regarding face identification

Hey,

Thanks for the awesome work!

I wanted to know how I can modify the repo to use for face identification task instead of verification.

Any help would be highly appreciated.

opened by shivmgg 1
Sphinx documentation

Setup the project for sphinx.

Include clear instruction on how to maintain (i.e., once in place, we'll include as part of the build process (see in docs/)

Setup for tutorials on the different concepts and experiments done as part of this line of work (i.e., facial bias and BFW database)
documentation enhancement

opened by visionjo 0
Create plan for Dash interface
Project plan (lead: Dylan; support: Rohan):

[ ] what features to include

[ ] Specifications

[ ] Interface layout (use lucidchart or equivalent)

[ ] Division of tasks and proposed timeline

Plan and design Project-level
opened by visionjo 0

Releases(v0.0.3)

v0.0.3(Feb 7, 2020)

Source code(tar.gz)
Source code(zip)

Owner

Joseph P. Robinson

Ph.D., Northeastern, 2020. Focus: applied machine learning, mostly vision. At Vicarious Surgical's ASDAI group, an AI Engineer working on our surgical robot.

GitHub Repository

LabelImg is a graphical image annotation tool.

LabelImgPlus LabelImg is a graphical image annotation tool. This project is not updated with new functions now. More functions are supported with Labe

200 Dec 20, 2022

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

StyleGAN-Human: A Data-Centric Odyssey of Human Generation Abstract: Unconditional human image generation is an important task in vision and graphics,

762 Jan 08, 2023

Implements Stacked-RNN in numpy and torch with manual forward and backward functions

Recurrent Neural Networks Implements simple recurrent network and a stacked recurrent network in numpy and torch respectively. Both flavours implement

1 Nov 16, 2021

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color Overview Code and dataset for The World of an Octopus: H

1 Nov 13, 2021

Crab is a ﬂexible, fast recommender engine for Python that integrates classic information ﬁltering recommendation algorithms in the world of scientiﬁc Python packages (numpy, scipy, matplotlib).

Crab - A Recommendation Engine library for Python Crab is a ﬂexible, fast recommender engine for Python that integrates classic information ﬁltering r

1.2k Dec 21, 2022

PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids

PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids The electric grid is a key enabling infrastructure for the a

19 Jan 07, 2023

A system for quickly generating training data with weak supervision

Programmatically Build and Manage Training Data Announcement The Snorkel team is now focusing their efforts on Snorkel Flow, an end-to-end AI applicat

5.4k Jan 02, 2023

LSTM model trained on a small dataset of 3000 names written in PyTorch

LSTM model trained on a small dataset of 3000 names. Model generates names from model by selecting one out of top 3 letters suggested by model at a time until an EOS (End Of Sentence) character is no

1 Dec 20, 2021

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition With the spirit of reproducible research, this repository contains codes requ

0 Feb 24, 2022

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

B-Pref Official codebase for B-Pref: Benchmarking Preference-BasedReinforcement Learning contains scripts to reproduce experiments. Install conda env

48 Dec 20, 2022

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

Angora Angora is a mutation-based coverage guided fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without s

833 Jan 07, 2023

Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

Related tags

Overview

Face Recognition: Too Bias, or Not Too Bias?

Balanced Faces in the Wild (BFW): Data, Code, Evaluations

Project Overview

Experimental-based contributions and findings

Score sensitivity

Global threshold

All-in-all

Paper abstract

To Do

License

Acknowledgement

Comments

About the MTCNN face detections and preprocessing

Create a wrapper function to unify pipeline that produces the 3 figures (detailed below) from embedding data

Devel

Questions on verification_RFW and training procedure

Regarding face identification

Sphinx documentation

Create plan for Dash interface

Releases(v0.0.3)

v0.0.3(Feb 7, 2020)

Owner

Joseph P. Robinson

LabelImg is a graphical image annotation tool.

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Implements Stacked-RNN in numpy and torch with manual forward and backward functions

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Crab is a ﬂexible, fast recommender engine for Python that integrates classic information ﬁltering recommendation algorithms in the world of scientiﬁc Python packages (numpy, scipy, matplotlib).

PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids

A system for quickly generating training data with weak supervision

LSTM model trained on a small dataset of 3000 names written in PyTorch

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

NBEATSx: Neural basis expansion analysis with exogenous variables

A simple baseline for 3d human pose estimation in PyTorch.

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.

Unsupervised Pre-training for Person Re-identification (LUPerson)

Mmdetection3d Noted - MMDetection3D is an open source object detection toolbox based on PyTorch

Python and Julia in harmony.

Semi-Supervised Signed Clustering Graph Neural Network (and Implementation of Some Spectral Methods)

implement of SwiftNet:Real-time Video Object Segmentation

A collection of loss functions for medical image segmentation