RaceBERT -- A transformer-based model to predict race and ethnicity from names

Last update: Nov 02, 2022

Installation

pip install racebert

Using a virtual environment is highly recommended. You may also need to install PyTorch separately, as described here: https://pytorch.org/get-started/locally/

Paper

Todo

Usage

RaceBERT predicts race (using the U.S. Census race categories) and ethnicity from names.

from racebert import RaceBERT

model = RaceBERT()

# To predict race
model.predict_race("Barack Obama")

>>> {"label": "nh_black", "score": 0.5196923613548279}

The race categories are:

Race	Label
Non-Hispanic White	nh_white
Hispanic	hispanic
Non-Hispanic Black	nh_black
Asian & Pacific Islander	api
American Indian & Alaskan Native	aian
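
The labels in the table above can be turned into readable text with a small lookup. The following is a minimal sketch that does not require the racebert package; the names `RACE_LABELS` and `describe_race` are hypothetical helpers, not part of the library.

```python
# Mapping copied from the race category table above.
RACE_LABELS = {
    "nh_white": "Non-Hispanic White",
    "hispanic": "Hispanic",
    "nh_black": "Non-Hispanic Black",
    "api": "Asian & Pacific Islander",
    "aian": "American Indian & Alaskan Native",
}


def describe_race(prediction):
    """Format a {"label": ..., "score": ...} dict as readable text.

    Unknown labels are passed through unchanged.
    """
    name = RACE_LABELS.get(prediction["label"], prediction["label"])
    return f"{name} ({prediction['score']:.2f})"


print(describe_race({"label": "nh_black", "score": 0.5196923613548279}))
# Non-Hispanic Black (0.52)
```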

# Predict ethnicity
model.predict_ethnicity("Arjun Gupta")

>>> {"label": "Asian,IndianSubContinent", "score": 0.9612812399864197}

The ethnicity categories are:

Ethnicity
GreaterEuropean,British
GreaterEuropean,WestEuropean,French
GreaterEuropean,WestEuropean,Italian
GreaterEuropean,WestEuropean,Hispanic
GreaterEuropean,Jewish
GreaterEuropean,EastEuropean
Asian,IndianSubContinent
Asian,GreaterEastAsian,Japanese
GreaterAfrican,Muslim
Asian,GreaterEastAsian,EastAsian
GreaterEuropean,WestEuropean,Nordic
GreaterEuropean,WestEuropean,Germanic
GreaterAfrican,Africans
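
The ethnicity labels are comma-separated and appear to run from the broadest group to the most specific one. Assuming that reading is correct, a label can be split into its levels as sketched below; `split_ethnicity` and `broadest_group` are hypothetical helper names, not part of the racebert package.

```python
def split_ethnicity(label):
    """Split a comma-separated ethnicity label into its levels.

    Assumes the levels are ordered from broadest to most specific.
    """
    return label.split(",")


def broadest_group(label):
    """Return the first (assumed broadest) level of an ethnicity label."""
    return split_ethnicity(label)[0]


print(split_ethnicity("GreaterEuropean,WestEuropean,French"))
# ['GreaterEuropean', 'WestEuropean', 'French']
print(broadest_group("Asian,IndianSubContinent"))
# Asian
```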

GPU

If you have a GPU, you can speed up the computation by specifying the CUDA device when you instantiate the model.

from racebert import RaceBERT

model = RaceBERT(device=0)

# predict race in batch
model.predict_race(["Barack Obama", "George Bush"])

>>>
[
        {"label": "nh_black", "score": 0.5196923613548279},
        {"label": "nh_white", "score": 0.8365859389305115}
]

# predict ethnicity in batch
model.predict_ethnicity(["Barack Obama", "George Bush"])
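
For long lists of names, it can help to pick the device automatically and to send the names to the model in fixed-size chunks. The sketch below is one possible approach under stated assumptions: `racebert_kwargs` and `predict_in_chunks` are hypothetical helpers, the chunk size of 256 is arbitrary, and a stand-in predictor is used so the example runs without downloading the model.

```python
import importlib.util


def racebert_kwargs():
    """Return {"device": 0} if PyTorch reports a CUDA device, else {}."""
    if importlib.util.find_spec("torch") is None:
        return {}
    import torch

    return {"device": 0} if torch.cuda.is_available() else {}


def predict_in_chunks(predict_fn, names, chunk_size=256):
    """Call predict_fn on successive slices of names and join the results.

    predict_fn is assumed to take a list of names and return a list of
    prediction dicts, as the batch examples above do.
    """
    results = []
    for start in range(0, len(names), chunk_size):
        results.extend(predict_fn(names[start:start + chunk_size]))
    return results


# Stand-in predictor (not the real model) so the sketch is runnable as-is.
def fake_predict(batch):
    return [{"label": "unknown", "score": 0.0} for _ in batch]


predictions = predict_in_chunks(fake_predict, ["a", "b", "c"], chunk_size=2)
print(len(predictions))  # 3

# With the real package, usage might look like this:
# from racebert import RaceBERT
# model = RaceBERT(**racebert_kwargs())
# predictions = predict_in_chunks(model.predict_race, list_of_names)
```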

HuggingFace

Alternatively, you can work directly with the transformers models hosted on the Hugging Face Hub.

Race Model: https://huggingface.co/pparasurama/raceBERT
Ethnicity Model: https://huggingface.co/pparasurama/raceBERT-ethnicity

Please refer to the transformers documentation for how to load and run them.
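
As a rough illustration, the hosted models could be loaded with the transformers `pipeline` helper. This is a sketch under assumptions, not the package's own loading code: `load_classifier` is a hypothetical helper, and the racebert package may preprocess names before scoring them, so passing a raw string here may give different scores than `RaceBERT().predict_race`.

```python
RACE_MODEL_ID = "pparasurama/raceBERT"
ETHNICITY_MODEL_ID = "pparasurama/raceBERT-ethnicity"


def load_classifier(model_id, device=-1):
    """Build a text-classification pipeline for a model on the Hub.

    Downloads the model weights on first use. In transformers pipelines,
    device=-1 selects the CPU and device=0 selects the first CUDA device.
    """
    # Imported lazily so this module loads even without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model=model_id, device=device)


# Usage (requires network access and `pip install transformers`):
# race_classifier = load_classifier(RACE_MODEL_ID)
# print(race_classifier("Barack Obama"))
```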

Owner

Prasanna Parasurama
