Exploring Machine Learning Models for detecting anomalous behavior in credit-card transactions. It's crucial that credit-card companies are able to recognize fraudulent activity so that customers are not charged for items they didn't purchase.

Last update: Nov 17, 2022

Overview

Credit Card Fraud Detection

Came across this mocked-up dataset of customer transactions at [Capital One Recruitment Challenge](https://github.com/CapitalOneRecruiting/DS).
The unbalanced dataset is comprised of artificial customer transactions with a few outlier cases where fraud was detected. There's only ~1.6% fraudulent cases.
Our primary goal is to successfully predict whether a transaction is Fraudulent or not, and avoid Type-II errors as much as possible as in most sensitive classification problems: we'll try not to point accusatory-fingers at genuine-transactions 😂 .
The secondary goal is to identify interesting anomalies in the transactions like multi-swipes, reversal of suspicious transactions, etc. by performing exploratory-data-analysis.
Most numerical-fields seem to follow Power-law distributions rather than Gaussian distributions.
We'll engineer some time-dependent categorical features by parsing the datetime fields, exclude the fields which have just one categorical value (makes no sense keeping these around 😒 ), and also create a new feature to indicate if credit-card-CVV is wrongly entered.
Baseline classifiers chosen are Logistic Regression, SVM, Random Forest, Isolated Forest.
Performance is kinda poor on these Baseline models: Accuracy, precision, and recall vary greatly across the models.
Moving on Gradient-Boosting models, Light Gradient Boosting is known to perform well on sparse datasets.
Final accuracy achieved hovers around 98%, and recall is approximately 99.99% indicating that False-Negatives are absolutely minimal.

Exploring Machine Learning Models for detecting anomalous behavior in credit-card transactions. It's crucial that credit-card companies are able to recognize fraudulent activity so that customers are not charged for items they didn't purchase.

Related tags

Overview

Credit Card Fraud Detection

Owner

Vikrant Deshpande

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

classify fashion-mnist dataset with pytorch

Codes for AAAI 2022 paper: Context-aware Health Event Prediction via Transition Functions on Dynamic Disease Graphs

Official implementation for the paper: Generating Smooth Pose Sequences for Diverse Human Motion Prediction

なりすまし検出(anti-spoof-mn3)のWebカメラ向けデモ

A Lightweight Experiment & Resource Monitoring Tool 📺

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Public implementation of the Convolutional Motif Kernel Network (CMKN) architecture

PassAPI is a password generator in hash format and fully developed in Python, with the aim of teaching how to handle and build

Jingju baseline - A baseline model of our project of Beijing opera script generation

Artificial Intelligence playing minesweeper 🤖

Consecutive-Subsequence - Simple software to calculate susequence with highest sum

Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

learned_optimization: Training and evaluating learned optimizers in JAX

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

End-to-end image segmentation kit based on PaddlePaddle.

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

Hyperbolic Procrustes Analysis Using Riemannian Geometry