slim-python is a package to learn customized scoring systems for decision-making problems.

Last update: Nov 02, 2022

Related tags

Overview

slim-python is a package to learn customized scoring systems for decision-making problems.

These are simple decision aids that let users make yes-no predictions by adding and subtracting a few small numbers.

SLIM is designed to learn the most accurate scoring system for a given dataset and set of constraints. These models are produced by solving a hard optimization problem that directly optimizes for accuracy, sparsity, and customized constraints (e.g., hard limits on model size, TPR, FPR).

Requirements

slim-python was developed using Python 2.7.11 and CPLEX 12.6.2.

CPLEX

CPLEX is cross-platform commercial optimization tool with a Pytho API. It is freely available to students and faculty members at accredited institutions as part of the IBM Academic Initiative. To get CPLEX:

Join the IBM Academic Initiative. Note that it may take up to a week to obtain approval.
Download IBM ILOG CPLEX Optimization Studio V12.6.1 (or higher) from the software catalog
Install the file on your computer. Note mac/unix users will need to install a .bin file.
Setup the CPLEX Python modules as described here here.

Please check the CPLEX user manual or the CPLEX forums if you have problems installing CPLEX.

Citation

If you use SLIM for academic research, please cite our paper!

@article{
    ustun2015slim,
    year = {2015},
    issn = {0885-6125},
    journal = {Machine Learning},
    doi = {10.1007/s10994-015-5528-6},
    title = {Supersparse linear integer models for optimized medical scoring systems},
    url = {http://dx.doi.org/10.1007/s10994-015-5528-6},
    publisher = { Springer US},
    author = {Ustun, Berk and Rudin, Cynthia},
    pages = {1-43},
    language = {English}
}

slim-python is a package to learn customized scoring systems for decision-making problems.

Related tags

Overview

Requirements

CPLEX

Citation

Owner

Berk Ustun

Penguins species predictor app is used to classify penguins species created using python's scikit-learn, fastapi, numpy and joblib packages.

A project based example of Data pipelines, ML workflow management, API endpoints and Monitoring.

A single Python file with some tools for visualizing machine learning in the terminal.

Azure MLOps (v2) solution accelerators.

Titanic Traveller Survivability Prediction

scikit-multimodallearn is a Python package implementing algorithms multimodal data.

Generate music from midi files using BPE and markov model

A Python package for time series classification

Breast-Cancer-Classification - Using SKLearn breast cancer dataset which contains 569 examples and 32 features classifying has been made with 6 different algorithms

A data preprocessing and feature engineering script for a machine learning pipeline is prepared.

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

Visualize classified time series data with interactive Sankey plots in Google Earth Engine

Relevance Vector Machine implementation using the scikit-learn API.

This is a curated list of medical data for machine learning

Implementations of Machine Learning models, Regularizers, Optimizers and different Cost functions.

Gaussian Process Optimization using GPy

Anytime Learning At Macroscale

Coursera Machine Learning - Python code

A library to generate synthetic time series data by easy-to-use factors and generator

#30DaysOfStreamlit is a 30-day social challenge for you to build and deploy Streamlit apps.