Statistical Rethinking: A Bayesian Course Using CmdStanPy and Plotnine

Last update: Nov 08, 2022

Intro

This repo contains the Python/Stan version of the Statistical Rethinking course that Professor Richard McElreath taught at the Max Planck Institute for Evolutionary Anthropology in Leipzig during the winter of 2019/2020. The original repo for the course, from which this repo is forked, can be found here. The course consists of 20 lectures spread over 10 weeks, with a set of assignments for each week. It is an excellent introduction to Bayesian modelling in general and to Professor McElreath's wonderful book, Statistical Rethinking.

How to use this repo

There are ten Jupyter notebooks, one for each week of the course. At the beginning of each notebook there are links to the YouTube videos of the lectures, the slides used, and the original homework questions and answers in R.

I would suggest using this repo as follows:

Go to the notebook of the week.
Watch the two videos for the lectures of that week. Their URLs are at the very top of each notebook.
Read the original problems presented to the students and try to solve them on your own.
Follow the exercise solutions in the notebook, with my code and the explanations by Professor McElreath.

Installing `CmdStanPy`

The Stan code is executed through CmdStanPy, a lightweight pure-Python interface to CmdStan that provides access to the Stan compiler and all inference algorithms. It provides the function install_cmdstan(), which downloads CmdStan from GitHub and builds the CmdStan utilities. It can be called from within Python or from the command line.

import cmdstanpy
cmdstanpy.install_cmdstan()

You can find more information about the installation process here.
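Once CmdStan is installed, a quick way to check that everything works is to compile and sample from a very small model. The following is only a minimal sketch and is not taken from the notebooks: the Stan program, the file name globe.stan and the helper names write_stan_file and fit_globe are assumptions made for illustration. It uses the globe-tossing setup from the book (6 water observations in 9 tosses) with a flat prior on the proportion p.

```python
from pathlib import Path
import tempfile

# Illustrative Stan program (an assumption, not code from the notebooks):
# W "water" observations out of N globe tosses, unknown proportion p.
STAN_PROGRAM = """
data {
  int<lower=0> N;
  int<lower=0, upper=N> W;
}
parameters {
  real<lower=0, upper=1> p;
}
model {
  p ~ uniform(0, 1);
  W ~ binomial(N, p);
}
"""


def write_stan_file(directory: Path) -> Path:
    """Write the Stan program to disk; CmdStanModel compiles from a file."""
    stan_file = directory / "globe.stan"
    stan_file.write_text(STAN_PROGRAM)
    return stan_file


def fit_globe(n_tosses: int = 9, n_water: int = 6):
    """Compile the model and return posterior draws of p.

    Requires CmdStanPy and a working CmdStan installation.
    """
    # Imported lazily so the rest of this file can be used
    # before CmdStanPy is installed.
    from cmdstanpy import CmdStanModel

    with tempfile.TemporaryDirectory() as tmp:
        model = CmdStanModel(stan_file=str(write_stan_file(Path(tmp))))
        fit = model.sample(
            data={"N": n_tosses, "W": n_water},
            chains=4,
            seed=1,
        )
        # stan_variable returns a NumPy array of draws for the parameter.
        return fit.stan_variable("p")
```

Calling fit_globe().mean() should give a value close to 7/11 (about 0.64), the mean of the analytical Beta(7, 4) posterior for this data under a flat prior. If compilation fails, the installation step above is the first thing to revisit.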

Other useful resources

There are many useful resources for Bayesian statistical modelling out there. Among those centered specifically on Professor McElreath's work, I would mention:

Original repo for the course.
Original rethinking package repo.

Copyright

The present work is a derivative work of Statistical Rethinking: A Bayesian Course Using python and pymc3 by Gabriel Bosque Chacon and Statistical Rethinking: A Bayesian Course Using Python and NumPyro by Andrés Suárez. I wrote the Stan code, made the plotnine figures, and made slight modifications to his comments.

Owner

Andrés Suárez