A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.

Last update: Dec 30, 2022

Machine Learning Mindmap / Cheatsheet

Overview

Machine Learning is a subfield of computer science that gives computers the ability to learn without being explicitly programmed. It explores the study and construction of algorithms that can learn from and make predictions on data.
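As a minimal sketch of what "learning from data" and "making predictions" can look like, the snippet below fits a straight line to a handful of made-up points with ordinary least squares and then predicts an unseen value. The data, the helper names `fit_line` and `predict`, and the choice of linear regression are illustrative assumptions and are not taken from the mindmap.

```python
def fit_line(xs, ys):
    """Estimate slope and intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance(x, y) / variance(x)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply the learned line to a new input."""
    slope, intercept = model
    return slope * x + intercept


# Toy training data (made up for illustration): points on y = 2x + 1
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

model = fit_line(xs, ys)          # "learning": parameters come from the data
prediction = predict(model, 4.0)  # "predicting": apply them to an unseen input
```

Nothing in the code states the rule `y = 2x + 1`. The parameters are recovered from the examples, which is the sense in which the program is not explicitly programmed.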

Machine Learning is as fascinating as it is broad in scope. It spans multiple fields in Mathematics, Computer Science, and Neuroscience. This is an attempt to summarize this enormous field in a single PDF file.

Download

Download the PDF here:

https://github.com/dformoso/machine-learning-mindmap/blob/master/Machine%20Learning.pdf

Same, but with a white background:

https://github.com/dformoso/machine-learning-mindmap/blob/master/Machine%20Learning%20-%20White%20BG.pdf

I've built the mindmap with MindNode for Mac. https://mindnode.com

Companion Notebook

This Mindmap/Cheatsheet has a companion Jupyter Notebook that runs through most of the Data Science steps. It can be found at the following link:

https://github.com/dformoso/sklearn-classification

Mindmap on Deep Learning

Here's another mindmap that focuses only on Deep Learning:

https://github.com/dformoso/deeplearning-mindmap

1. Process

Data Science is not a set-and-forget effort, but a process that requires design, implementation, and maintenance. The PDF contains a quick overview of what's involved. Here's a quick screenshot.

2. Data Processing

First, we'll need some data. We must find it, collect it, clean it, and about 5 other steps. Here's a sample of what's required.
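To make the "clean it" step a little more concrete, here is a small sketch that uses only the Python standard library. It drops rows with missing values, normalises text fields, and removes duplicates. The records, the field names, and the `clean_records` helper are hypothetical. A real project would more likely rely on a library such as pandas.

```python
def clean_records(records, required_fields):
    """Drop incomplete rows, normalise strings, and remove duplicates."""
    cleaned = []
    seen = set()
    for row in records:
        # Skip rows where a required field is missing or empty.
        if any(row.get(field) in (None, "") for field in required_fields):
            continue
        # Normalise string values: strip whitespace and lowercase.
        normalised = {
            key: value.strip().lower() if isinstance(value, str) else value
            for key, value in row.items()
        }
        # Use the sorted items as a hashable key to detect duplicates.
        fingerprint = tuple(sorted(normalised.items()))
        if fingerprint in seen:
            continue
        seen.add(fingerprint)
        cleaned.append(normalised)
    return cleaned


# Made-up raw data with typical problems.
raw = [
    {"species": " Setosa ", "petal_length": 1.4},
    {"species": "setosa", "petal_length": 1.4},      # duplicate after normalisation
    {"species": "", "petal_length": 5.1},            # missing label
    {"species": "Virginica", "petal_length": None},  # missing measurement
    {"species": "Virginica", "petal_length": 5.9},
]

clean = clean_records(raw, required_fields=["species", "petal_length"])
```

Normalising before deduplicating matters here: the first two rows only count as duplicates once whitespace and capitalisation have been made consistent.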

3. Mathematics

Machine Learning is a house built on Math bricks. Browse through the most common components, and send your feedback if you see something missing.

4. Concepts

A partial list of the types, categories, approaches, libraries, and methodology.

5. Models

A sampling of the most popular models. Send your comments to add more.

References

I'm planning to build a more complete list of references in the future. For now, these are some of the sources I've used to create this Mindmap.

- Stanford and Oxford Lectures. CS20SI, CS224d.
- Books:
  - Deep Learning - Goodfellow.
  - Pattern Recognition and Machine Learning - Bishop.
  - The Elements of Statistical Learning - Hastie.
- Colah's Blog. http://colah.github.io
- Kaggle Notebooks.
- TensorFlow Documentation pages.
- Google Cloud Data Engineer certification materials.
- Multiple Wikipedia articles.

About Me

Twitter:

https://twitter.com/danielmartinezf

LinkedIn:

https://www.linkedin.com/in/danielmartinezformoso/

Email:

[email protected]

Owner

Daniel Formoso
