Decision tree is the most powerful and popular tool for classification and prediction

Last update: Jan 23, 2022

Overview

Diabetes Prediction Using Decision Tree

Introduction

Decision tree is the most powerful and popular tool for classification and prediction. A Decision tree is a flowchart like tree structure, where each internal node denotes a test on an attribute, each branch represents an outcome of the test, and each leaf node (terminal node) holds a class label.

In this project we build a decsion tree to predict diabetes for Pima Indians dataset with variables such as age, blood, pressure etc

Major Steps

Load the required libraries
Load the data sets using Pandas
Divide the columns to two types of variables dependent and independent variables
Bulding Decision Tree using scikit-learn
Evaluvating the model or classifier
Creating a visual Decision Tree

Group Members

Reference

Decision Tree Classification on Diabetes-Dataset using Python : https://medium.com/@ananya_bt18/decision-tree-classification-on-diabetes-dataset-using-python-scikit-learn-package-f7be624c344e

Decision tree is the most powerful and popular tool for classification and prediction

Related tags

Overview

Diabetes Prediction Using Decision Tree

Introduction

Major Steps

Group Members

Reference

Owner

Arjun U

Firebase + Cloudrun + Machine learning

Predicting diabetes over a five year period using logistic regression and the Pima First-Nation dataset

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

Python implementation of the rulefit algorithm

Causal Inference and Machine Learning in Practice with EconML and CausalML: Industrial Use Cases at Microsoft, TripAdvisor, Uber

Cryptocurrency price prediction and exceptions in python

This repository contains the code to predict house price using Linear Regression Method

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Kaggler is a Python package for lightweight online machine learning algorithms and utility functions for ETL and data analysis.

Test symmetries with sklearn decision tree models

BudouX is the successor to Budou, the machine learning powered line break organizer tool.

Probabilistic time series modeling in Python

Predicting India’s COVID-19 Third Wave with LSTM

MBTR is a python package for multivariate boosted tree regressors trained in parameter space.

The Fuzzy Labs guide to the universe of open source MLOps

[HELP REQUESTED] Generalized Additive Models in Python

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker.

All-in-one web-based development environment for machine learning

A benchmark of data-centric tasks from across the machine learning lifecycle.

Adaptive: parallel active learning of mathematical functions