Conducted ANOVA and Logistic regression analysis using matplot library to visualize the result.

Last update: Feb 06, 2022

Related tags

Overview

Intro-to-Data-Science

Conducted ANOVA and Logistic regression analysis.

Project ANOVA

The main aim of this project is to perform One-Way ANOVA analysis on the given set of data(values in various levels of education) using python. We build a model that outputs the summary and gives anova table. We set hypothesis for the given data and calculate F-statistic. From F-statistic, p-value is calculated. If the p-value is less than significance level, we reject Null hypothesis which refers to that means of all groups are not equal and the observed difference in the means is not due to sampling variability. After performing hypothesis test, we perform multiple pairwise comparisons of different groups using t-test to determine which means are different. In conclusion, we determine whether the mean of various levels of education is same or which levels of education have different means.

Project Logistic regression analysis

The main aim of this project is to perform logistic regression analysis on the given data set that represents whether a given e-mail is spam or not spam. The dataset contains 20 features that are used to determine whether an e-mail is spam or not spam. Before performing logistic regression, we perform feature elimination so that significant feature sets are used in model analysis. After modeling the data, we iterate the model for various threshold probability values and check the values of sensitivity and specificity for various thresholds.

Therefore, our goal is to find the optimal threshold value for which the true positive rate is close to 1 so that we build an optimum classification model that classifies a spam e-mail from ham.

Outline

Abstract
Theory
Exploratory Data analysis
Analysis Results & Explanation
Conclusion

Conducted ANOVA and Logistic regression analysis using matplot library to visualize the result.

Related tags

Overview

Intro-to-Data-Science

Project ANOVA

Project Logistic regression analysis

Outline

Owner

Chris Yuan

pywFM is a Python wrapper for Steffen Rendle's factorization machines library libFM

Short PhD seminar on Machine Learning Security (Adversarial Machine Learning)

Python Automated Machine Learning library for tabular data.

CD) in machine learning projectsImplementing continuous integration & delivery (CI/CD) in machine learning projects

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques

Python Extreme Learning Machine (ELM) is a machine learning technique used for classification/regression tasks.

hgboost - Hyperoptimized Gradient Boosting

scikit-learn models hyperparameters tuning and feature selection, using evolutionary algorithms.

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

fMRIprep Pipeline To Machine Learning

An easier way to build neural search on the cloud

CobraML: Completely Customizable A python ML library designed to give the end user full control

This repository contains the code to predict house price using Linear Regression Method

Both social media sentiment and stock market data are crucial for stock price prediction

Tutorial for Decision Threshold In Machine Learning.

Stacked Generalization (Ensemble Learning)

CorrProxies - Optimizing Machine Learning Inference Queries with Correlative Proxy Models

Machine learning that just works, for effortless production applications

Real-time stream processing for python

🎛 Distributed machine learning made simple.