✨ Real-life Data Analysis and Model Training Workshop by Global AI Hub.

Overview

🎓 Data Analysis and Model Training Course by Global AI Hub

Syllabus:

Day 1

  • What is Data?

  • Multimedia

  • Structured and Unstructured Data

  • Data Types

  • Data Visualization

    • What is Visualization?
    • Tufte's 6 Principles
    • Visualization Types
      • Line Plot
      • Scatter Plot
      • Bar Plot
      • Histogram
      • Pie Charts
      • Heatmap
      • Box Plot
      • What is a Quartile? How is it Calculated?
      • Joint Plot
      • KDE (Kernel Density Estimate)
  • Statistics

    • Descriptive Statistics Concepts
    • The Concept of Skewness
    • Correlation and Correlation Matrix
    • Simpson's Paradox
    • Anscombe's Quartet
    • Data Distribution and Hypothesis Testing
  • Data Distribution

    • Data and Distribution
    • Gaussian (Normal) Distribution
    • t-Distribution
    • Degrees of Freedom
    • Bernoulli Distribution
    • Exponential Distribution
  • Application

    • Pandas Revision
    • Introduction to Data Preprocessing with Pandas
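
As a rough illustration of the Day 1 topics on box plots, quartiles, and skewness, the sketch below computes a five-number summary and the interquartile range. It is not taken from the workshop materials: the `five_number_summary` helper and the sample numbers are made up for illustration, and it uses Python's standard `statistics` module instead of Pandas so that it runs without extra installs.

```python
import statistics


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) for a list of numbers."""
    data = sorted(values)
    # method="inclusive" interpolates linearly between observed values,
    # treating the smallest and largest values as the 0th and 100th percentiles.
    q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return data[0], q1, q2, q3, data[-1]


# Hypothetical sample; 30 is deliberately far from the rest.
sample = [2, 4, 4, 5, 7, 9, 10, 12, 30]

low, q1, median, q3, high = five_number_summary(sample)
iqr = q3 - q1  # interquartile range: the height of the box in a box plot

# A common box-plot convention flags points beyond 1.5 * IQR from the box.
upper_fence = q3 + 1.5 * iqr
flagged = [x for x in sample if x > upper_fence]

# A mean well above the median hints at right (positive) skew.
right_skew_hint = statistics.mean(sample) > median
```

Other quartile conventions exist (for example `method="exclusive"`), so results can differ slightly between tools on small samples.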

Day 2

  • Hypothesis Tests

    • Basic Hypothesis testing
    • P value
    • T test
    • Z test
    • Chi-Square Test
    • Errors in Hypothesis Testing
  • Data Cleaning

    • The 68-95-99.7 Rule and 3 Sigma
    • Outlier, Missing and Duplicate Data and their Detection
    • Z-Score
    • Handling missing values
    • Null vs NaN
    • Pandas Functions for missing values
    • Dimensionality Reduction
    • PCA (Principal Component Analysis)
    • Collinearity (Multicollinearity)
  • Data Transformation

    • Data Conversion Techniques
      • Rounding
      • Scaling
      • Label Encoding
      • One Hot Encoding
      • Stack
      • Melt
      • Shorts
      • Feature Engineering
  • Data Augmentation

    • Aggregation Functions
  • Application

    • Data Visualization with Seaborn
    • Data Preprocessing with Pandas
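
To make the Day 2 data cleaning topics (3 sigma rule, Z-Score, outlier detection) more concrete, here is a minimal sketch of z-score based outlier detection. It is an assumption about how these ideas fit together, not the workshop's own code: `z_scores`, `find_outliers`, the threshold of 3, and the readings are all hypothetical, and the standard library is used in place of Pandas.

```python
import statistics


def z_scores(values):
    """Return the z-score of each value: (x - mean) / standard deviation."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)  # population standard deviation
    return [(x - mean) / std for x in values]


def find_outliers(values, threshold=3.0):
    """Return the values whose absolute z-score exceeds the threshold."""
    return [x for x, z in zip(values, z_scores(values)) if abs(z) > threshold]


# Hypothetical readings clustered around 50, plus one suspicious value.
readings = [50, 52, 48, 51, 49, 50, 53, 47, 50, 51,
            49, 52, 48, 50, 51, 49, 50, 52, 48, 50, 120]

outliers = find_outliers(readings)
cleaned = [x for x in readings if x not in outliers]
```

Under a normal distribution about 99.7% of values fall within 3 standard deviations of the mean, which is why 3 is a common cutoff. On very small samples a single extreme value inflates the standard deviation so much that no point can reach the threshold, so this rule works best with a reasonable number of observations.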

Day 3

  • ML Review

    • What is Machine Learning?
    • Supervised Learning
    • Unsupervised Learning
    • Errors That May Be Encountered in Model Training
    • Tools Used in Data Analysis and Machine Learning
    • End-to-End Machine Learning Project Steps
  • Application

    • Training An End-to-End ML Model with a Real Dataset

Certification

Course completion is certified.
