Python Practicum - prepare for your Data Science interview or get a refresher.

Last update: Jul 27, 2021

Overview

Python-Practicum

Python Practicum - prepare for your Data Science interview or get a refresher.

Data

Data visualization using data on births from the state of North Carolina.
Data manupulation is based on "txhousing.csv"

Jupiter Notebook:

https://github.com/trajceskijovan/Python-Practicum/blob/main/Python%20Practicum.ipynb

Sample Insights

Owner

Jovan Trajceski

MSc. Data Analytics (HKBU, Hong Kong) and CPA (Toronto, Canada) professional with experience in Data Science, Digital Transformation, and Analytics.

GitHub Repository

Modular analysis tools for neurophysiology data

Neuroanalysis Modular and interactive tools for analysis of neurophysiology data, with emphasis on patch-clamp electrophysiology. Functions for runnin

5 Dec 22, 2021

Semi-Automated Data Processing

Perform semi automated exploratory data analysis, feature engineering and feature selection on provided dataset by visualizing every possibilities on each step and assisting the user to make a meanin

1 Jan 17, 2022

PostQF is a user-friendly Postfix queue data filter which operates on data produced by postqueue -j.

11 Nov 24, 2022

A real data analysis and modeling project - restaurant inspections

A real data analysis and modeling project - restaurant inspections Jafar Pourbemany 9/27/2021 This project represents data analysis and modeling of re

2 Aug 21, 2022

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

3.7k Jan 03, 2023

Very useful and necessary functions that simplify working with data

Additional-function-for-pandas Very useful and necessary functions that simplify working with data random_fill_nan(module_name, nan) - Replaces all sp

2 Dec 02, 2021

Automated Exploration Data Analysis on a financial dataset

Automated EDA on financial dataset Just a simple way to get automated Exploration Data Analysis from financial dataset (OHLCV) using Streamlit and ta.

28 Nov 27, 2022

PyPDC is a Python package for calculating asymptotic Partial Directed Coherence estimations for brain connectivity analysis.

Python asymptotic Partial Directed Coherence and Directed Coherence estimation package for brain connectivity analysis. Free software: MIT license Doc

3 Nov 26, 2022

DenseClus is a Python module for clustering mixed type data using UMAP and HDBSCAN

DenseClus is a Python module for clustering mixed type data using UMAP and HDBSCAN. Allowing for both categorical and numerical data, DenseClus makes it possible to incorporate all features in cluste

53 Dec 08, 2022

A DSL for data-driven computational pipelines

"Dataflow variables are spectacularly expressive in concurrent programming" Henri E. Bal , Jennifer G. Steiner , Andrew S. Tanenbaum Quick overview Ne

1.9k Jan 03, 2023

MDAnalysis is a Python library to analyze molecular dynamics simulations.

MDAnalysis Repository README [*] MDAnalysis is a Python library for the analysis of computer simulations of many-body systems at the molecular scale,

933 Dec 28, 2022

CSV database for chihuahua (HUAHUA) blockchain transactions

super-fiesta Shamelessly ripped components from https://github.com/hodgerpodger/staketaxcsv - Thanks for doing all the hard work. This code does only

1 Jan 07, 2022

Two phase pipeline + StreamlitTwo phase pipeline + Streamlit

Two phase pipeline + Streamlit This is an example project that demonstrates how to create a pipeline that consists of two phases of execution. In betw

1 Nov 17, 2021

Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).

PandasVault ⁠— Advanced Pandas Functions and Code Snippets The only Pandas utility package you would ever need. It has no exotic external dependencies

374 Jan 07, 2023

NumPy and Pandas interface to Big Data

Blaze translates a subset of modified NumPy and Pandas-like syntax to databases and other computing systems. Blaze allows Python users a familiar inte

3.1k Jan 05, 2023

Synthetic Data Generation for tabular, relational and time series data.

An Open Source Project from the Data to AI Lab, at MIT Website: https://sdv.dev Documentation: https://sdv.dev/SDV User Guides Developer Guides Github

1.2k Jan 07, 2023

ETL flow framework based on Yaml configs in Python

ETL framework based on Yaml configs in Python A light framework for creating data streams. Setting up streams through configuration in the Yaml file.

18 Jul 06, 2022

A data structure that extends pyspark.sql.DataFrame with metadata information.

MetaFrame A data structure that extends pyspark.sql.DataFrame with metadata info

8 Feb 15, 2022

pipeline for migrating lichess data into postgresql

How Long Does It Take Ordinary People To "Get Good" At Chess? TL;DR: According to 5.5 years of data from 2.3 million players and 450 million games, mo

182 Nov 11, 2022

VHub - An API that permits uploading of vulnerability datasets and return of the serialized data

2 Feb 14, 2022

Python Practicum - prepare for your Data Science interview or get a refresher.

Related tags

Overview

Python-Practicum

Data

Jupiter Notebook:

Sample Insights

Owner

Jovan Trajceski

Modular analysis tools for neurophysiology data

Semi-Automated Data Processing

PostQF is a user-friendly Postfix queue data filter which operates on data produced by postqueue -j.

A real data analysis and modeling project - restaurant inspections

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Very useful and necessary functions that simplify working with data

Automated Exploration Data Analysis on a financial dataset

PyPDC is a Python package for calculating asymptotic Partial Directed Coherence estimations for brain connectivity analysis.

DenseClus is a Python module for clustering mixed type data using UMAP and HDBSCAN

A DSL for data-driven computational pipelines

MDAnalysis is a Python library to analyze molecular dynamics simulations.

CSV database for chihuahua (HUAHUA) blockchain transactions

Two phase pipeline + StreamlitTwo phase pipeline + Streamlit

Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).

NumPy and Pandas interface to Big Data

Synthetic Data Generation for tabular, relational and time series data.

ETL flow framework based on Yaml configs in Python

A data structure that extends pyspark.sql.DataFrame with metadata information.

pipeline for migrating lichess data into postgresql

VHub - An API that permits uploading of vulnerability datasets and return of the serialized data