Important dataframe statistics with a single command

Last update: Dec 19, 2021

Overview

quick_eda

Receiving dataframe statistics with one command

Project description

A python package for Data Scientists, Students, ML Engineers and anyone who wants dataframe meta data without the trouble of having to type in numerous commands.

Installation

Use pip to install quick-eda by typing or copying the following command.

pip install quick-eda

License

This package is licensed under BSD Clause 3.

Example usage

Users of the package can import the individual modules from this package, for example:

import quick_eda.df_eda
import quick_eda.column_eda

This loads the submodules quick_eda.df_eda and quick_eda.column_eda. They must be referenced with their full name.

quick_eda.df_eda.df_eda(<df>)
quick_eda.column_eda.column_eda(<column_name>)

An alternative way of importing the submodules is:

from quick_eda import df_eda
from quick_eda import column_eda

This also loads the submodules quick_eda.df_eda and quick_eda.column_eda, and makes them available without their prefix, so they can be used as follows:

df_eda.df_eda(<df>)
column_eda.column_eda(<column_name>)

Yet another variation is to import the desired functions directly:

from quick_eda.df_eda import df_eda
from quick_eda.column_eda import column_eda

Again, this loads the submodules, but makes them directly available:

df_eda(<df>)
column_eda(<column_name>)

Imagine you have a dataframe called pets with the columns name, age and color. You could then run statistics on both the entire dataframe or e.g. the column age with

df_eda(pets)
column_eda(pets, "age")

Source code & further information

The source code is maintained at https://github.com/sveneschlbeck/quick_eda
There are also further information concerning the BSD license model, contributing guidelines and more...

Important dataframe statistics with a single command

Related tags

Overview

quick_eda

Project description

Installation

License

Example usage

Source code & further information

Owner

Sven Eschlbeck

Finds, downloads, parses, and standardizes public bikeshare data into a standard pandas dataframe format

Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).

Automatic earthquake catalog building workflow: EQTransformer + Siamese EQTransformer + PickNet + REAL + HypoInverse

Includes all files needed to satisfy hw02 requirements

VevestaX is an open source Python package for ML Engineers and Data Scientists.

Python package for analyzing sensor-collected human motion data

Minimal working example of data acquisition with nidaqmx python API

Analyzing Earth Observation (EO) data is complex and solutions often require custom tailored algorithms.

songplays datamart provide details about the musical taste of our customers and can help us to improve our recomendation system

Universal data analysis tools for atmospheric sciences

An orchestration platform for the development, production, and observation of data assets.

Sample code for Harry's Airflow online trainng course

Project: Netflix Data Analysis and Visualization with Python

Meltano: ELT for the DataOps era. Meltano is open source, self-hosted, CLI-first, debuggable, and extensible.

The Master's in Data Science Program run by the Faculty of Mathematics and Information Science

Flenser is a simple, minimal, automated exploratory data analysis tool.

Synthetic Data Generation for tabular, relational and time series data.

Pipeline and Dataset helpers for complex algorithm evaluation.

Employee Turnover Analysis

simple way to build the declarative and destributed data pipelines with python