Data-sets from the survey and analysis

Last update: Jan 26, 2022

Related tags

Overview

bachelor-thesis

"Umfragewerte.xlsx" contains the orginal survey results. "umfrage_alle.csv" contains the survey results but one participant is canceled due to response to the survey before publication time (trial run). Additionally the coloumn names have been changed to a nomenclatur-scheme which is used in the analysis part of the bachelor-thesis, describing the compared maps.

bar_all.py includes var_all.png

bar_laien_vs_erf_vs_exp.py includes bar_mittlere_std

For some plots the installation of "altair" is necessary. Using Sypder (Python 3.9) its necessary to enable the "altair_viewer" (delete the # in line 4) in all_vs_all.py using:

alt.renderers.enable('altair_viewer')

Altair can be installed, along with the example datasets in vega_datasets, using:

$ pip install altair vega_datasets $ pip install altair_viewer

If you are using the conda package manager, the equivalent is:

$ conda install -c conda-forge altair vega_datasets

You might also like...

A collection of learning outcomes data analysis using Python and SQL, from DQLab.

Data Analyst with PYTHON Data Analyst berperan dalam menghasilkan analisa data serta mempresentasikan insight untuk membantu proses pengambilan keputu

6 Oct 11, 2022

This tool parses log data and allows to define analysis pipelines for anomaly detection.

logdata-anomaly-miner This tool parses log data and allows to define analysis pipelines for anomaly detection. It was designed to run the analysis wit

32 Nov 27, 2022

A real data analysis and modeling project - restaurant inspections

A real data analysis and modeling project - restaurant inspections Jafar Pourbemany 9/27/2021 This project represents data analysis and modeling of re

2 Aug 21, 2022

Tools for the analysis, simulation, and presentation of Lorentz TEM data.

ltempy ltempy is a set of tools for Lorentz TEM data analysis, simulation, and presentation. Features Single Image Transport of Intensity Equation (SI

1 Dec 26, 2022

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python 📊

2 May 26, 2022

Project: Netflix Data Analysis and Visualization with Python

Project: Netflix Data Analysis and Visualization with Python Table of Contents General Info Installation Demo Usage and Main Functionalities Contribut

2 Feb 13, 2022

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

3.7k Jan 3, 2023

Flenser is a simple, minimal, automated exploratory data analysis tool.

Flenser Have you ever been handed a dataset you've never seen before? Flenser is a simple, minimal, automated exploratory data analysis tool. It runs

79 Sep 20, 2022

Visions provides an extensible suite of tools to support common data analysis operations

Visions And these visions of data types, they kept us up past the dawn. Visions provides an extensible suite of tools to support common data analysis

168 Dec 28, 2022

Releases(v1.2)

v1.2(Apr 7, 2022)

Source code(tar.gz)
Source code(zip)
maps_ba.zip(271.24 KB)
v.1.1(Mar 25, 2022)

Source code(tar.gz)
Source code(zip)
maps_ba.zip(271.24 KB)
v1.0(Mar 25, 2022)

Source code(tar.gz)
Source code(zip)
maps_ba.zip(271.24 KB)

Owner

GitHub Repository

Scraping and analysis of leetcode-compensations page.

Leetcode compensations report Scraping and analysis of leetcode-compensations page.

96 Jan 01, 2023

Flexible HDF5 saving/loading and other data science tools from the University of Chicago

deepdish Flexible HDF5 saving/loading and other data science tools from the University of Chicago. This repository also host a Deep Learning blog: htt

255 Dec 10, 2022

Elementary is an open-source data reliability framework for modern data teams. The first module of the framework is data lineage.

Data lineage made simple, reliable, and automated. Effortlessly track the flow of data, understand dependencies and analyze impact. Features Visualiza

898 Jan 09, 2023

A probabilistic programming language in TensorFlow. Deep generative models, variational inference.

Edward is a Python library for probabilistic modeling, inference, and criticism. It is a testbed for fast experimentation and research with probabilis

4.7k Jan 09, 2023

MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data.

MetPy MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data. MetPy follows semantic versioni

971 Dec 25, 2022

Hatchet is a Python-based library that allows Pandas dataframes to be indexed by structured tree and graph data.

Hatchet Hatchet is a Python-based library that allows Pandas dataframes to be indexed by structured tree and graph data. It is intended for analyzing

14 Aug 19, 2022

Manage large and heterogeneous data spaces on the file system.

signac - simple data management The signac framework helps users manage and scale file-based workflows, facilitating data reuse, sharing, and reproduc

109 Dec 14, 2022

Port of dplyr and other related R packages in python, using pipda.

Unlike other similar packages in python that just mimic the piping syntax, datar follows the API designs from the original packages as much as possible, and is tested thoroughly with the cases from t

179 Dec 21, 2022

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift This project is composed of two parts: Part1 and Part2

1 Jan 19, 2022

scikit-survival is a Python module for survival analysis built on top of scikit-learn.

scikit-survival scikit-survival is a Python module for survival analysis built on top of scikit-learn. It allows doing survival analysis while utilizi

876 Jan 04, 2023

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Pandaral·lel Without parallelization With parallelization Installation $ pip install pandarallel [--upgrade] [--user] Requirements On Windows, Pandara

2.8k Dec 31, 2022

Pyspark project that able to do joins on the spark data frames.

SPARK JOINS This project is to perform inner, all outer joins and semi joins. create_df.py: load_data.py : helps to put data into Spark data frames. d

1 Dec 14, 2021

Python Practicum - prepare for your Data Science interview or get a refresher.

Python-Practicum Python Practicum - prepare for your Data Science interview or get a refresher. Data Data visualization using data on births from the

1 Jul 27, 2021

Data-sets from the survey and analysis

bachelor-thesis "Umfragewerte.xlsx" contains the orginal survey results. "umfrage_alle.csv" contains the survey results but one participant is cancele

1 Jan 26, 2022

Feature engineering and machine learning: together at last

Feature engineering and machine learning: together at last! Lambdo is a workflow engine which significantly simplifies data analysis by unifying featu

14 Sep 15, 2022

BAyesian Model-Building Interface (Bambi) in Python.

Bambi BAyesian Model-Building Interface in Python Overview Bambi is a high-level Bayesian model-building interface written in Python. It's built on to

861 Dec 29, 2022

Jupyter notebooks for the book "The Elements of Statistical Learning".

This repository contains Jupyter notebooks implementing the algorithms found in the book and summary of the textbook.

369 Dec 30, 2022

PCAfold is an open-source Python library for generating, analyzing and improving low-dimensional manifolds obtained via Principal Component Analysis (PCA).

4 Oct 13, 2022

Data Science Environment Setup in single line

datascienv is package that helps your to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries

55 Dec 16, 2022

BigDL - Evaluate the performance of BigDL (Distributed Deep Learning on Apache Spark) in big data analysis problems

Evaluate the performance of BigDL (Distributed Deep Learning on Apache Spark) in big data analysis problems.

1 Jan 06, 2022

Data-sets from the survey and analysis

Related tags

Overview

bachelor-thesis

You might also like...

A collection of learning outcomes data analysis using Python and SQL, from DQLab.

This tool parses log data and allows to define analysis pipelines for anomaly detection.

A real data analysis and modeling project - restaurant inspections

Tools for the analysis, simulation, and presentation of Lorentz TEM data.

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

Project: Netflix Data Analysis and Visualization with Python

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Flenser is a simple, minimal, automated exploratory data analysis tool.

Visions provides an extensible suite of tools to support common data analysis operations

Releases(v1.2)

v1.2(Apr 7, 2022)

v.1.1(Mar 25, 2022)

v1.0(Mar 25, 2022)

Owner

Scraping and analysis of leetcode-compensations page.

Flexible HDF5 saving/loading and other data science tools from the University of Chicago

Elementary is an open-source data reliability framework for modern data teams. The first module of the framework is data lineage.

A probabilistic programming language in TensorFlow. Deep generative models, variational inference.

MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data.

Hatchet is a Python-based library that allows Pandas dataframes to be indexed by structured tree and graph data.

Manage large and heterogeneous data spaces on the file system.

Port of dplyr and other related R packages in python, using pipda.

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

scikit-survival is a Python module for survival analysis built on top of scikit-learn.

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Pyspark project that able to do joins on the spark data frames.

Python Practicum - prepare for your Data Science interview or get a refresher.

Data-sets from the survey and analysis

Feature engineering and machine learning: together at last

BAyesian Model-Building Interface (Bambi) in Python.

Jupyter notebooks for the book "The Elements of Statistical Learning".

PCAfold is an open-source Python library for generating, analyzing and improving low-dimensional manifolds obtained via Principal Component Analysis (PCA).

Data Science Environment Setup in single line

BigDL - Evaluate the performance of BigDL (Distributed Deep Learning on Apache Spark) in big data analysis problems