VevestaX is an open source Python package for ML Engineers and Data Scientists.

Last update: Dec 14, 2022

Overview

VevestaX

Track failed and successful experiments as well as features.

VevestaX is an open source Python package for ML engineers and data scientists. It includes modules for tracking features sourced from data, engineered features, and modelling variables. The output is an Excel file with tabs for data sourcing, feature engineering, and modelling. It tracks these values in a Jupyter notebook.

How to install the library:

$ pip install vevestaX

How to import the library and create the object

How to extract features present in input data

How to extract engineered features

How to track variables used in the modelling section of the code

How to dump the features and modelling variables in an xlsx file
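
The steps above can be pictured with a small, library-independent sketch. It does not use VevestaX itself: the FeatureTracker class and all of its method names are hypothetical, and the code only illustrates the idea of recording source columns, engineered columns, and modelling variables, then grouping them into the three tabs named in the overview. Writing the result to an xlsx file is left out.

```python
class FeatureTracker:
    """Hypothetical stand-in for an experiment tracker; NOT the VevestaX API."""

    def __init__(self):
        self.source_features = []
        self.engineered_features = []
        self.variables = {}
        self._baseline = set()

    def track_source(self, rows):
        # Column names of the raw input are treated as the sourced features.
        self.source_features = list(rows[0].keys())

    def track_engineered(self, rows):
        # Columns absent from the raw input count as engineered features.
        self.engineered_features = [
            col for col in rows[0].keys() if col not in self.source_features
        ]

    def start(self, namespace):
        # Remember which names existed before the modelling section.
        self._baseline = set(namespace)

    def end(self, namespace):
        # Keep simple-valued names that were created since start().
        for name, value in namespace.items():
            if name not in self._baseline and isinstance(value, (int, float, str, bool)):
                self.variables[name] = value

    def tabs(self):
        # One entry per tab of the report described in the overview.
        return {
            "data sourcing": self.source_features,
            "feature engineering": self.engineered_features,
            "modelling": self.variables,
        }


# Usage with made-up data.
raw = [{"age": 31, "salary": 52000}, {"age": 45, "salary": 61000}]
tracker = FeatureTracker()
tracker.track_source(raw)

engineered = [dict(r, salary_per_year_of_age=r["salary"] / r["age"]) for r in raw]
tracker.track_engineered(engineered)

scope = {}
tracker.start(scope)
scope["learning_rate"] = 0.1
scope["epochs"] = 20
tracker.end(scope)

report = tracker.tabs()
```

Each key of report corresponds to one tab. For the actual calls, refer to the package's own documentation.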

For additional features, explore our tool at www.vevesta.com

Comments

Create a tab in the Excel file created using V.dump. The tab will contain a random set of rows from the input data (pandas DataFrame)

Create a tab in the Excel file named "data". This tab will contain a randomized snapshot of the input data read from the input file. The snapshot will be taken from the DataFrame assigned via V.ds = df.
enhancement good first issue

opened by Priyanka-Vevesta 0
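
The request above amounts to drawing a random subset of rows from the input data before writing it to a sheet. A minimal sketch of the sampling step, using only the standard library: sample_rows and its parameters are hypothetical names, not part of VevestaX, and writing the result to a tab named "data" is left out.

```python
import random


def sample_rows(rows, n, seed=None):
    """Return up to n rows chosen at random without replacement (hypothetical helper)."""
    rng = random.Random(seed)   # a seed makes the snapshot reproducible
    k = min(n, len(rows))       # never ask for more rows than exist
    return rng.sample(rows, k)


# Made-up input data standing in for the rows of a data frame.
rows = [{"id": i, "value": i * i} for i in range(100)]
snapshot = sample_rows(rows, 5, seed=0)
```

With a pandas DataFrame, df.sample(n=5) plays the same role.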

Releases (v6.8.2)

v6.8.2 (Sep 3, 2022)
Simplified the library interface.
vevestaX-6.8.2-py3-none-any.whl (16.26 KB)

v6.7.0 (Jul 13, 2022)
vevestaX-6.7.0-py3-none-any.whl (16.02 KB)

v6.5.3 (Jul 3, 2022)
vevestaX-6.5.3-py3-none-any.whl (15.52 KB)

v6.5.2 (Jul 1, 2022)
vevestaX-6.5.2-py3-none-any.whl (15.51 KB)

v6.5.1 (Jul 1, 2022)
vevestaX-6.5.1-py2.py3-none-any.whl (15.67 KB)

v6.3.0 (Jun 28, 2022)
Added integration with GitHub.
vevestaX-6.3.0-py2.py3-none-any.whl (15.42 KB)

v5.5.0 (Jun 2, 2022)
3D plots were generated.

v5.4.0 (May 19, 2022)
Added box plots for numeric data.
vevestaX-5.4.0-py3-none-any.whl (13.22 KB)

v5.3.0 (May 18, 2022)
Added the following values to the profiling report: Kurtosis, Skewness, Outliers, Outliers (%), Median, Mode, Q1 quantile, Q2 quantile, Q3 quantile, 100th quantile.
vevestaX-5.3.0-py3-none-any.whl (13.00 KB)

v5.2.0 (May 16, 2022)
With this release, we add another tab for data profiling. The data profile calculates the following values for each variable: Distinct, Distinct (%), Missing, Missing (%), Infinite, Infinite (%), Mean, Minimum, Maximum, Zeros, Zeros (%), Negative, Negative (%), Total, Memory size.
vevestaX-5.2.0-py3-none-any.whl (12.66 KB)

pysparkCorrelation (May 11, 2022)
vevestaX-5.1.0-py3-none-any.whl (12.25 KB)

pysparkIntegration (May 8, 2022)
Integrated with PySpark.
vevestaX-5.0.0-py3-none-any.whl (11.99 KB)

updatedLibraryDependency (Apr 12, 2022)
vevestaX-3.3.0-py3-none-any.whl (11.44 KB)

updatedDependencies (Apr 11, 2022)
vevestaX-3.0.0-py3-none-any.whl (11.43 KB)

colab/kaggle (Apr 7, 2022)
vevestaX-2.9.0-py3-none-any.whl (11.16 KB)

majorbugfix (Apr 3, 2022)
vevestaX-2.8.0-py3-none-any.whl (10.51 KB)

EDA (Apr 2, 2022)
vevestaX-2.7.0-py3-none-any.whl (10.50 KB)

EDA_extended (Apr 1, 2022)
vevestaX-2.6.0-py3-none-any.whl (10.35 KB)

updatedContent (Mar 27, 2022)
vevestaX-2.5.0-py3-none-any.whl (9.91 KB)

messagesUpdated (Mar 26, 2022)
vevestaX-2.3.0-py3-none-any.whl (9.77 KB)

correlation-plot (Mar 23, 2022)
Added EDA-correlation to the output.
vevestaX-2.1.0-py3-none-any.whl (9.69 KB)

performance-plots (Mar 9, 2022)

mlops (Nov 3, 2021)
Library works with Spyder.
vevestaX-1.0.0-py3-none-any.whl (7.10 KB)

Owner

Vevesta
