Python Auto-ML Package for Tabular Datasets

Last update: Nov 20, 2022

Overview

Tabular-AutoML

AutoML Package for tabular datasets

Tabular dataset tuning is now hassle free!

Run one liner command and get best tuning and processed dataset in a go.

Used Python Libraries :

Installation & Usage

Create a Virtual Environment : Tutorial
Clone the repository.
Open the directory with cmd.
Copy this command in terminal to install dependencies.

pip install -r requirements.txt

Installing the requirements.txt may generate some error due to outdated MS Visual C++ Build. You can fix this problem using this.
First check the parser variable that has to be passed with all customizations.

>>> python -m tab_automl.main --help
usage: main.py [-h] -d  -t  -tf  [-p] [-f] [-spd] [-sfd] [-sm]

automl hyper parameters

optional arguments:
  -h, --help            show this help message and exit
  -d , --data-source    File path
  -t , --problem-type   Problem Type , currently supporting *regression* or *classification*
  -tf , --target-feature
                        Target feature inside the data
  -p , --pre-proc       If data processing is required
  -f , --fet-eng        If feature engineering is required
  -spd , --save-proc-data
                        Save the processed data
  -sfd , --save-fet-data
                        Save the feature engineered data
  -sm , --save-model    Save the best trained model

Now run the command with your custom data, problem type and target feature

>> # For Classification Problem >>> python -m tab_automl.main -d "your custom data scource\custom_data.csv" -t "classification" -tf "your_custom_target_feature" -spd "true" -sfd "true" -sm "true"">

>>> # For Regression Problem
>>> python -m tab_automl.main -d "your custom data scource\custom_data.csv" -t "regression" -tf "your_custom_target_feature" -spd "true" -sfd "true" -sm "true"

>>> # For Classification Problem
>>> python -m tab_automl.main -d "your custom data scource\custom_data.csv" -t "classification" -tf "your_custom_target_feature" -spd "true" -sfd "true" -sm "true"

Contributing Guidelines

Coment on the issue on which you want to work.
If you get assigned, fork the repository.
Create a new branch which should be named on your github user_id , e.g. sagnik1511.
Update the changes on that branch.
Create a PR (Pull request) to the main branch of the parent repository.
The PR title should named like this [Issue Number] Heading of the issue.
Describe the changes you have done with proper reasons.

Python Auto-ML Package for Tabular Datasets

Related tags

Overview

Tabular-AutoML

AutoML Package for tabular datasets

Tabular dataset tuning is now hassle free!

Run one liner command and get best tuning and processed dataset in a go.

Installation & Usage

Contributing Guidelines

Contributors

Sagnik Roy : sagnik1511

If you like the project, do ⭐

Also follow me on GitHub , Kaggle , LinkedIn

Thank You for Visiting :)

Owner

Sagnik Roy

The `rtdl` library + The official implementation of the paper

MPI Interest Group on Algorithms on 1st semester 2021

VOGUE: Try-On by StyleGAN Interpolation Optimization

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.

SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking

LightLog is an open source deep learning based lightweight log analysis tool for log anomaly detection.

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Resources complimenting the Machine Learning Course led in the Faculty of mathematics and informatics part of Sofia University.

Setup and customize deep learning environment in seconds.

RCT-ART is an NLP pipeline built with spaCy for converting clinical trial result sentences into tables through jointly extracting intervention, outcome and outcome measure entities and their relations.

Python script that takes an Impulse response .wav and a input .wav to demonstrate audio convolution.

Simple, efficient and flexible vision toolbox for mxnet framework.

implicit displacement field

Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Physics-informed Neural Operator for Learning Partial Differential Equation

Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"

[CVPR 2021] "Multimodal Motion Prediction with Stacked Transformers": official code implementation and project page.

Reimplementation of Dynamic Multi-scale filters for Semantic Segmentation.

Checking fibonacci - Generating the Fibonacci sequence is a classic recursive problem