A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Last update: Nov 05, 2022

Related tags

Overview

Feature Forge

This library provides a set of tools that can be useful in many machine learning applications (classification, clustering, regression, etc.), and particularly helpful if you use scikit-learn (although this can work if you have a different algorithm).

Most machine learning problems involve an step of feature definition and preprocessing. Feature Forge helps you with:

Defining and documenting features
Testing your features against specified cases and against randomly generated cases (stress-testing). This helps you making your application more robust against invalid/misformatted input data. This also helps you checking that low-relevance results when doing feature analysis is actually because the feature is bad, and not because there's a slight bug in your feature code.
Evaluating your features on a data set, producing a feature evaluation matrix. The evaluator has a robust mode that allows you some tolerance both for invalid data and buggy features.
Experimentation: running, registering, classifying and reproducing experiments for determining best settings for your problems.

Installation

Just pip install featureforge.

Documentation

Documentation is available at http://feature-forge.readthedocs.org/en/latest/

Contact information

Javier Mansilla <[email protected]> (jmansilla at github)
Daniel Moisset <[email protected]> (dmoisset at github)
Rafael Carrascosa <[email protected]> (rafacarrascosa at github)

Any contributions or suggestions are welcome, the official channel for this is submitting github pull requests or issues.

Changelog

0.1.7:

StatsManager api change (order of arguments swapped)
For experimentation, enabled a way of booking experiments forever.

0.1.6:

Bug fixes related to sparse matrices.
Small documentation improvements.
Reduced default logging verbosity.

0.1.5:

Using sparse numpy matrices by default.

0.1.4:

Discarded the need of using forked version of Schema library.

0.1.3:

Added support for running and generating stats for experiments

0.1.2:

Fixing installer dependencies

0.1.1:

Added support for python 3
Added support for bag-of-words features

0.1:

Initial release

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Related tags

Overview

Feature Forge

Installation

Documentation

Contact information

Changelog

Owner

Machinalis

Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness

A simple python program that can be used to implement user authentication tokens into your program...

Data labels and scripts for fastMRI.org

E2VID_ROS - E2VID_ROS: E2VID to a real-time system

StrongSORT: Make DeepSORT Great Again

The implementation of the algorithm in the paper "Safe Deep Semi-Supervised Learning for Unseen-Class Unlabeled Data" published in ICML 2020.

Platform-agnostic AI Framework 🔥

Universal Adversarial Examples in Remote Sensing: Methodology and Benchmark

Domain Adaptation with Invariant RepresentationLearning: What Transformations to Learn?

Convert openmmlab (not only mmdetection) series model to tensorrt

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Contrastive Learning for Compact Single Image Dehazing, CVPR2021

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Cache Requests in Deta Bases and Echo them with Deta Micros

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation(mCOLT/mRASP2), ACL2021

This is the official code for the paper "Tracker Meets Night: A Transformer Enhancer for UAV Tracking".

Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

It is modified Tensorflow 2.x version of Mask R-CNN