TensorFlow implementation of an arbitrary order Factorization Machine

Last update: Dec 21, 2022

Overview

This is a TensorFlow implementation of an arbitrary order (>=2) Factorization Machine based on paper Factorization Machines with libFM.

It supports:

dense and sparse inputs
different (gradient-based) optimization methods
classification/regression via different loss functions (logistic and mse implemented)
logging via TensorBoard

The inference time is linear with respect to the number of features.

Tested on Python3.5, but should work on Python2.7

This implementation is quite similar to the one described in Blondel's et al. paper [https://arxiv.org/abs/1607.07195], but was developed independently and prior to the first appearance of the paper.

Dependencies

Installation

Stable version can be installed via pip install tffm.

Usage

The interface is similar to scikit-learn models. To train a 6-order FM model with rank=10 for 100 iterations with learning_rate=0.01 use the following sample

from tffm import TFFMClassifier
model = TFFMClassifier(
    order=6,
    rank=10,
    optimizer=tf.train.AdamOptimizer(learning_rate=0.01),
    n_epochs=100,
    batch_size=-1,
    init_std=0.001,
    input_type='dense'
)
model.fit(X_tr, y_tr, show_progress=True)

See example.ipynb and gpu_benchmark.ipynb for more details.

It's highly recommended to read tffm/core.py for help.

Testing

Just run python test.py in the terminal. nosetests works too, but you must pass the --logging-level=WARNING flag to avoid printing insane amounts of TensorFlow logs to the screen.

Citation

If you use this software in academic research, please, cite it using the following BibTeX:

@misc{trofimov2016,
author = {Mikhail Trofimov, Alexander Novikov},
title = {tffm: TensorFlow implementation of an arbitrary order Factorization Machine},
year = {2016},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/geffy/tffm}},
}

TensorFlow implementation of an arbitrary order Factorization Machine

Related tags

Overview

Dependencies

Installation

Usage

Testing

Citation

Owner

Mikhail Trofimov

Bayesian Additive Regression Trees For Python

Repository for DCA0305, an undergraduate course about Machine Learning Workflows and Pipelines

High performance implementation of Extreme Learning Machines (fast randomized neural networks).

GRaNDPapA: Generator of Rad Names from Decent Paper Acronyms

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Getting Profit and Loss Make Easy From Binance

🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams

PyTorch extensions for high performance and large scale training.

ThunderSVM: A Fast SVM Library on GPUs and CPUs

Random Forest Classification for Neural Subtypes

Classification based on Fuzzy Logic(C-Means).

TIANCHI Purchase Redemption Forecast Challenge

Cryptocurrency price prediction and exceptions in python

Learning --> Numpy January 2022 - winter'22

Mixing up the Invariant Information clustering architecture, with self supervised concepts from SimCLR and MoCo approaches

Machine Learning Algorithms ( Desion Tree, XG Boost, Random Forest )

Crypto-trading - ML techiques are used to forecast short term returns in 14 popular cryptocurrencies

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

A collection of neat and practical data science and machine learning projects

A simple guide to MLOps through ZenML and its various integrations.