A workshop with several modules to help learn Feast, an open-source feature store

Last update: Jan 05, 2023

Related tags

Text Data & NLP feast-workshop

Overview

Workshop: Learning Feast

This workshop aims to teach users about Feast, an open-source feature store.

We explain concepts & best practices by example, and also showcase how to address common use cases.

What is Feast?

Feast is an operational system for managing and serving machine learning features to models in production. It can serve features from a low-latency online store (for real-time prediction) or from an offline store (for batch scoring).

Why Feast?

Feast solves several common challenges teams face:

Lack of feature reuse across teams
Complex point-in-time-correct data joins for generating training data
Difficulty operationalizing features for online inference while minimizing training / serving skew

Pre-requisites

This workshop assumes you have the following installed:

A local development environment that supports running Jupyter notebooks (e.g. VSCode with Jupyter plugin)
Python 3.7+
Java 11 (for Spark, e.g. brew install java11)
pip
Docker & Docker Compose (e.g. brew install docker docker-compose)
Terraform (docs)
AWS CLI
An AWS account setup with credentials via aws configure (e.g see AWS credentials quickstart)

Since we'll be learning how to leverage Feast in CI/CD, you'll also need to fork this workshop repository.

Caveats

M1 Macbook development is untested with this flow. See also How to run / develop for Feast on M1 Macs.
Windows development has only been tested with WSL. You will need to follow this guide to have Docker play nicely.

Modules

These are meant mostly to be done in order, with examples building on previous concepts.

Time (min)	Description	Module
30-45	Setting up Feast projects & CI/CD + powering batch predictions	Module 0
15-20	Streaming ingestion & online feature retrieval with Kafka, Spark, Redis	Module 1
10-15	Real-time feature engineering with on demand transformations	Module 2
TBD	Feature server deployment (embed, as a service, AWS Lambda)	TBD
TBD	Versioning features / models in Feast	TBD
TBD	Data quality monitoring in Feast	TBD
TBD	Batch transformations	TBD
TBD	Stream transformations	TBD

A workshop with several modules to help learn Feast, an open-source feature store

Related tags

Overview

Workshop: Learning Feast

What is Feast?

Why Feast?

Pre-requisites

Modules

Owner

Feast

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

Natural language Understanding Toolkit

Rhyme with AI

2021 AI CUP Competition on Traditional Chinese Scene Text Recognition - Intermediate Contest

Natural Language Processing at EDHEC, 2022

Share constant definitions between programming languages and make your constants constant again

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Natural language computational chemistry command line interface.

DeepAmandine is an artificial intelligence that allows you to talk to it for hours, you won't know the difference.

A complete NLP guideline for enthusiasts

HAN2HAN : Hangul Font Generation

Galois is an auto code completer for code editors (or any text editor) based on OpenAI GPT-2.

Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition

UniSpeech - Large Scale Self-Supervised Learning for Speech

Dust model dichotomous performance analysis

A PyTorch implementation of VIOLET

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Combating Embedding Barrier in Multilingual Models for Low-Resource Language Understanding".

🏆 • 5050 most frequent words in 109 languages

BERN2: an advanced neural biomedical namedentity recognition and normalization tool

In this project, we compared Spanish BERT and Multilingual BERT in the Sentiment Analysis task.