Learn machine learning the fun way, with Oracle and RedBull Racing

Last update: Oct 24, 2022

Related tags

Overview

Red Bull Racing Analytics Hands-On Labs

Introduction

Are you interested in learning machine learning (ML)? How about doing this in the context of the exciting world of F1 racing?! Get your ML skills bootstrapped here with Oracle and Red Bull Racing!

This tutorial teaches ML analytics with a series of hands-on labs (HOLs) using the Data Science service in Oracle Cloud Infrastructure.

You'll learn how to get data from some public data sources, then how to analyze this data using some of the latest ML techniques. In the process you'll build ML models and test them out in a predictor app.

Getting Started

There is some infrastructure that must be deployed before you can enjoy this tutorial. See the Terraform documentation for more information.

After the OCI infrastructure is deployed, proceed with the beginner's tutorial to start through the ML labs.

Prerequisites

You must have an OCI account. Click here to create a new cloud account.

This solution is designed to work with several OCI services, allowing you to quickly be up-and-running:

There are required OCI resources (see the Terraform documentation for more information) that are needed for this tutorial.

Notes/Issues

None at this time.

URLs

Oracle and Red Bull partnership announcement

Contributing

This project is open source. Please submit your contributions by forking this repository and submitting a pull request! Oracle appreciates any contributions that are made by the open source community.

License

Licensed under the Universal Permissive License (UPL), Version 1.0.

See LICENSE for more details.

Comments

Refactored Terraform code
Compatible with ORM, Cloud Shell and Terraform CLI

Updated README to include instructions for all three methods

Refactored, removing unnecessary resources (Vault, public Subnet, etc.).

Added a nerd knob so that it could use an existing Group (rather than create a new one)

Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names
opened by timclegg 2
Issue with hands on lab guide - launchapp.sh missing

https://github.com/oracle-devrel/redbull-analytics-hol/tree/main/beginners#beginners-hands-on-lab

In Starting The Web Application it reads:

cd /home/opc/redbull-analytics-hol/beginners/web ./launchapp.sh start

However is launchapp.sh is missing, for example

(redbullenv) cd /home/opc/redbull-analytics-hol/beginners/web (redbullenv) ./launchapp.sh start bash: ./launchapp.sh: No such file or directory

opened by raekins 1
fix: Updating schema.yaml syntax

Making the variable notation follow what the doc syntax shows (https://docs.oracle.com/en-us/iaas/Content/ResourceManager/Concepts/terraformconfigresourcemanager_topic-schema.htm)

opened by timclegg 1
Exploratory Data Analysis Merge Issue

Hello I have been encountering an issue while running the lab. The Jupyter notebook 03.f1_analysis_EDA.ipynb has the following issue on cell number 5:

ValueError Traceback (most recent call last) in ----> 1 df1 = pd.merge(races,results,how='inner',on=['raceId']) 2 df2 = pd.merge(df1,quali,how='inner',on=['raceId','driverId','constructorId']) 3 df3 = pd.merge(df2,drivers,how='inner',on=['driverId']) 4 df4 = pd.merge(df3,constructors,how='inner',on=['constructorId']) 5 df5 = pd.merge(df4,circuit,how='inner',on=['circuitId'])

~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in merge(left, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate) 85 copy=copy, 86 indicator=indicator, ---> 87 validate=validate, 88 ) 89 return op.get_result()

~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in init(self, left, right, how, on, left_on, right_on, axis, left_index, right_index, sort, suffixes, copy, indicator, validate) 654 # validate the merge keys dtypes. We may need to coerce 655 # to avoid incompatible dtypes --> 656 self._maybe_coerce_merge_keys() 657 658 # If argument passed to validate,

~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in _maybe_coerce_merge_keys(self) 1163 inferred_right in string_types and inferred_left not in string_types 1164 ): -> 1165 raise ValueError(msg) 1166 1167 # datetimelikes must match exactly

ValueError: You are trying to merge on object and int64 columns. If you wish to proceed you should use pd.concat

I’m using an oracle automatic deployment provided by oracle as part of their environment. I do not have a lot of experience with Python but one possible ible solution is to read the numeric values form the csv file as integer or float but I’m almost certain the solution might be a little more elaborated than that 😉. Anyway thanks for your time. I’m really excited to test your solution and finish the lab. Thanks again.

opened by yankodavila 2
Has the PAR for the stack deploy image expired.

Cannot deploy stack as getting PAR expired message.

2021/11/07 10:50:11[TERRAFORM_CONSOLE] [INFO] Error Message: work request did not succeed, workId: ocid1.coreservicesworkrequest.oc1.eu-amsterdam-1.abqw2ljrwz2n7qqj7ghdwtnlrqol355oumc7a6coushvgdrebskspaewh7ea, entity: image, action: CREATED. Message: Import image not found: PAR is invalid (maybe is expired or deleted), please check.

PAR in stack file is https://objectstorage.eu-frankfurt-1.oraclecloud.com/p/khhPjc_IMuyBOMfZUcJajIzCpoZ5aC-D7VMCU__GVZRlIQueXLIIcaaqLOZIuT1a/n/emeasespainsandbox/b/publichol/o/redbullhol-20210809-1523

opened by Mel-A-M 1

Releases(v0.1.8)

v0.1.8(Feb 18, 2022)

Optimized the models generation for Quickstarts Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.7...v0.1.8
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(20.78 KB)
v0.1.7(Feb 17, 2022)

add quickstart configuration by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/43

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.6...v0.1.7
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(17.20 KB)
v0.1.6(Feb 17, 2022)
What's Changed

add quickstart configuration by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/43

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.5...v0.1.6
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(17.20 KB)
v0.1.5(Feb 16, 2022)
What's Changed

Livelabs02162022 by @jasperan in https://github.com/oracle-devrel/redbull-analytics-hol/pull/41

fix: updated Alyssa Cotton's changes by @jasperan in https://github.com/oracle-devrel/redbull-analytics-hol/pull/42

New Contributors

@jasperan made their first contribution in https://github.com/oracle-devrel/redbull-analytics-hol/pull/41

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.4...v0.1.5
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.33 KB)
v0.1.4(Jan 25, 2022)
What's Changed

Update Port for Jupyter Lab. Changed with last Stack script by @operard in https://github.com/oracle-devrel/redbull-analytics-hol/pull/38

automatically set the latest Oracle Linux 7.9 image build number as default OS image by @snafuz in https://github.com/oracle-devrel/redbull-analytics-hol/pull/40

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.3...v0.1.4
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.33 KB)
v0.1.3(Nov 10, 2021)
What's Changed

fix: ORM zip file not being generated properly

Fixed it so that ORM can be used to deploy the lab.

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.1.2...v0.1.3
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(11.21 KB)
v0.1.0(Nov 9, 2021)
The lab has been refactored to not use a custom compute image, but rather to build out the compute instance.

What's Changed

feat: removing custom image usage by @timclegg in https://github.com/oracle-devrel/redbull-analytics-hol/pull/34

Full Changelog: https://github.com/oracle-devrel/redbull-analytics-hol/compare/v0.0.12...v0.1.0
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.62 KB)
v0.0.12(Sep 6, 2021)

Redbull HOL Beginner Extension Period to access Image
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(9.01 KB)
v0.0.11(Aug 10, 2021)

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.06 KB)
v0.0.10(Aug 10, 2021)

The SSH public key is optional, but present in the ORM dialog. Happy deploying!
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.06 KB)
v0.0.9(Aug 9, 2021)

The SSH key isn't directly needed for the hands-on lab, so making this optional. Also some doc updates.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.83 KB)
v0.0.8(Aug 9, 2021)

Updated docs and a bug in the deployment.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.83 KB)
v0.0.7(Aug 6, 2021)

This release has a refactored "one-click" (or really close to it!) hands-on lab.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.82 KB)
v0.0.6(Aug 4, 2021)

This repo now can build its own ZIP files for ORM deployments. These are automatically built and stored in the release (as it's made).
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(8.19 KB)
v0.0.5(Jul 28, 2021)

Fixing situations where the group name and/or dynamic group name creation would fail, if it already existed. This might occur in situations where the HoL would be deployed more than once in the same tenancy. This eliminates the potential for collision with the same group names being used.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.40 KB)
v0.0.4(Jul 23, 2021)

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(10.23 KB)
v0.0.3(Jul 15, 2021)

Fixed home region detection.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.26 KB)
v0.2(Jul 14, 2021)
This release makes it easier to deploy the infrastructure, whether using ORM, Cloud Shell or Terraform CLI.

Added DevRel defined tags (and ignored the default tags)

Compatible with ORM, Cloud Shell and Terraform CLI

Updated README to include instructions for all three methods

Refactored, removing unnecessary resources (Vault, public Subnet, etc.).

Added a nerd knob so that it could use an existing Group (rather than create a new one)

Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names

Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(7.19 KB)
v0.1(Jun 21, 2021)

This release includes the beginner series of tutorials, along with the Terraform stack to create the required OCI resources.
Source code(tar.gz)
Source code(zip)
redbull-analytics-hol-latest.zip(9.24 KB)

Owner

Oracle DevRel

GitHub Repository

Data science/Analysis Health Care Portfolio

Health-Care-DS-Projects Data Science/Analysis Health Care Portfolio Consists Of 3 Projects: Mexico Covid-19 project, analyze the patient medical histo

1 Feb 13, 2022

Functional Data Analysis, or FDA, is the field of Statistics that analyses data that depend on a continuous parameter.

Functional Data Analysis Python package

184 Dec 27, 2022

TE-dependent analysis (tedana) is a Python library for denoising multi-echo functional magnetic resonance imaging (fMRI) data

tedana: TE Dependent ANAlysis TE-dependent analysis (tedana) is a Python library for denoising multi-echo functional magnetic resonance imaging (fMRI)

136 Dec 22, 2022

Python for Data Analysis, 2nd Edition

Python for Data Analysis, 2nd Edition Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media Buy

18.6k Jan 08, 2023

The Dash Enterprise App Gallery "Oil & Gas Wells" example

This app is based on the Dash Enterprise App Gallery "Oil & Gas Wells" example. For more information and more apps see: Dash App Gallery See the Dash

1 Nov 08, 2021

A model checker for verifying properties in epistemic models

Epistemic Model Checker This is a model checker for verifying properties in epistemic models. The goal of the model checker is to check for Pluralisti

2 Dec 22, 2021

Maximum Covariance Analysis in Python

xMCA | Maximum Covariance Analysis in Python The aim of this package is to provide a flexible tool for the climate science community to perform Maximu

39 Jan 03, 2023

An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

qgrid Qgrid is a Jupyter notebook widget which uses SlickGrid to render pandas DataFrames within a Jupyter notebook. This allows you to explore your D

2.9k Jan 08, 2023

Common bioinformatics database construction

biodb Common bioinformatics database construction 1.taxonomy （Substance classification database） Download the database wget -c https://ftp.ncbi.nlm.ni

2 Jan 04, 2022

An extension to pandas dataframes describe function.

pandas_summary An extension to pandas dataframes describe function. The module contains DataFrameSummary object that extend describe() with: propertie

450 Dec 30, 2022

Data pipelines built with polars

valves Warning: the project is very much work in progress. Valves is a collection of functions for your data .pipe()-lines. This project aimes to host

14 Jan 03, 2023

Anomaly Detection with R

AnomalyDetection R package AnomalyDetection is an open-source R package to detect anomalies which is robust, from a statistical standpoint, in the pre

3.5k Dec 27, 2022

ETL flow framework based on Yaml configs in Python

ETL framework based on Yaml configs in Python A light framework for creating data streams. Setting up streams through configuration in the Yaml file.

18 Jul 06, 2022

Exploring the Top ML and DL GitHub Repositories

This repository contains my work related to my project where I scraped data on the most popular machine learning and deep learning GitHub repositories in order to further visualize and analyze it.

17 Aug 21, 2022

Demonstrate the breadth and depth of your data science skills by earning all of the Databricks Data Scientist credentials

Data Scientist Learning Plan Demonstrate the breadth and depth of your data science skills by earning all of the Databricks Data Scientist credentials

27 Nov 01, 2022

PCAfold is an open-source Python library for generating, analyzing and improving low-dimensional manifolds obtained via Principal Component Analysis (PCA).

4 Oct 13, 2022

ETL pipeline on movie data using Python and postgreSQL

Movies-ETL ETL pipeline on movie data using Python and postgreSQL Overview This project consisted on a automated Extraction, Transformation and Load p

0 Jul 07, 2021

Learn machine learning the fun way, with Oracle and RedBull Racing

Related tags

Overview

Red Bull Racing Analytics Hands-On Labs

Introduction

Getting Started

Prerequisites

Notes/Issues

URLs

Contributing

License

Comments

Refactored Terraform code

Issue with hands on lab guide - launchapp.sh missing

fix: Updating schema.yaml syntax

Exploratory Data Analysis Merge Issue

Has the PAR for the stack deploy image expired.

Releases(v0.1.8)

v0.1.8(Feb 18, 2022)

v0.1.7(Feb 17, 2022)

v0.1.6(Feb 17, 2022)

What's Changed

v0.1.5(Feb 16, 2022)

What's Changed

New Contributors

v0.1.4(Jan 25, 2022)

What's Changed

v0.1.3(Nov 10, 2021)

What's Changed

v0.1.0(Nov 9, 2021)

What's Changed

v0.0.12(Sep 6, 2021)

v0.0.11(Aug 10, 2021)

v0.0.10(Aug 10, 2021)

v0.0.9(Aug 9, 2021)

v0.0.8(Aug 9, 2021)

v0.0.7(Aug 6, 2021)

v0.0.6(Aug 4, 2021)

v0.0.5(Jul 28, 2021)

v0.0.4(Jul 23, 2021)

v0.0.3(Jul 15, 2021)

v0.2(Jul 14, 2021)

v0.1(Jun 21, 2021)

Owner

Oracle DevRel

Data science/Analysis Health Care Portfolio

Functional Data Analysis, or FDA, is the field of Statistics that analyses data that depend on a continuous parameter.

TE-dependent analysis (tedana) is a Python library for denoising multi-echo functional magnetic resonance imaging (fMRI) data

Python for Data Analysis, 2nd Edition

The Dash Enterprise App Gallery "Oil & Gas Wells" example

A model checker for verifying properties in epistemic models

Maximum Covariance Analysis in Python

An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

Common bioinformatics database construction

An extension to pandas dataframes describe function.

Data pipelines built with polars

Anomaly Detection with R

ETL flow framework based on Yaml configs in Python

Exploring the Top ML and DL GitHub Repositories

Demonstrate the breadth and depth of your data science skills by earning all of the Databricks Data Scientist credentials

signac-flow - manage workflows with signac

Sentiment analysis on streaming twitter data using Spark Structured Streaming & Python

Sample code for Harry's Airflow online trainng course

PCAfold is an open-source Python library for generating, analyzing and improving low-dimensional manifolds obtained via Principal Component Analysis (PCA).

ETL pipeline on movie data using Python and postgreSQL