Learn machine learning the fun way, with Oracle and RedBull Racing

Overview

Red Bull Racing Analytics Hands-On Labs

License: UPL Quality gate

Introduction

Are you interested in learning machine learning (ML)? How about doing this in the context of the exciting world of F1 racing?! Get your ML skills bootstrapped here with Oracle and Red Bull Racing!

Red Bull F1 Race Car

This tutorial teaches ML analytics with a series of hands-on labs (HOLs) using the Data Science service in Oracle Cloud Infrastructure.

You'll learn how to get data from some public data sources, then how to analyze this data using some of the latest ML techniques. In the process you'll build ML models and test them out in a predictor app.

Getting Started

There is some infrastructure that must be deployed before you can enjoy this tutorial. See the Terraform documentation for more information.

After the OCI infrastructure is deployed, proceed with the beginner's tutorial to start through the ML labs.

Prerequisites

You must have an OCI account. Click here to create a new cloud account.

This solution is designed to work with several OCI services, allowing you to quickly be up-and-running:

There are required OCI resources (see the Terraform documentation for more information) that are needed for this tutorial.

Notes/Issues

None at this time.

URLs

Contributing

This project is open source. Please submit your contributions by forking this repository and submitting a pull request! Oracle appreciates any contributions that are made by the open source community.

License

Copyright (c) 2021 Oracle and/or its affiliates.

Licensed under the Universal Permissive License (UPL), Version 1.0.

See LICENSE for more details.

Comments
  • Refactored Terraform code

    Refactored Terraform code

    • Compatible with ORM, Cloud Shell and Terraform CLI
    • Updated README to include instructions for all three methods
    • Refactored, removing unnecessary resources (Vault, public Subnet, etc.).
    • Added a nerd knob so that it could use an existing Group (rather than create a new one)
    • Fixed ORM RegEx filters to allow dashes (-) and underscores (_), for the names
    opened by timclegg 2
  • Issue with hands on lab guide - launchapp.sh missing

    Issue with hands on lab guide - launchapp.sh missing

    https://github.com/oracle-devrel/redbull-analytics-hol/tree/main/beginners#beginners-hands-on-lab

    In Starting The Web Application it reads:

    cd /home/opc/redbull-analytics-hol/beginners/web ./launchapp.sh start

    However is launchapp.sh is missing, for example

    (redbullenv) cd /home/opc/redbull-analytics-hol/beginners/web (redbullenv) ./launchapp.sh start bash: ./launchapp.sh: No such file or directory

    opened by raekins 1
  • fix: Updating schema.yaml syntax

    fix: Updating schema.yaml syntax

    Making the variable notation follow what the doc syntax shows (https://docs.oracle.com/en-us/iaas/Content/ResourceManager/Concepts/terraformconfigresourcemanager_topic-schema.htm)

    opened by timclegg 1
  • Exploratory Data Analysis Merge Issue

    Exploratory Data Analysis Merge Issue

    Hello I have been encountering an issue while running the lab. The Jupyter notebook 03.f1_analysis_EDA.ipynb has the following issue on cell number 5:


    ValueError Traceback (most recent call last) in ----> 1 df1 = pd.merge(races,results,how='inner',on=['raceId']) 2 df2 = pd.merge(df1,quali,how='inner',on=['raceId','driverId','constructorId']) 3 df3 = pd.merge(df2,drivers,how='inner',on=['driverId']) 4 df4 = pd.merge(df3,constructors,how='inner',on=['constructorId']) 5 df5 = pd.merge(df4,circuit,how='inner',on=['circuitId'])

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in merge(left, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate) 85 copy=copy, 86 indicator=indicator, ---> 87 validate=validate, 88 ) 89 return op.get_result()

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in init(self, left, right, how, on, left_on, right_on, axis, left_index, right_index, sort, suffixes, copy, indicator, validate) 654 # validate the merge keys dtypes. We may need to coerce 655 # to avoid incompatible dtypes --> 656 self._maybe_coerce_merge_keys() 657 658 # If argument passed to validate,

    ~/redbullenv/lib64/python3.6/site-packages/pandas/core/reshape/merge.py in _maybe_coerce_merge_keys(self) 1163 inferred_right in string_types and inferred_left not in string_types 1164 ): -> 1165 raise ValueError(msg) 1166 1167 # datetimelikes must match exactly

    ValueError: You are trying to merge on object and int64 columns. If you wish to proceed you should use pd.concat

    I’m using an oracle automatic deployment provided by oracle as part of their environment. I do not have a lot of experience with Python but one possible ible solution is to read the numeric values form the csv file as integer or float but I’m almost certain the solution might be a little more elaborated than that 😉. Anyway thanks for your time. I’m really excited to test your solution and finish the lab. Thanks again.

    opened by yankodavila 2
  • Has the PAR for the stack deploy image expired.

    Has the PAR for the stack deploy image expired.

    Cannot deploy stack as getting PAR expired message.

    2021/11/07 10:50:11[TERRAFORM_CONSOLE] [INFO] Error Message: work request did not succeed, workId: ocid1.coreservicesworkrequest.oc1.eu-amsterdam-1.abqw2ljrwz2n7qqj7ghdwtnlrqol355oumc7a6coushvgdrebskspaewh7ea, entity: image, action: CREATED. Message: Import image not found: PAR is invalid (maybe is expired or deleted), please check.

    PAR in stack file is https://objectstorage.eu-frankfurt-1.oraclecloud.com/p/khhPjc_IMuyBOMfZUcJajIzCpoZ5aC-D7VMCU__GVZRlIQueXLIIcaaqLOZIuT1a/n/emeasespainsandbox/b/publichol/o/redbullhol-20210809-1523

    opened by Mel-A-M 1
Releases(v0.1.8)
Owner
Oracle DevRel
Oracle DevRel
Sentiment analysis on streaming twitter data using Spark Structured Streaming & Python

Sentiment analysis on streaming twitter data using Spark Structured Streaming & Python This project is a good starting point for those who have little

Himanshu Kumar singh 2 Dec 04, 2021
Full automated data pipeline using docker images

Create postgres tables from CSV files This first section is only relate to creating tables from CSV files using postgres container alone. Just one of

1 Nov 21, 2021
WaveFake: A Data Set to Facilitate Audio DeepFake Detection

WaveFake: A Data Set to Facilitate Audio DeepFake Detection This is the code repository for our NeurIPS 2021 (Track on Datasets and Benchmarks) paper

Chair for Sys­tems Se­cu­ri­ty 27 Dec 22, 2022
Python Project on Pro Data Analysis Track

Udacity-BikeShare-Project: Python Project on Pro Data Analysis Track Basic Data Exploration with pandas on Bikeshare Data Basic Udacity project using

Belal Mohammed 0 Nov 10, 2021
:truck: Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

To launch a live notebook server to test optimus using binder or Colab, click on one of the following badges: Optimus is the missing framework to prof

Iron 1.3k Dec 30, 2022
Hangar is version control for tensor data. Commit, branch, merge, revert, and collaborate in the data-defined software era.

Overview docs tests package Hangar is version control for tensor data. Commit, branch, merge, revert, and collaborate in the data-defined software era

Tensorwerk 193 Nov 29, 2022
Weather Image Recognition - Python weather application using series of data

Weather Image Recognition - Python weather application using series of data

Kushal Shingote 1 Feb 04, 2022
An Integrated Experimental Platform for time series data anomaly detection.

Curve Sorry to tell contributors and users. We decided to archive the project temporarily due to the employee work plan of collaborators. There are no

Baidu 486 Dec 21, 2022
Generates a simple report about the current Covid-19 cases and deaths in Malaysia

Generates a simple report about the current Covid-19 cases and deaths in Malaysia. Results are delay one day, data provided by the Ministry of Health Malaysia Covid-19 public data.

Yap Khai Chuen 7 Dec 15, 2022
Making the DAEN information accessible.

The purpose of this repository is to make the information on Australian COVID-19 adverse events accessible. The Therapeutics Goods Administration (TGA) keeps a database of adverse reactions to medica

10 May 10, 2022
Analyze the Gravitational wave data stored at LIGO/VIRGO observatories

Gravitational-Wave-Analysis This project showcases how to analyze the Gravitational wave data stored at LIGO/VIRGO observatories, using Python program

1 Jan 23, 2022
Import, connect and transform data into Excel

xlwings_query Import, connect and transform data into Excel. Description The concept is to apply data transformations to a main query object. When the

George Karakostas 1 Jan 19, 2022
A tool to compare differences between dataframes and create a differences report in Excel

similarpanda A module to check for differences between pandas Dataframes, and generate a report in Excel format. This is helpful in a workplace settin

Andre Pretorius 9 Sep 15, 2022
Mining the Stack Overflow Developer Survey

Mining the Stack Overflow Developer Survey A prototype data mining application to compare the accuracy of decision tree and random forest regression m

1 Nov 16, 2021
CSV database for chihuahua (HUAHUA) blockchain transactions

super-fiesta Shamelessly ripped components from https://github.com/hodgerpodger/staketaxcsv - Thanks for doing all the hard work. This code does only

Arlene Macciaveli 1 Jan 07, 2022
Detailed analysis on fraud claims in insurance companies, gives you information as to why huge loss take place in insurance companies

Insurance-Fraud-Claims Detailed analysis on fraud claims in insurance companies, gives you information as to why huge loss take place in insurance com

1 Jan 27, 2022
pyETT: Python library for Eleven VR Table Tennis data

pyETT: Python library for Eleven VR Table Tennis data Documentation Documentation for pyETT is located at https://pyett.readthedocs.io/. Installation

Tharsis Souza 5 Nov 19, 2022
Tablexplore is an application for data analysis and plotting built in Python using the PySide2/Qt toolkit.

Tablexplore is an application for data analysis and plotting built in Python using the PySide2/Qt toolkit.

Damien Farrell 81 Dec 26, 2022
COVID-19 deaths statistics around the world

COVID-19-Deaths-Dataset COVID-19 deaths statistics around the world This is a daily updated dataset of COVID-19 deaths around the world. The dataset c

Nisa Efendioğlu 4 Jul 10, 2022
Probabilistic reasoning and statistical analysis in TensorFlow

TensorFlow Probability TensorFlow Probability is a library for probabilistic reasoning and statistical analysis in TensorFlow. As part of the TensorFl

3.8k Jan 05, 2023