Geospatial data-science analysis on reasons behind delay in Grab ride-share services

Last update: Jun 07, 2022

Overview

Grab x Pulis

Detailed analysis done to investigate possible reasons for delay in Grab services for NUS Data Analytics Competition 2022, to be found in here and here.

Our main tech-stack:

Vahalla, a C++ implementation for map matching.
ipyleaflet, for very interactive visualizations of geospatial data analysis
geopandas
Dask
matplotlib & seaborn

We've shortlisted the reasons to be:

Traffic bottlenecks at popular shopping malls due to narrow infrastructures of pickup points. We comparatively found out that pickup speeds at Changi Airport with optimized pick-up and drop-off pioints are much faster at the initial and end-timings of each trip, compared to popular shopping malls with narrow queues at their pick-up and drop-off locations.
Drivers picking inefficient routes, as we compare the actual driver routes taken with popular Google Maps and Open Street Map routes which we pulled using Google Maps API and osmnx. We found out that drivers's supposed "shortcuts" are more often slower, albeit, there were in-fact expert-curated routes which were actually even faster than Google Maps and Open Street Maps. These insights could be used to augment Grab-Nav!

Team:

Keng Hwee Lead @kenghweeng
Russell Saerang @RussellDash332
Sean Gee Zhing @pikasean
Terry Lim @terrylimxc
Jonathan Chen @cysjonathan

Geospatial data-science analysis on reasons behind delay in Grab ride-share services

Related tags

Overview

Grab x Pulis

Owner

Keng Hwee

Single-Cell Analysis in Python. Scales to >1M cells.

A CLI tool to reduce the friction between data scientists by reducing git conflicts removing notebook metadata and gracefully resolving git conflicts.

A stock analysis app with streamlit

Jupyter notebooks for the book "The Elements of Statistical Learning".

Uses MIT/MEDSL, New York Times, and US Census datasources to analyze per-county COVID-19 deaths.

t-SNE and hierarchical clustering are popular methods of exploratory data analysis, particularly in biology.

Top 50 best selling books on amazon

MIR Cheatsheet - Survival Guidebook for MIR Researchers in the Lab

Kennedy Institute of Rheumatology University of Oxford Project November 2019

A pipeline that creates consensus sequences from a Nanopore reads. I

Numerical Analysis toolkit centred around PDEs, for demonstration and understanding purposes not production

Validation and inference over LinkML instance data using souffle

2019 Data Science Bowl

Python for Data Analysis, 2nd Edition

ToeholdTools is a Python package and desktop app designed to facilitate analyzing and designing toehold switches, created as part of the 2021 iGEM competition.

MotorcycleParts DataAnalysis python

[CVPR2022] This repository contains code for the paper "Nested Collaborative Learning for Long-Tailed Visual Recognition", published at CVPR 2022

Vaex library for Big Data Analytics of an Airline dataset

Reading streams of Twitter data, save them to Kafka, then process with Kafka Stream API and Spark Streaming

Pipeline to convert a haploid assembly into diploid