Skip to content

πŸš• Performing Data Analytics on NYC Taxi data using GCP and MageAI

Notifications You must be signed in to change notification settings

Hamagistral/NYCTaxi-Analytics-ETL

Repository files navigation

Banner

    πŸš• NYC Taxi Trip Records Data Analysis

Data Engineering Project Using GCP & MageAI

Dashboard πŸŒ€ Data β˜„οΈ Request Feature

🎯 Goal

The goal of this project is to perform data analytics on NYC Taxi Trip Records using various tools and technologies, including GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Looker Studio.

πŸ’Ύ Dataset Used

Yellow trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts. The data used in the attached datasets were collected and provided to the NYC Taxi and Limousine Commission (TLC) by technology providers authorized under the Taxicab & Livery Passenger Enhancement Programs (TPEP/LPEP).

More info about the dataset can be found here :

  1. Website - https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
  2. Data Dictionary - https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf

πŸ“Š Dashboard

image image

πŸ•΅οΈ Key Insights

  • 🧳 Total Trips

    • "VeriFone Inc" is the provider with the most number of trips with over 88k trips and "Creative Mobile Technologies" with only 11k trips.
  • πŸ’³ Top Payment Types

    • NΒ°1: Credit Card with 66%
    • NΒ°2: Cash with 33%
  • πŸ‘¨β€πŸ‘©β€πŸ‘§β€πŸ‘§ Number of passengers by trip

    • 65% of the trips have only 1 passenger.
    • 13% have 2 passengers.
    • 8% have 5 passengers.
  • πŸ’΅ Common Rate Code

    • The most common final rate code in effect at the end of the trip is the "Standard rate" with over 97%, followed by JFK with 2.2%, Negotiated fare etc. with less than 1%

πŸ› οΈ Technologies Used

Python Pandas Jupyter Google Cloud mageai

πŸ“ Project Architecture

Banner

πŸ“„ Data Model

nyctaxi-data-model

πŸ”§ Mage ETL

nyctaxi-mage-etl

πŸ“¨ Contact Me

LinkedIn β€’ Website β€’ Gmail

About

πŸš• Performing Data Analytics on NYC Taxi data using GCP and MageAI

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published