Skip to content
View trevransom's full-sized avatar
Block or Report

Block or report trevransom

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

  1. dend-final-spark dend-final-spark Public

    A Spark ETL running millions of rows from immigrant, temperature and demographic records into a star schema.

    Python

  2. sparkify-airflow-etl sparkify-airflow-etl Public

    A high grade Airflow data pipeline that is dynamic and built from reusable tasks, can be monitored, and allows easy backfills.

    Python 1

  3. spark_aws_data_lake spark_aws_data_lake Public

    Spark and AWS Data Lake with ETL pipeline

    Python 1

  4. aws_sparkify_etl aws_sparkify_etl Public

    AWS Redshift data warehouse and S3 solution to provide an analytical database for Sparkify from JSON logs

    Python 1 1

  5. sparkify_etl sparkify_etl Public

    Sparkify, a music streaming startup, wanted to collect logs they have on user activity and song data and centralize them in a database in order to run analytics. This Postgres database, set up with…

    Jupyter Notebook 1 1

  6. cassandra_etl cassandra_etl Public

    Modeled data by creating tables in Apache Cassandra to run queries. Configured ETL pipeline that transfers data from a set of CSV files within a directory to create a streamlined CSV file to model …

    Jupyter Notebook 1 1