Skip to content

Latest commit

 

History

History
43 lines (25 loc) · 1.88 KB

File metadata and controls

43 lines (25 loc) · 1.88 KB

Building an End-to-End Serverless Data Analytics Solution on AWS

Overview

In this lab, we are going to build a serverless architecture to analyze the data directly from Amazon S3 using Amazon Athena and visualize the data in Amazon QuickSight.

The data set that we are going to use is a public data set that includes trip records from all trips completed in Yellow and Green taxis in New York City from 2009 to 2016, and all trips in for-hire vehicles (FHV) from 2015 to 2016.

Records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts.

The overall architecture is going to be something like this:

architecture-overview.png


Labs

Lab Name
Lab 1 Serverless ETL and Data Discovery using Amazon Glue
Lab 2 Serverless Analysis of data in Amazon S3 using Amazon Athena
Lab 3 Analysis of data in Amazon S3 using Amazon Redshift Spectrum
Lab 4 Visualization using Amazon QuickSight

ADDITIONAL RESOURCES

Amazon Athena:

Redshift Spectrum

Serverless Analysis Architecture Blogs: