Important Note: You will create AWS resources during this workshop, which will incur costs in your AWS account. Clean up the resources as soon as you finish the workshop to minimize the cost.

AWS Kinesis Data Transformation using Glue

An Amazon Kinesis Data Firehose delivery stream delivers real-time streaming data to destinations such as Amazon S3, Amazon Redshift, and Amazon Elasticsearch Service. It also supports third-party destinations such as Splunk, Datadog, MongoDB, and New Relic.

A delivery stream can convert incoming JSON records to Apache Parquet or Apache ORC before writing them to the destination, using the schema of a table defined in the AWS Glue Data Catalog. Parquet and ORC are columnar formats, so data stored in them is typically more efficient to query than raw JSON.
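As a rough illustration of the conversion setup described above, the sketch below builds the record format conversion settings as a plain Python dictionary, in the shape the Firehose API expects for the DataFormatConversionConfiguration block of an S3 destination. The role ARN, database name, table name, and region are hypothetical placeholders, not values from this workshop, and the function only constructs the settings; it does not call AWS.

```python
import json


def build_format_conversion_config(role_arn, database, table, region):
    """Return JSON-to-Parquet conversion settings for a Firehose S3 destination.

    All four arguments are placeholders supplied by the caller; nothing here
    is specific to the workshop's actual resources.
    """
    return {
        "Enabled": True,
        # The Glue Data Catalog table whose schema drives the conversion.
        "SchemaConfiguration": {
            "RoleARN": role_arn,
            "DatabaseName": database,
            "TableName": table,
            "Region": region,
            "VersionId": "LATEST",
        },
        # Incoming records are parsed as JSON.
        "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
        # Outgoing records are written as Apache Parquet.
        "OutputFormatConfiguration": {"Serializer": {"ParquetSerDe": {}}},
    }


# Hypothetical example values, for illustration only.
config = build_format_conversion_config(
    role_arn="arn:aws:iam::123456789012:role/example-firehose-role",
    database="example_db",
    table="example_table",
    region="us-east-1",
)
print(json.dumps(config, indent=2))
```

A dictionary like this would typically be passed as the DataFormatConversionConfiguration value inside the extended S3 destination settings when creating the delivery stream. To target ORC instead, the serializer key would be OrcSerDe rather than ParquetSerDe.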

In this workshop, you build a scenario in which a Kinesis Data Firehose delivery stream converts JSON-formatted source data into Apache Parquet-formatted destination data, using the schema of an AWS Glue Data Catalog table.


Start the workshop

The AWS resources used in this workshop are not covered by the AWS Free Tier.