Workshoplists on AWS Dojo

Workshoplists on AWS Dojo https://aws-dojo.com/workshoplists/ Recent content in Workshoplists on AWS Dojo Hugo -- gohugo.io en-us Wed, 06 May 2020 09:52:45 +0200 Understanding Device to AWS IoT Core Communication https://aws-dojo.com/workshoplists/workshoplist1/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist1/ This is a very interesting workshop to understand how the communication between AWS IoT Core and devices work. Since there is no physical device, you will use MQTT.fx client to simulate the device. The device will send messages to the AWS IoT core. You will use AWS IoT rule to route and store the device messages to the Amazon S3 bucket. You will also implement a scenario where the AWS IoT core will communicate back to the device. Using Glue Job ETL from REST API Source to Amazon S3 Bucket Destination https://aws-dojo.com/workshoplists/workshoplist2/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist2/ In many scenarios, you are required to build an AWS Glue job which calls a REST API to fetch data for the ETL purpose. Such jobs can be configured to run either with a schedule or an event. The REST API could be deployed within the AWS Account or outside. In this workshop, you create an AWS Glue Job which calls the REST API hosted outside AWS Account. With little changes, you can create job which can call APIs hosted within the AWS Accounts as well. Create Data Lake with Amazon S3, Lake Formation and Glue https://aws-dojo.com/workshoplists/workshoplist3/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist3/ Ready to build a data lake - well a small one. In this workshop, you will keep two data sets sales and customers in Amazon S3. AWS Glue is used to catalog the data. You then use AWS Lake Formation to provide specific permission for the salesuser and customersuser users for the sales and customers tables. The users will use Amazon Athena to query the data as per the permission defined. Build managed self-service repository using AWS Service Catalog https://aws-dojo.com/workshoplists/workshoplist4/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist4/ AWS Service Catalog allows organizations to create and manage catalogs of IT services that are approved for use on AWS. AWS Service Catalog allows to centrally manage commonly deployed IT services, and helps achieve consistent governance and meet the compliance requirements, while enabling users to quickly deploy only the approved IT services they need. In this workshop, you configure Service Catalog for such use. You will create two portfolio which have their set of products (AWS Resources) associated. Build enterprise search service using Amazon Kendra https://aws-dojo.com/workshoplists/workshoplist5/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist5/ Amazon Kendra is an enterprise search service that delivers powerful natural language search capabilities to the websites and applications so that the end users can easily find the information they need within the vast amount of content spread across their organization. In this workshop, you will build a search service which will use Salesforce and Amazon S3 bucket as the data sources for the content. Once the search service is created, you will use Amazon Kendra Search Console to the test the service. Understanding Device Shadow Service in AWS IoT Core https://aws-dojo.com/workshoplists/workshoplist6/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist6/ A device shadow is a JSON document that is used to store and retrieve device state information. The device shadow is associated with AWS IoT thing object. Any application or service which wants to know the device state; can access the device shadow irrespective whether the device is connected to AWS IoT or not. Shadow communications use request/response model using the publish/subscribe communication model of MQTT. Every shadow action consists of a request topic, a successful response topic (accepted), and an error response topic (rejected). Using AWS Glue ETL Job with Streaming Data https://aws-dojo.com/workshoplists/workshoplist7/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist7/ Recently AWS announced streaming data support for AWS Glue ETL Jobs which helps in setting up continuous ingestion pipelines that processes streaming data on the fly. Streaming ETL jobs consume data from streaming sources likes Amazon Kinesis and Apache Kafka, clean and transform those data streams in-flight, and continuously load the results into Amazon S3 data lakes, data warehouses, or other data stores. In this workshop, you create an ETL job which will read streaming data from Kinesis data stream and upload to Amazon S3 bucket. Building AWS Glue Job using PySpark - Part:1(of 2) https://aws-dojo.com/workshoplists/workshoplist8/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist8/ AWS Glue Jobs are used to build ETL job which extracts data from sources, transforms the data, and loads it into targets. The job can be built using languages like Python and PySpark. PySpark is the Python API for Spark and it used for big data processing. It can perform data tranformation on large scale data in fast and efficient way. This workshop will be covered in two parts. Part-1: You learn about setting up a data lake, creating development environment for PySpark and finally building a Glue job using PySpark. Building AWS Glue Job using PySpark - Part:2(of 2) https://aws-dojo.com/workshoplists/workshoplist9/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist9/ In Part-1 of the workshop, you learnt about setting up a data lake, creating development environment for PySpark and finally building a Glue job using PySpark. In part-2, the focus is on learning PySpark for the ETL purpose. You must finish Part-1 of the workshop in order to start Part-2. Kindly complete Building AWS Glue Job using PySpark - Part:1(of 2) workshop before going to part-2. When building ETL job, you perform Extract, Transform and Load operations. Using Python to Communicate with AWS IoT Core https://aws-dojo.com/workshoplists/workshoplist10/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist10/ When building IoT device based solution, most of the time, you have to write code / firmware in the device to communicate with the AWS IoT Core to exchange messages. A simple example will be running raspberry pi based device which is running code to talk to AWS IoT core. In this exercise, you learn about writing such code in Python using AWS IoT Device SDK. Since there is no physical device, you will use AWS Cloud9 environment to simulate the device. AWS AI Services Programming Series - Part1 (Polly, Translate & Textract) https://aws-dojo.com/workshoplists/workshoplist11/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist11/ Amazon Web Services provides ready-made, pre-trained Artificial Intelligent (AI) Services to easily build applications and services. The AWS AI Services can be called programmatically using APIs and SDKs. You don’t need to train and test data for using the AI Service because they are pre-trained by AWS. AI Services on AWS don’t require machine learning experience. However many services like Rekognition do provide opportunity to train your custom data and then use as API. AWS AI Services Programming Series - Part2 (Lex) https://aws-dojo.com/workshoplists/workshoplist12/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist12/ Amazon Web Services provides ready-made, pre-trained Artificial Intelligent (AI) Services to easily build applications and services. The AWS AI Services can be called programmatically using APIs and SDKs. You don’t need to train and test data for using the AI Service because they are pre-trained by AWS. AI Services on AWS don’t require machine learning experience. However many services like Rekognition do provide opportunity to train your custom data and then use as API. AWS AI Services Programming Series - Part3 (Transcribe) https://aws-dojo.com/workshoplists/workshoplist13/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist13/ Amazon Web Services provides ready-made, pre-trained Artificial Intelligent (AI) Services to easily build applications and services. The AWS AI Services can be called programmatically using APIs and SDKs. You don’t need to train and test data for using the AI Service because they are pre-trained by AWS. AI Services on AWS don’t require machine learning experience. However many services like Rekognition do provide opportunity to train your custom data and then use as API. AWS Kinesis Data Transformation using Glue https://aws-dojo.com/workshoplists/workshoplist14/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist14/ Amazon Kinesis Delivery Stream (Data Firehose) is used to deliver real-time streaming data to destinations like Amazon S3, Amazon Redshift and Amazon Elasticsearch Service. It also supports third-party services like Splunk, Datadog, MongoDB, and New Relic as the destination. Amazon Kinesis Delivery Stream can convert the streaming data in JSON format to Apache Parquet or Apache ORC formats using a schema from a table defined in AWS Glue. Data in Apache Parquet or Apache ORC format is typically more efficient to query than JSON. AWS AI Services Programming Series - Part4 (Comprehend) https://aws-dojo.com/workshoplists/workshoplist15/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist15/ Amazon Comprehend is a natural language processing (NLP) service that uses machine learning to find insights and relationships in text. The developers can use Amazon Comprehend APIs with the programming language of their choice to create applications. The developers don’t need any machine learning experience for using the APIs. In this workshop, you write code to analyze a sample text for entities, key phrases, sentiments etc.. Start the workshop The AWS Resource consumption for the workshop falls under AWS Free Tier. Service Discovery using Amazon Cloud Map https://aws-dojo.com/workshoplists/workshoplist16/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist16/ AWS Cloud Map is a cloud resource discovery service. The resources and services used in the application can be defined in the Cloud Map. Application can dynamically discover and use the services / resources from the Cloud Map. This increases the application availability because it always discovers the most up-to-date locations of its resources / services. Cloud Map allows to register any application resources, such as databases, queues, microservices, and other cloud resources, with custom names. AWS AI Services Programming Series - Part5 (Fraud Detector) https://aws-dojo.com/workshoplists/workshoplist17/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist17/ Amazon Fraud Detector is a fully managed service that uses machine learning (ML) to identify potentially fraudulent activity. Amazon Fraud Detector automates the time consuming and expensive steps to build, train, and deploy an ML model for fraud detection. Amazon Fraud Detector customizes each model it creates to a customer’s own dataset. The model can be called using API to detect the frauds in the applications. In this workshop, you create a custom model using a training dataset stored in S3 bucket. Working with AWS IoT Device Defender https://aws-dojo.com/workshoplists/workshoplist18/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist18/ AWS IoT Device Defender helps in securing the fleet of IoT devices. It continuously audits the IoT configurations to make sure that they aren’t deviating from the security best practices. AWS IoT Device Defender allows to define appropriate behavior for each device using security metrics such as connections, failed authentications, number of messages, size of the messages etc. It continuously monitors security metrics from the devices and AWS IoT Core for deviations. AWS AI Services Programming Series - Part6 (Forecast) https://aws-dojo.com/workshoplists/workshoplist19/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist19/ Amazon Forecast is a fully managed service that uses machine learning to deliver highly accurate forecasts. The training data can be used with Amazon Forecast to train predictor model. The model is then used to generate forecast. For the model training, there is choice to either select a particular algorithm or let Amazon choose the best fitting algorithm. The developers can call the forecast using APIs to build their applications. Build AWS AppSync API and API Client using Python https://aws-dojo.com/workshoplists/workshoplist20/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist20/ AWS AppSync is used to create GraphQL APIs. The APIs can securely access, manipulate, and combine data from one or more data sources such as NoSQL data stores, relational databases, HTTP APIs, and any custom data sources with AWS Lambda. AppSync additionally provides local data access for mobile and web apps to enable device offline scenarios. In this workshop, you create an API with DynamoDB as the data source while the business logic is provided by the Lambda function. Introduction to AWS Glue Studio https://aws-dojo.com/workshoplists/workshoplist21/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist21/ AWS Glue Studio is GUI based service to create, run, and monitor extract, transform, and load (ETL) jobs in AWS Glue. It helps in visually composing data transformation workflows and run them on AWS Glue’s Apache Spark-based serverless ETL engine. AWS Glue Studio supports both tabular and semi-structured data. AWS Glue Studio also offers tools to monitor ETL workflows and validate that they are operating as intended. In this workshop, you create an ETL job using AWS Glue Studio which reads data from the data lake data catalog, performs transformation and writes to the S3 bucket. AWS IoT Core Integration with Amazon Timestream https://aws-dojo.com/workshoplists/workshoplist22/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist22/ AWS IoT Core enables the devices to easily and securely interact with cloud applications and other devices. AWS IoT Core supports HTTP, Web Sockets, and MQTT protocols for the communication. AWS IoT Core also allows to filter, transform, and act upon device data exchanged during the communication. Amazon Timestream is a serverless time series database service. Amazon Timestream manages time series data life cycle by keeping recent data in memory and moving historical data to a cost optimized storage tier. Using Custom Transformation in AWS Glue Studio https://aws-dojo.com/workshoplists/workshoplist23/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist23/ AWS Glue Studio is GUI based service to create, run, and monitor extract, transform, and load (ETL) jobs in AWS Glue. It helps in visually composing data transformation workflows and run them on AWS Glue’s Apache Spark-based serverless ETL engine. AWS Glue Studio supports both tabular and semi-structured data. AWS Glue Studio also offers tools to monitor ETL workflows and validate that they are operating as intended. Custom Transform (custom code node) in AWS Glue Studio allows to perform complicated transformations on the data using custom code. AWS AI Services Programming Series - Part7 (Personalize) https://aws-dojo.com/workshoplists/workshoplist24/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist24/ Amazon Personalize enables the developers to build applications capable of delivering a wide array of personalization experiences, including specific product recommendations, personalized product re-ranking, and customized direct marketing. Amazon Personalize is a fully managed machine learning service which uses custom ML models to deliver highly customized recommendations to customers across industries such as retail and media and entertainment. In this workshop, you use a sample data to create a model using Amazon Personalize. Create Custom Models using Amazon Rekognition Custom Labels https://aws-dojo.com/workshoplists/workshoplist25/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist25/ Amazon Rekognition Custom Labels help in identifying the objects and scenes in images that are specific to the business needs. It takes a lot of effort, time and skill to develop a custom model to analyze images. Amazon Rekognition Custom Labels makes it easy and takes care of the heavy lifting. Rekognition can begin training in just a few clicks. It starts with image uploading, labeling, a custom image analysis model training and finally using model with API calls to analyze the images. Use AWS Glue Network Connection to call APIs in Glue Jobs https://aws-dojo.com/workshoplists/workshoplist26/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist26/ In many scenarios, you are required to build an AWS Glue job which calls a REST API to fetch data for the ETL purpose. Such jobs can be configured to run either with a schedule or an event. The REST API could be deployed within the AWS Account or outside. In this workshop, you create an AWS Glue Job which calls the REST API hosted outside AWS Account. With little changes, you can create job which can call APIs hosted within the AWS Accounts as well. Handle AWS IoT Messages using AWS Lambda https://aws-dojo.com/workshoplists/workshoplist27/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist27/ AWS IoT Core enables the devices to easily and securely interact with cloud applications and other devices. AWS IoT Core supports HTTP, Web Sockets, and MQTT protocols for the communication. AWS IoT Core also allows to filter, transform, and act upon device data exchanged during the communication. AWS Lambda lets the users run the code without provisioning or managing servers. The users pay only for the compute time you consume. Using Jobs in AWS IoT Device Management https://aws-dojo.com/workshoplists/workshoplist28/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist28/ AWS IoT Device Management used to securely register, organize, monitor, and remotely manage IoT devices at scale. AWS IoT Job is a major component of AWS IoT Device Management. AWS IoT jobs can be used to define a set of remote operations that are sent to and executed on one or more devices connected to AWS IoT. A job is a remote operation such as software or firmware update that is sent to and executed on one or more devices connected to AWS IoT. Using AWS Glue Workflow https://aws-dojo.com/workshoplists/workshoplist29/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist29/ AWS Glue Workflow help create complex ETL activities involving multiple crawlers, jobs, and triggers. Each workflow manages the execution and monitoring of the components it orchestrates. The workflow records execution progress and status of its components, providing an overview of the larger task and the details of each step. The AWS Glue console also provides a visual representation of the workflow as a graph. In this workshop, you create a workflow to which orchestrates Glue Crawler and Glue Job. Using Amazon Redshift in AWS based Data Lake https://aws-dojo.com/workshoplists/workshoplist30/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist30/ A data lake is a centralized repository that allows to store all structured and unstructured data at any scale. The data in the data lake is used for different types of analytics—from dashboards and visualizations to big data processing, real-time analytics, and machine learning. Amazon Redshift is a managed, petabyte-scale data warehouse service in the cloud. Most of the times, Redshift based warehouse is registered as data source to the data lake. Using AWS Lake Formation Blueprint https://aws-dojo.com/workshoplists/workshoplist31/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist31/ AWS Glue Workflow is used to create complex ETL pipeline. The workflow uses crawlers, jobs and triggers to orchestrate the activities of the pipeline. The workflow can handle both snapshot and incremental data ETL and it can run on schedule or on demand. One can use Lake Formation Blueprint to simplify the workflow creation. Blueprint uses templates to enable ETL workflow configuration from the sources such as database and log files. AWS Glue DataBrew - working with AWS RDS https://aws-dojo.com/workshoplists/workshoplist32/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist32/ AWS Glue DataBrew is a visual data preparation tool to clean, transform and normalize data for the analytics and machine learning purpose. The tool is primarily used by data analysts and data scientists. There are more than 250 pre-built transformations to automate data preparation tasks. There is no need to write any code when using these transformations. The transformations can be easily converted into automated and scheduled job. In this workshop, you use DataBrew with data stored in RDS database and then convert the transformation into a job. AWS Glue Studio - working with AWS RDS https://aws-dojo.com/workshoplists/workshoplist33/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist33/ AWS Glue Studio is GUI based service to create, run, and monitor extract, transform, and load (ETL) jobs in AWS Glue. It helps in visually composing data transformation workflows and run them on AWS Glue’s Apache Spark-based serverless ETL engine. AWS Glue Studio supports both tabular and semi-structured data. AWS Glue Studio also offers tools to monitor ETL workflows and validate that they are operating as intended. In this workshop, you create an ETL job using AWS Glue Studio which reads data from Amazon RDS Database, performs transformation and writes to the S3 bucket. Getting Started with Amazon EMR https://aws-dojo.com/workshoplists/workshoplist34/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist34/ Amazon EMR is a big data platform for processing large scale data using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto. Amazon EMR is easy to set up, operate, and scale for the big data requirement by automating time-consuming tasks like provisioning capacity and tuning clusters. In this workshop, you launch an EMR cluster. You then use Jupyter Notebook to do PySpark based programming with EMR Cluster. Introduction to AWS IoT Analytics https://aws-dojo.com/workshoplists/workshoplist35/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist35/ AWS IoT Analytics is a managed service to run and operationalize analytics on large volumes of IoT data. AWS IoT Analytics stores data in time-series format and can apply filters, transforms, and enriches IoT data before storing. The analytics data can be analyzed by running ad hoc or scheduled queries using the built-in SQL query engine, or perform more complex analytics and machine learning inference. In this workshop, you configure AWS IoT Analytics to store time-series device data and analyze the data using Jupyter Notebook. Creating Private API in Amazon API Gateway https://aws-dojo.com/workshoplists/workshoplist36/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist36/ It is possible to create private REST APIs using AWS API Gateway. The private APIs are accessible only from Amazon VPC by using an interface VPC endpoint. Further, API level resource policy is used to allow or deny access to the API from selected VPCs and VPC endpoints. From on premises environment, VPN or AWS Direct Connect can be used to access the private API using VPC endpoints. In this workshop, you learn to configure private REST API and call from within the VPC. Amazon Redshift Federated Query with RDS PostgreSQL https://aws-dojo.com/workshoplists/workshoplist37/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist37/ Amazon Redshift federated query allows query and analysis of data across operational databases, data warehouses, and data lakes. Federated Query integrates queries from Amazon Redshift on live data in external databases with queries across the Amazon Redshift and Amazon S3 environments. Federated queries can work with external databases in Amazon RDS for PostgreSQL and Amazon Aurora with PostgreSQL compatibility. In this workshop, you configure federated query between Redshift and Amazon Aurora with PostgreSQL compatibility. Create Lambda with Container Based Runtime https://aws-dojo.com/workshoplists/workshoplist38/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist38/ The Lambda function can use container as the runtime. The Lambda function code and dependencies can be package as a container image. This image is then uploaded to a repository in Amazon Elastic Container Registry (ECR). Using this image as runtime, a Lambda function can be configured. In this workshop, you configure a Lambda Function which uses container as the runtime. The following diagram shows the scenario you are going to build. AWS Data Wrangler Series - Part3- Working with Amazon Redshift https://aws-dojo.com/workshoplists/workshoplist39/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist39/ Amazon Redshift is a managed, petabyte-scale data warehouse service in the cloud. AWS Data Wrangler is an open source initiative from AWS Professional Services. It extends the power of Pandas by allowing to work AWS data related services using Panda DataFrames. Once can use Python Panda and AWS Data Wrangler to build ETL with major services - Athena, Glue, Redshift, Timestream, QuickSight, CloudWatchLogs, DynamoDB, EMR, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL). Custom Text Classification using Amazon Comprehend https://aws-dojo.com/workshoplists/workshoplist40/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist40/ Amazon Comprehend can be used to build own models for the custom classification. The first step is to train a custom classifier to recognize the custom classes. The custom classifier is then used to classify text or documents. The classification can be conducted in real-time or in a batch process. In this workshop, you learn how to use Amazon Comprehend Custom Classification to identity real / fake news title. The following diagram shows the scenario you are going to build. Using Amazon EMR with AWS Glue Catalog https://aws-dojo.com/workshoplists/workshoplist41/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist41/ Amazon EMR is a big data platform for processing large scale data using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto. Amazon EMR is easy to set up, operate, and scale for the big data requirement by automating time-consuming tasks like provisioning capacity and tuning clusters. AWS Glue is used to catalog the data for the discoverability and accessibility purpose. In this workshop, you launch an EMR cluster. Athena Federated Query with Amazon RDS https://aws-dojo.com/workshoplists/workshoplist42/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist42/ Amazon Athena is a serverless and interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Amazon Athena Federated Query can be used to run SQL queries across data stored in relational, non-relational, object, and custom data sources. Athena uses data source connectors that run on AWS Lambda to execute federated queries. The data source connectors help connect with data sources like CloudWatch, DocumentDB, DynamoDB, HBase, JDBC data sources (like Redshift, MySql, SQL Server) etc. Getting Started with Amazon Elasticsearch https://aws-dojo.com/workshoplists/workshoplist43/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist43/ Amazon Elasticsearch Service is a managed service to deploy, secure, and run Elasticsearch in cost effective and scalable way. Elasticsearch allows to store, search, and analyze large volumes of data quickly in near real-time. It uses index for the fast search responses. The service supports open source Elasticsearch APIs, managed Kibana, integration with Logstash and other AWS services. In this workshop, you configure Elasticsearch domain and perform CRUD operations with the data. Securely Manage Private VPC EC2 Instances using Systems Manager https://aws-dojo.com/workshoplists/workshoplist44/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist44/ AWS Systems Manager provides a unified user interface to view operational data from multiple AWS services and allows to automate operational tasks across the AWS resources. Systems Manager enables resource and application management and makes it easy to operate and manage the infrastructure securely at scale. Systems Manager can manage resources like Amazon EC2 instances, Amazon EKS clusters, Amazon S3 buckets, or Amazon RDS instances. In this workshop, you manage EC2 instances deployed in a private subnet using AWS Systems Manager. Getting Started with Amazon Elastic Container Service https://aws-dojo.com/workshoplists/workshoplist45/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist45/ Amazon Elastic Container Service (Amazon ECS) is a managed container orchestration service. Amazon ECS is a highly scalable, fast container management service that makes it easy to run, stop, and manage containers on a cluster. In this workshop, you learn to deploy an application on Amazon ECS. The following diagram shows the scenario you are going to build. Start the workshop Deploying Application to Amazon ECS Fargate https://aws-dojo.com/workshoplists/workshoplist46/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist46/ AWS Fargate is a serverless compute engine for containers that works with both Amazon ECS and Amazon EKS. Fargate removes the need to provision and manage servers and let the developers to focus on building the applications. In this workshop, you learn to deploy an application on Amazon ECS Fargate. The following diagram shows the scenario you are going to build. Start the workshop Deploy Simple Web Applications in Lambda https://aws-dojo.com/workshoplists/workshoplist47/ Wed, 06 May 2020 09:52:45 +0200 https://aws-dojo.com/workshoplists/workshoplist47/ AWS Lambda is a serverless compute service that lets run the code without provisioning or managing servers. With Lambda, one can run code for virtually any type of application or backend service. One can write Lambda functions in language (Node.js, Python, Go, Java, and more) of their choice. In this workshop, you learn to deploy simple web application in AWS Lambda. The following diagram shows the scenario you are going to build.