Using AWS Glue Workflow

3. Create S3 Bucket

You will create an Amazon S3 bucket that serves several purposes. It stores the sample data (customers.csv), which is cataloged in the Glue Database. It has a folder (target) to which the Glue Job writes processed data. It also has a folder (scripts) that is used by the Glue Job.

Download the sample data (customers.csv) from the link.

  1. Go to the S3 Management Console and create an S3 bucket named dojo-data. If that name is not available (bucket names are globally unique), use a name that is available. In this bucket, create three folders: customers, target and scripts.

  2. Open the customers folder and upload customers.csv to it.

  3. The data is ready. In the next step, you will create a Glue Database.
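
The console steps above can also be scripted. The following is a minimal sketch, not part of the original walkthrough, assuming the boto3 library is installed and AWS credentials are configured. The helper names folder_keys and setup_bucket are hypothetical, and the default region us-east-1 is an assumption, since the tutorial does not name a region. Folders are created as zero-byte objects whose keys end in a slash, which the S3 console displays as folders.

```python
def folder_keys(folders=("customers", "target", "scripts")):
    """Return the placeholder keys that the S3 console shows as folders."""
    return [f"{name}/" for name in folders]


def setup_bucket(s3, bucket, csv_path, region="us-east-1"):
    """Create the bucket, its three folders, and upload the sample data.

    `s3` is expected to be a boto3 S3 client, e.g. boto3.client("s3").
    `region` defaults to us-east-1, which is an assumption of this sketch.
    """
    if region == "us-east-1":
        # us-east-1 does not accept an explicit LocationConstraint.
        s3.create_bucket(Bucket=bucket)
    else:
        s3.create_bucket(
            Bucket=bucket,
            CreateBucketConfiguration={"LocationConstraint": region},
        )

    # Step 1: create the customers, target and scripts folders.
    for key in folder_keys():
        s3.put_object(Bucket=bucket, Key=key)

    # Step 2: upload customers.csv into the customers folder.
    s3.upload_file(csv_path, bucket, "customers/customers.csv")


# Example usage (requires boto3 and AWS credentials):
#   import boto3
#   setup_bucket(boto3.client("s3"), "dojo-data", "customers.csv")
```

If the name dojo-data is already taken, pass whichever bucket name you chose in step 1. Passing the client in as a parameter keeps the function easy to try against a stub before running it on a real account.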