AWS Data Wrangler Series - Part3- Working with Amazon Redshift

   Go back to the Task List

  « 5. Configure Glue Connection    7: Write Data Wrangler Code »

6: Setup Jupyter Notebook

You create developer endpoint and launch Jupyter Notebook which is used for the Data Wrangler programming.

  1. Goto the AWS Glue console, click on the Dev endpoints option in the left menu and then click on the Add endpoint button.

    Glue

  2. On the next screen, type in dojoendpoint as the name. Select dojo-glue-role as the IAM role. Then click on the Next button.

    Glue

  3. On the next screen, select Choose a connection as the option. Select dojoconnection as the connection. Click on the Next button.

    Glue

  4. On the next Add an SSH public key (Optional) screen, click on the Next button.

  5. On the next Review screen, click on the Finish button. The endpoint creation will start. It will take some 8-10 mins for the developer endpoint to be ready. Wait till the status changes to READY.

    Glue

  6. Once the developer endpoint is ready, select it and click on Create Sagemaker notebook under the Action dropdown menu.

    Glue

  7. On the next screen, enter dojonotebook as notebook name, select Create an IAM role as the option, type in dojonotebookrole for the IAM Role. Keep rest of the configuration to the default and click on the Create notebook button.

    Glue

  8. The notebook creation will start. Wait till the notebook status changes to Ready.

    Glue

  9. The notebook is associated with an IAM Role AWSGlueServiceSageMakerNotebookRole-dojonotebookrole. This role needs required Glue permission in order to use Glue Connection. Open this role details in AWS IAM Management Console and attach AWSGlueServiceRole policy to it.

    Glue

  10. The development environment is ready. Let’s launch Jupyter Notebook and write code there.