Loading Data into a Redshift Cluster

1.5 hours
  • 3 Learning Objectives

About this Hands-on Lab

In this lab, you will load data from either an S3 bucket or a DynamoDB table, whichever you choose, into a running Redshift cluster. You will need to create an IAM role that provides access to the S3 bucket or DynamoDB table and apply it to the Redshift cluster. Experience with creating S3 buckets or DynamoDB tables will be useful.

Learning Objectives

Successfully complete this lab by achieving the following learning objectives:

Prepare the Source Data

Download the appropriately formatted sample data.

If using an S3 bucket, download the CSV-formatted redshift-data.csv and upload it to a newly created S3 bucket with a name of your choosing. Note that S3 bucket names must be globally unique.

If using a DynamoDB table, download the JSON-formatted redshift-data.json and load it into a DynamoDB table with a name of your choosing.

Create IAM Role

Create an IAM role with the appropriate permissions for your source type: AmazonS3ReadOnlyAccess for an S3 bucket, or AmazonDynamoDBReadOnlyAccess for a DynamoDB table.

Once the role is created, add it to your Redshift cluster.
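The role setup can also be scripted rather than done in the console. The following is a minimal Python sketch, not the lab's prescribed method. It assumes boto3 is installed and AWS credentials are configured; the function names and the role_name and cluster_id parameters are hypothetical and introduced only for illustration. The trust policy lets the Redshift service assume the role, and the managed policy grants read-only access to the chosen source.

```python
import json

# AWS managed policies named in this lab, keyed by source type.
MANAGED_POLICY_ARNS = {
    "s3": "arn:aws:iam::aws:policy/AmazonS3ReadOnlyAccess",
    "dynamodb": "arn:aws:iam::aws:policy/AmazonDynamoDBReadOnlyAccess",
}


def redshift_trust_policy():
    """Return a trust policy that allows Redshift to assume the role."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Principal": {"Service": "redshift.amazonaws.com"},
                "Action": "sts:AssumeRole",
            }
        ],
    }


def create_and_attach_role(source, role_name, cluster_id):
    """Create the role, attach the read-only policy, add it to the cluster.

    source is "s3" or "dynamodb"; role_name and cluster_id are
    placeholders that you supply (assumptions, not values from the lab).
    """
    import boto3  # imported here so the helpers above work without it

    iam = boto3.client("iam")
    role = iam.create_role(
        RoleName=role_name,
        AssumeRolePolicyDocument=json.dumps(redshift_trust_policy()),
    )
    iam.attach_role_policy(
        RoleName=role_name,
        PolicyArn=MANAGED_POLICY_ARNS[source],
    )
    role_arn = role["Role"]["Arn"]

    # Associate the new role with the running cluster.
    boto3.client("redshift").modify_cluster_iam_roles(
        ClusterIdentifier=cluster_id,
        AddIamRoles=[role_arn],
    )
    return role_arn
```

Keep the returned role ARN; the COPY command in the next objective needs it.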

Load the Data

Connect to the Redshift cluster and create a table to receive the imported data, making sure its columns match those of the source data. Once the table is created, use the COPY command to load the data.

Note that the ARN of the IAM role must be enclosed in single quotes in the COPY command.
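To illustrate the shape of the COPY command for each source, here is a small Python sketch that only assembles the SQL text; you would still run the result in your SQL client. The table name, bucket name, DynamoDB table name, and role ARN in the usage lines are hypothetical placeholders, and the READRATIO value of 50 is an arbitrary choice (Redshift requires a READRATIO when copying from DynamoDB). Whether IGNOREHEADER is needed depends on whether the CSV file has a header row, which is an assumption to check against the sample data.

```python
def copy_from_s3(table, bucket, key, role_arn, ignore_header=False):
    """Build a COPY statement that loads a CSV object from S3."""
    options = ["CSV"]
    if ignore_header:
        # Skip the first line when the file starts with column names.
        options.append("IGNOREHEADER 1")
    return (
        f"COPY {table} FROM 's3://{bucket}/{key}' "
        f"IAM_ROLE '{role_arn}' {' '.join(options)};"
    )


def copy_from_dynamodb(table, dynamodb_table, role_arn, read_ratio=50):
    """Build a COPY statement that loads from a DynamoDB table."""
    return (
        f"COPY {table} FROM 'dynamodb://{dynamodb_table}' "
        f"IAM_ROLE '{role_arn}' READRATIO {read_ratio};"
    )


# Hypothetical placeholder values; substitute your own.
ROLE_ARN = "arn:aws:iam::123456789012:role/my-redshift-role"

print(copy_from_s3("sample_data", "my-lab-bucket", "redshift-data.csv", ROLE_ARN))
print(copy_from_dynamodb("sample_data", "my-lab-table", ROLE_ARN))
```

In both statements the role ARN sits inside single quotes, as the note above requires.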

Additional Resources

In this lab, you are a Database Administrator who manages your company's Redshift cluster. The Development team has requested a way to import data into the Redshift cluster from either an S3 bucket or a DynamoDB table. They have provided you with a small sample of the data to be imported.

The sample data is available in different formats depending on which source you choose (S3 or DynamoDB).

Use the CSV format for loading into the S3 bucket: redshift-data.csv

Use the JSON format for loading into the DynamoDB table: redshift-data.json

Use the following credentials to access the Redshift cluster:

  • User: masteruser
  • Password: MasterPasswd2020!


