Prepare Amazon Redshift Consumer Cluster

Lab 3

Now, let’s create the consumer Amazon Redshift cluster (we will refer this as consumer cluster throughout the lab) in us-west-1 region, and remember, we will not load the sample dataset in this cluster.

Step-1: Create Redshift Consumer Cluster

  1. Login into AWS Console (make sure us-west-1 region is selected in top right corner), and click Create Cluster.

  2. Provide Cluster name as redshift-cluster-west, and select ra3.xlplus node type.

NOTE: If you get access error launching cluster with ra3.xlplus node type, then select ra3.4xlarge node type. Please note, Amazon Redshift Data Sharing feature is not supported for previous generation dc2 node types, and Amazon Redshift only supports data sharing on the ra3.16xlarge, ra3.4xlarge, and ra3.xlplus instance types for producer and consumer clusters. Amazon Redshift ra3 nodes incurs cost as these nodes are not part of the Amazon Redshift free trial, or AWS Free Tier.

Create Cluster

  1. Do not select “Load Sample data”.

  2. Supply a password for Admin user.

Sample data Other configuration settings can be left as default.

  1. Click the Create Cluster button – it will take few minutes to create the cluster.

Step-2: Connect to database using query editor

Once the cluster is created (Status = Available), using one of the Amazon Redshift query editors is the easiest way to query the Amazon Redshift database. After creating your cluster, use the query editor v2 to connect to newly created database.

Query editor

Step-3: Validate database

In the query editor, click on the newly created cluster, and it will establish connection to the database. You will then see two databases created automatically – dev, sample_data_dev. The dev database has one schema called public, which will not have any tables as we did not select “Load Sample Data” during cluster creation, unlike produce cluster in us-east-1 region. We will refer this as consumer database throughout the lab. Expand the public schema under dev database:

Query editor 2

We now have both producer and consumer clusters installed, configured, and loaded sample dataset in Producer database. Next, we will baseline existing metrics and KPIs.