AWS – Redshift


AMAZON REDSHIFT

  • Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service that makes it simple and cost-effective to analyze all your data using standard SQL and existing Business Intelligence (BI) tools.
  • It uses nodes and clusters to organize data.

Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud.

A node is a collection of computing resources in an Amazon Redshift data warehouse.

These Nodes are organized into a group called a Cluster.

Each cluster runs an Amazon Redshift engine and contains one or more databases.

Clusters and Nodes

The key component of an Amazon Redshift data warehouse is a cluster.

Each cluster is composed of one leader node and one or more compute nodes.

At the time of writing, Amazon Redshift supports six node types, each with a different mix of CPU, memory, and storage.

The six node types are grouped into two categories: Dense Compute and Dense Storage.

The disk storage for a compute node is divided into a number of slices.

The number of slices per node depends on the node size of the cluster and typically varies between 2 and 16.

The nodes all participate in parallel query execution, working on data that is distributed as evenly as possible across the slices.
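As a rough illustration of how rows might be spread across slices, the sketch below hashes a distribution key and maps it to a (node, slice) pair. The function names and the use of CRC32 are assumptions made for this example only; the hash that Amazon Redshift uses internally is not described here.

```python
import zlib
from collections import Counter


def slice_for_key(key: str, num_nodes: int, slices_per_node: int) -> tuple:
    """Map a distribution key to a (node, slice-within-node) pair by hashing."""
    total_slices = num_nodes * slices_per_node
    # CRC32 is only a stand-in hash for this sketch, not Redshift's own.
    slice_id = zlib.crc32(key.encode("utf-8")) % total_slices
    # divmod splits the global slice number into node index and local slice index.
    return divmod(slice_id, slices_per_node)


def distribute(keys, num_nodes: int, slices_per_node: int) -> Counter:
    """Count how many rows land on each (node, slice) pair."""
    return Counter(slice_for_key(k, num_nodes, slices_per_node) for k in keys)


if __name__ == "__main__":
    keys = (f"order-{i}" for i in range(10_000))
    counts = distribute(keys, num_nodes=2, slices_per_node=2)
    for (node, slc), rows in sorted(counts.items()):
        print(f"node {node}, slice {slc}: {rows} rows")
```

The more evenly the keys hash, the more evenly the work is shared when all slices process a query in parallel.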

Amazon Redshift Management

The Amazon Redshift service manages all the work of setting up, operating, and scaling a data warehouse.

These tasks include provisioning capacity, monitoring and backing up the cluster, and applying patches and upgrades to the Amazon Redshift engine.

Amazon Redshift management covers four areas:

1. Managing Clusters

2. Access and Security for Clusters

3. Monitoring Clusters

4. Databases

1. Managing Clusters

An Amazon Redshift cluster is a set of nodes, which consists of a leader node and one or more compute nodes.

The type and number of compute nodes depend on the size of the data, the number of queries to be executed, and the query performance required.

  • Creating and managing clusters
  • Reserving compute nodes
  • Creating cluster snapshots
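Creating a cluster and taking a snapshot can also be scripted. The following is a minimal sketch using the boto3 SDK, which is an assumption of this example and is not mentioned in the original text. The helper names, cluster identifier, node type, and credentials are all hypothetical placeholders.

```python
def build_create_cluster_request(cluster_id, node_type, num_nodes,
                                 master_user, master_password, db_name="dev"):
    """Build keyword arguments for the boto3 Redshift create_cluster call."""
    if num_nodes < 1:
        raise ValueError("a cluster needs at least one node")
    request = {
        "ClusterIdentifier": cluster_id,
        "NodeType": node_type,
        "MasterUsername": master_user,
        "MasterUserPassword": master_password,
        "DBName": db_name,
    }
    if num_nodes == 1:
        request["ClusterType"] = "single-node"
    else:
        # NumberOfNodes is only supplied for multi-node clusters.
        request["ClusterType"] = "multi-node"
        request["NumberOfNodes"] = num_nodes
    return request


def create_cluster_and_snapshot(request, snapshot_id):
    """Create the cluster, wait until it is available, then snapshot it."""
    import boto3  # third-party SDK, imported lazily; needs AWS credentials

    client = boto3.client("redshift")
    client.create_cluster(**request)
    waiter = client.get_waiter("cluster_available")
    waiter.wait(ClusterIdentifier=request["ClusterIdentifier"])
    client.create_cluster_snapshot(
        SnapshotIdentifier=snapshot_id,
        ClusterIdentifier=request["ClusterIdentifier"],
    )


if __name__ == "__main__":
    # Hypothetical values; nothing is sent to AWS by building the request.
    print(build_create_cluster_request(
        "example-cluster", "dc2.large", 2, "awsuser", "Example-Passw0rd"))
```

Building the request separately from sending it makes it possible to review the settings before any billable resources are created.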

2. Access and Security for Clusters

Amazon Redshift provides several features for cluster access and security.

These features help users control access to their clusters, define connectivity rules, and encrypt data and connections.

  • AWS account and IAM credentials
  • Security Groups
  • Encryption
  • SSL Connection
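To make the SSL item concrete, the sketch below builds a PostgreSQL-style (libpq) connection string that insists on an encrypted connection. It assumes a client driver that accepts such strings, for example psycopg2. The helper name and the host shown are hypothetical. Port 5439 is the Amazon Redshift default.

```python
# SSL modes that refuse an unencrypted connection.
ENCRYPTED_SSL_MODES = {"require", "verify-ca", "verify-full"}


def redshift_dsn(host, dbname, user, port=5439, sslmode="require"):
    """Build a libpq-style connection string that enforces SSL.

    Values are assumed to contain no spaces or quotes; the password is
    deliberately left out so it can be supplied separately.
    """
    if sslmode not in ENCRYPTED_SSL_MODES:
        raise ValueError(f"sslmode {sslmode!r} would allow an unencrypted connection")
    return f"host={host} port={port} dbname={dbname} user={user} sslmode={sslmode}"


if __name__ == "__main__":
    # Hypothetical endpoint, for illustration only.
    print(redshift_dsn("example-cluster.example.us-west-2.redshift.amazonaws.com",
                       "dev", "awsuser"))
```

The stricter modes, verify-ca and verify-full, also check the server certificate, which requires the appropriate root certificate on the client.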

3. Monitoring Clusters

Amazon Redshift provides several features for monitoring clusters, such as:

  • Database audit logging
  • Events and notifications
  • Performance

4. Databases

Amazon Redshift creates one database when a user provisions a cluster.

Users load data and run queries against this database.

More databases can be created by running a SQL command.

Superuser: When provisioning a cluster, the user specifies a master user, who has access to all of the databases created within the cluster.

The master user can create additional superusers and users.
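The SQL commands for these steps can be sketched as follows. The Python helpers below only generate statement text. Their names, and the database and user names in the example, are hypothetical. In Amazon Redshift SQL, the CREATEUSER option is what grants superuser privileges to a new user.

```python
import re

# Conservative identifier check for this sketch: lowercase letters, digits, underscores.
_IDENTIFIER = re.compile(r"^[a-z_][a-z0-9_]*$")


def _checked(name):
    """Return the name unchanged if it is a simple, safe identifier."""
    if not _IDENTIFIER.match(name):
        raise ValueError(f"unsupported identifier: {name!r}")
    return name


def create_database_sql(name):
    """Statement that adds another database to the cluster."""
    return f"CREATE DATABASE {_checked(name)};"


def create_user_sql(name, password, superuser=False):
    """Statement that adds a user; CREATEUSER makes the new user a superuser."""
    if "'" in password or "\\" in password:
        raise ValueError("password must not contain quotes or backslashes in this sketch")
    option = " CREATEUSER" if superuser else ""
    return f"CREATE USER {_checked(name)} PASSWORD '{password}'{option};"


if __name__ == "__main__":
    print(create_database_sql("sales"))
    print(create_user_sql("report_admin", "Abcd1234", superuser=True))
    print(create_user_sql("analyst", "Abcd1234"))
```

These statements would be run by the master user, or by another superuser, through any SQL client connected to the cluster.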

Amazon Redshift uses parameter groups to define the behavior of all databases in a cluster, such as date presentation style and floating-point precision.
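As an illustration, the sketch below prepares a parameter group update for the two settings mentioned above, using the Redshift parameters datestyle and extra_float_digits. It assumes the boto3 SDK and a custom parameter group that already exists. The helper names and the group name are hypothetical.

```python
def build_parameter_update(group_name, datestyle="ISO, MDY", extra_float_digits=0):
    """Build keyword arguments for boto3's modify_cluster_parameter_group."""
    if not isinstance(extra_float_digits, int):
        raise TypeError("extra_float_digits must be an integer")
    return {
        "ParameterGroupName": group_name,
        "Parameters": [
            # Controls how date values are presented.
            {"ParameterName": "datestyle", "ParameterValue": datestyle},
            # Controls how many digits are shown for floating-point values.
            {"ParameterName": "extra_float_digits",
             "ParameterValue": str(extra_float_digits)},
        ],
    }


def apply_parameter_update(request):
    """Send the update to AWS; requires boto3 and valid AWS credentials."""
    import boto3  # third-party SDK, imported lazily

    boto3.client("redshift").modify_cluster_parameter_group(**request)


if __name__ == "__main__":
    # Hypothetical group name; building the request sends nothing to AWS.
    print(build_parameter_update("example-params", extra_float_digits=2))
```

Because a parameter group applies to every database in the cluster, a change made this way affects all of them at once.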
