- Amazon Redshift is a fast, fully managed, petabyte scale data warehouse service that makes it simple and cost-effective
- It analyze all the data using standard SQL and existing Business Intelligence (BI) tools.
- Use Nodes and Clusters to organize data.
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud.
Nodes represents the collection of computing resources in data warehouse of Amazon Redshift.
These Nodes are organized into a group called a Cluster.
Each cluster runs an Amazon Redshift engine and contains one or more databases.
Clusters and Nodes
The key component of an Amazon Redshift data warehouse is a cluster.
Each cluster is composed of a one leader node and one or more compute nodes.
Amazon Redshift currently has support for six different node types and each has a different mix of CPU, memory, and storage.
The six node types are grouped into two categories: Dense Compute and Dense Storage.
Clusters and Nodes
The disk storage for a compute node is divided into a number of slices.
The number of slices per node depends on the node size of the cluster and typically varies between 2 and 16.
The nodes all participate in parallel query execution, working on data that is distributed as evenly as possible across the slices.
Amazon Redshift Management
The Amazon Redshift service manages all the work such as setting up, operating, and scaling a data warehouse.
These tasks include provisioning capacity, monitoring and backing up the cluster, and applying patches and upgrades to the Amazon Redshift engine.
Amazon Redshift Management:
2.Access and security to cluster
An Amazon Redshift cluster is a set of nodes, which consists of a leader node and one or more compute nodes.
The type and number of compute nodes depends on the size of data, the number of queries execute and the query execution performance.
- Creating and managing cluster
- Reserving Compute nodes
- Creating Clusters Snapshots
2.Access and Security to cluster
Access and security in Amazon Redshift clusters have many features.
These features help user to control access to their cluster, define connectivity rules and encrypt data and connections.
- AWS account and IAM credentials
- Security Groups
- SSL Connection
There are many features related to monitoring cluster in Amazon Redshift such as:
- Database Audit logging
- Events and Notifications
Amazon Redshift creates one database when user provision a cluster.
User use this database to load data and run queries.
More databases can be created by running a SQL command.
Super user : When a cluster is provision user specify a master user who has access to all of the databases that are created within the cluster.
Master user can create additional super users and users.
Amazon Redshift uses parameter groups to define the behavior of all databases in a cluster, such as date presentation style and floating-point precision.