Redshift
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service.
Redshift can be configured as follows:
Single node
Multi-node
Leader node - manages client connections and receives queries
Compute node - stores data and performs queries and computations. A cluster can have up to 128 compute nodes.
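The single-node versus multi-node choice can be sketched as a small helper that builds request parameters. This is a minimal sketch, not a definitive setup: the function name cluster_params and the example values are hypothetical, and it assumes the parameter names accepted by the boto3 Redshift create_cluster call (ClusterIdentifier, NodeType, ClusterType, NumberOfNodes).

```python
from typing import Any, Dict

MAX_COMPUTE_NODES = 128  # limit quoted in these notes


def cluster_params(cluster_id: str, node_type: str, compute_nodes: int) -> Dict[str, Any]:
    """Build keyword arguments for a cluster creation request (hypothetical helper)."""
    if not 1 <= compute_nodes <= MAX_COMPUTE_NODES:
        raise ValueError(f"compute_nodes must be between 1 and {MAX_COMPUTE_NODES}")
    params: Dict[str, Any] = {
        "ClusterIdentifier": cluster_id,
        "NodeType": node_type,
    }
    if compute_nodes == 1:
        # Single node: one node handles both leader and compute duties.
        params["ClusterType"] = "single-node"
    else:
        # Multi-node: a leader node plus the requested compute nodes.
        params["ClusterType"] = "multi-node"
        params["NumberOfNodes"] = compute_nodes
    return params


# Example values are illustrative only.
single = cluster_params("demo-single", "ra3.4xlarge", 1)
multi = cluster_params("demo-multi", "ra3.4xlarge", 4)
```

If boto3 is in use, the resulting dictionary could be passed to the Redshift client's create_cluster call together with the admin credentials, which are left out here on purpose.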
Redshift advanced compression
Columnar data stores can be compressed much more than row-based data stores because similar data is stored sequentially on disk. Amazon Redshift employs multiple compression techniques and can often achieve significant compression relative to traditional relational data stores. In addition, Amazon Redshift does not require indexes or materialized views, and so uses less space than traditional relational database systems. When loading data into an empty table, Amazon Redshift automatically samples the data and selects the most appropriate compression scheme.
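To see why storing similar values next to each other helps, here is a toy illustration using run-length encoding, one of several kinds of compression a columnar store can apply. It is only a sketch: the table contents and the helper run_length_encode are made up, and it does not reflect how Redshift implements compression internally.

```python
from itertools import groupby
from typing import List, Sequence, Tuple


def run_length_encode(values: Sequence[str]) -> List[Tuple[str, int]]:
    """Collapse consecutive repeated values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


# Hypothetical table of (status, region) rows.
table_rows = [
    ("active", "us-east"),
    ("active", "us-east"),
    ("active", "us-east"),
    ("closed", "eu-west"),
    ("closed", "eu-west"),
    ("closed", "eu-west"),
]

# Row-based layout: values from different columns are interleaved.
row_layout = [value for row in table_rows for value in row]

# Columnar layout: each column's values are stored together.
status_column = [row[0] for row in table_rows]
region_column = [row[1] for row in table_rows]

row_runs = run_length_encode(row_layout)
column_runs = run_length_encode(status_column) + run_length_encode(region_column)

print(len(row_runs), len(column_runs))  # prints "12 4"
```

In the row layout no two neighbouring values match, so nothing collapses. In the columnar layout the same twelve values shrink to four runs.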
Massively Parallel Processing (MPP)
Amazon Redshift automatically distributes data and query load across all nodes. Redshift makes it easy to add nodes to your data warehouse and enables you to maintain fast query performance as your data warehouse grows.
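The idea of spreading data and query work across nodes can be sketched in a few lines. This is a simplified model under stated assumptions: rows are assigned to nodes by a CRC32 hash of a key (not Redshift's actual distribution function), the "nodes" are plain Python lists processed one after another rather than in parallel, and all names are hypothetical.

```python
import zlib
from collections import defaultdict
from typing import Dict, List, Tuple

Order = Tuple[str, int]  # (customer_id, amount)


def distribute(orders: List[Order], node_count: int) -> Dict[int, List[Order]]:
    """Assign each row to a node by hashing its key, so equal keys share a node."""
    slices: Dict[int, List[Order]] = defaultdict(list)
    for customer_id, amount in orders:
        node = zlib.crc32(customer_id.encode("utf-8")) % node_count
        slices[node].append((customer_id, amount))
    return slices


def distributed_sum(orders: List[Order], node_count: int) -> int:
    """Each node sums its own rows; a leader step then adds the partial sums."""
    slices = distribute(orders, node_count)
    partial_sums = [sum(amount for _, amount in node_rows) for node_rows in slices.values()]
    return sum(partial_sums)


orders = [("c1", 10), ("c2", 25), ("c3", 5), ("c1", 40), ("c4", 20)]
total = distributed_sum(orders, node_count=3)
print(total)  # prints 100
```

Because each node only touches its own slice, adding nodes reduces the work per node while the final result stays the same.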
Backups
Automated backups are enabled by default, with a 1-day retention period.
Maximum retention period is 35 days.
Redshift always attempts to maintain at least 3 copies of your data (the original and replica on the compute nodes and a backup in S3).
Redshift can also asynchronously replicate your snapshots to S3 in another region for disaster recovery.
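The retention limit and cross-region copy described above can be expressed as two parameter builders. This is a minimal sketch, assuming the boto3 Redshift client and its modify_cluster (AutomatedSnapshotRetentionPeriod) and enable_snapshot_copy (DestinationRegion, RetentionPeriod) parameters. The helper names, cluster identifier, and region are hypothetical, and the 1 to 35 day range is enforced locally based on these notes.

```python
from typing import Any, Dict

MAX_RETENTION_DAYS = 35  # maximum retention period from these notes


def _check_days(days: int) -> None:
    """Reject retention periods outside the 1..35 day range used in this sketch."""
    if not 1 <= days <= MAX_RETENTION_DAYS:
        raise ValueError(f"retention must be between 1 and {MAX_RETENTION_DAYS} days")


def retention_params(cluster_id: str, days: int) -> Dict[str, Any]:
    """Arguments for changing the automated snapshot retention period."""
    _check_days(days)
    return {
        "ClusterIdentifier": cluster_id,
        "AutomatedSnapshotRetentionPeriod": days,
    }


def cross_region_copy_params(cluster_id: str, destination_region: str, days: int) -> Dict[str, Any]:
    """Arguments for copying snapshots to another region for disaster recovery."""
    _check_days(days)
    return {
        "ClusterIdentifier": cluster_id,
        "DestinationRegion": destination_region,
        "RetentionPeriod": days,
    }


# Example values are illustrative only.
keep_week = retention_params("demo-cluster", 7)
dr_copy = cross_region_copy_params("demo-cluster", "us-west-2", 7)
```

With boto3, these dictionaries could be unpacked into client.modify_cluster(**keep_week) and client.enable_snapshot_copy(**dr_copy) respectively.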
Encryption
Encrypted in transit using SSL
Encrypted at rest using AES-256
By default, Redshift takes care of key management, but AWS KMS or CloudHSM keys can be used too.
Availability
A cluster is currently only available in 1 AZ.
Can restore snapshots to new AZ in the event of an outage.
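Restoring into a different AZ after an outage might look like the following. This is a sketch under assumptions: it picks the first candidate AZ that is not the failed one and builds arguments using the boto3 restore_from_cluster_snapshot parameter names (ClusterIdentifier, SnapshotIdentifier, AvailabilityZone). The helper name, identifiers, and AZ names are hypothetical.

```python
from typing import Any, Dict, Sequence


def restore_params(
    new_cluster_id: str,
    snapshot_id: str,
    failed_az: str,
    candidate_azs: Sequence[str],
) -> Dict[str, Any]:
    """Build restore arguments that target an AZ other than the failed one."""
    target_az = next((az for az in candidate_azs if az != failed_az), None)
    if target_az is None:
        raise ValueError("no alternative availability zone to restore into")
    return {
        "ClusterIdentifier": new_cluster_id,
        "SnapshotIdentifier": snapshot_id,
        "AvailabilityZone": target_az,
    }


# Example values are illustrative only.
restore = restore_params(
    "demo-restored",
    "demo-snapshot",
    failed_az="us-east-1a",
    candidate_azs=["us-east-1a", "us-east-1b", "us-east-1c"],
)
print(restore["AvailabilityZone"])  # prints us-east-1b
```

With boto3, the dictionary could be passed as client.restore_from_cluster_snapshot(**restore).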