Amazon Redshift Database Developer Guide

User Manual:

Open the PDF directly: View PDF PDF .
Page Count: 1006 [warning: Documents this large are best viewed by clicking the View PDF Link!]

Scroll down to view the document on your mobile browser.

Ad

Amazon Redshift
Table of Contents
Welcome
- Are You a First-Time Amazon Redshift User?
- Are You a Database Developer?
- Prerequisites
Amazon Redshift System Overview
- Data Warehouse System Architecture
- Performance
- Columnar Storage
- Internal Architecture and System Operation
- Workload Management
- Using Amazon Redshift with Other Services
Getting Started Using Databases
- Step 1: Create a Database
- Step 2: Create a Database User
  - Delete a Database User
- Step 3: Create a Database Table
  - Insert Data Rows into a Table
  - Select Data from a Table
- Step 4: Load Sample Data
- Step 5: Query the System Tables
- Step 6: Cancel a Query
  - Cancel a Query from Another Session
  - Cancel a Query Using the Superuser Queue
- Step 7: Clean Up Your Resources
Building a Proof of Concept for Amazon Redshift
- Identifying the Goals of the Proof of Concept
- Setting Up Your Proof of Concept
  - Designing and Setting Up Your Cluster
  - Converting Your Schema and Setting Up the Datasets
- Cluster Design Considerations
- Amazon Redshift Evaluation Checklist
- Benchmarking Your Amazon Redshift Evaluation
- Additional Resources
Amazon Redshift Best Practices
- Amazon Redshift Best Practices for Designing Tables
- Amazon Redshift Best Practices for Loading Data
- Amazon Redshift Best Practices for Designing Queries
- Working with Recommendations from Amazon Redshift Advisor
  - Viewing Amazon Redshift Advisor Recommendations in the Console
  - Amazon Redshift Advisor Recommendations
Tutorial: Tuning Table Design
- Prerequisites
- Steps
- Step 1: Create a Test Data Set
  - To Create a Test Data Set
  - Next Step
- Step 2: Test System Performance to Establish a Baseline
  - To Test System Performance to Establish a Baseline
  - Next Step
- Step 3: Select Sort Keys
  - To Select Sort Keys
  - Next Step
- Step 4: Select Distribution Styles
- Step 5: Review Compression Encodings
  - To Review Compression Encodings
  - Next Step
- Step 6: Recreate the Test Data Set
  - To Recreate the Test Data Set
  - Next Step
- Step 7: Retest System Performance After Tuning
  - To Retest System Performance After Tuning
  - Next Step
- Step 8: Evaluate the Results
  - Next Step
- Step 9: Clean Up Your Resources
  - Next Step
- Summary
  - Next Step
Tutorial: Loading Data from Amazon S3
- Prerequisites
- Overview
- Steps
- Step 1: Launch a Cluster
  - Next Step
- Step 2: Download the Data Files
  - Next Step
- Step 3: Upload the Files to an Amazon S3 Bucket
  - Next Step
- Step 4: Create the Sample Tables
  - Next Step
- Step 5: Run the COPY Commands
  - COPY Command Syntax
  - Loading the SSB Tables
- Step 6: Vacuum and Analyze the Database
  - Next Step
- Step 7: Clean Up Your Resources
  - Next
- Summary
  - Next Step
Tutorial: Configuring Workload Management (WLM) Queues to Improve Query Processing
- Overview
  - Prerequisites
  - Sections
- Section 1: Understanding the Default Queue Processing Behavior
- Section 2: Modifying the WLM Query Queue Configuration
- Section 3: Routing Queries to Queues Based on User Groups and Query Groups
- Section 4: Using wlm_query_slot_count to Temporarily Override Concurrency Level in a Queue
  - Step 1: Override the Concurrency Level Using wlm_query_slot_count
    - To Override the Concurrency Level Using wlm_query_slot_count
  - Step 2: Run Queries from Different Sessions
    - To Run Queries from Different Sessions
- Section 5: Cleaning Up Your Resources
Tutorial: Querying Nested Data with Amazon Redshift Spectrum
- Overview
  - Prerequisites
- Step 1: Create an External Table That Contains Nested Data
- Step 2: Query Your Nested Data in Amazon S3 with SQL Extensions
- Nested Data Use Cases
- Nested Data Limitations
Managing Database Security
- Amazon Redshift Security Overview
- Default Database User Privileges
- Superusers
- Users
  - Creating, Altering, and Deleting Users
- Groups
  - Creating, Altering, and Deleting Groups
- Schemas
- Example for Controlling User and Group Access
Designing Tables
- Choosing a Column Compression Type
- Choosing a Data Distribution Style
- Choosing Sort Keys
- Defining Constraints
- Analyzing Table Design
Using Amazon Redshift Spectrum to Query External Data
- Amazon Redshift Spectrum Overview
  - Amazon Redshift Spectrum Regions
  - Amazon Redshift Spectrum Considerations
- Getting Started with Amazon Redshift Spectrum
- IAM Policies for Amazon Redshift Spectrum
- Creating Data Files for Queries in Amazon Redshift Spectrum
- Creating External Schemas for Amazon Redshift Spectrum
  - Working with Amazon Redshift Spectrum External Catalogs
- Creating External Tables for Amazon Redshift Spectrum
- Improving Amazon Redshift Spectrum Query Performance
- Monitoring Metrics in Amazon Redshift Spectrum
- Troubleshooting Queries in Amazon Redshift Spectrum
Loading Data
- Using a COPY Command to Load Data
- Updating Tables with DML Commands
- Updating and Inserting New Data
- Performing a Deep Copy
- Analyzing Tables
- Vacuuming Tables
- Managing Concurrent Write Operations
Unloading Data
- Unloading Data to Amazon S3
- Unloading Encrypted Data Files
- Unloading Data in Delimited or Fixed-Width Format
- Reloading Unloaded Data
Creating User-Defined Functions
- UDF Security and Privileges
- Creating a Scalar SQL UDF
  - Scalar SQL Function Example
- Creating a Scalar Python UDF
- Naming UDFs
  - Overloading Function Names
  - Preventing UDF Naming Conflicts
- Logging Errors and Warnings in UDFs
Tuning Query Performance
- Query Processing
- Analyzing and Improving Queries
- Troubleshooting Queries
Implementing Workload Management
- Defining Query Queues
- WLM Query Queue Hopping
- Short Query Acceleration
  - Maximum Run Time for Short Queries
  - Monitoring SQA
- Modifying the WLM Configuration
- WLM Queue Assignment Rules
  - Queue Assignments Example
- Assigning Queries to Queues
- WLM Dynamic and Static Configuration Properties
  - WLM Dynamic Memory Allocation
  - Dynamic WLM Example
- WLM Query Monitoring Rules
- WLM System Tables and Views
SQL Reference
- Amazon Redshift SQL
  - SQL Functions Supported on the Leader Node
    - Examples
  - Amazon Redshift and PostgreSQL
- Using SQL
- SQL Commands
- SQL Functions Reference
- Reserved Words
System Tables Reference
- System Tables and Views
- Types of System Tables and Views
- Visibility of Data in System Tables and Views
  - Filtering System-Generated Queries
- STL Tables for Logging
- STV Tables for Snapshot Data
- System Views
- System Catalog Tables
Configuration Reference
- Modifying the Server Configuration
- analyze_threshold_percent
- datestyle
- describe_field_name_in_uppercase
- enable_result_cache_for_session
  - Values (Default in Bold)
  - Description
- extra_float_digits
  - Values (Default in Bold)
  - Description
- max_cursor_result_set_size
  - Values (Default in Bold)
  - Description
- query_group
  - Values (Default in Bold)
  - Description
- search_path
- statement_timeout
- timezone
- wlm_query_slot_count
Sample Database
- CATEGORY Table
- DATE Table
- EVENT Table
- VENUE Table
- USERS Table
- LISTING Table
- SALES Table
Appendix: Time Zone Names and Abbreviations
- Time Zone Names
- Time Zone Abbreviations
Document History
- Earlier Updates

Navigation menu