Questions tagged [amazon-redshift]

Amazon Redshift is a petabyte-scale data warehousing service that lets you analyze data with existing business intelligence tools. Redshift is a column-oriented MPP database based on ParAccel, which was itself based on PostgreSQL.

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. It is optimized for datasets ranging from a few hundred gigabytes to a petabyte or more. Redshift is a column-oriented database based on PostgreSQL 8.0.2.

Source: Amazon Redshift

Although Redshift is based to some extent on PostgreSQL, the two are substantially different. Do not add the postgresql tag to questions involving Amazon Redshift.

8534 questions

1 answer

Are there any benefits to storing data in DynamoDB vs S3 for use with Redshift?

My particular scenario: Expecting to amass TBs or even PBs of JSON data entries which track price history for many items. New data will be written to the data store hundreds or even thousands of times per day. This data will be analyzed by…

amazon-web-services amazon-s3 amazon-dynamodb amazon-redshift amazon-redshift-spectrum

asked Nov 29 '17 at 16:44

Daniel Kobe

3 answers

Copy data from DynamoDB into Redshift across two different AWS accounts?

For reasons beyond my control, I have the following: a table CustomerPhoneNumber in DynamoDB under one AWS account, and a Redshift cluster under a different AWS account (same geographic region; EU). Is there any way to run the COPY command to move data…

amazon-web-services amazon-dynamodb amazon-redshift aws-redshift

asked Nov 28 '17 at 21:28

Ray

1 answer

"Error: invalid input syntax for integer:" when inserting NULL values for a SMALLINT column in a Redshift table?

I have this locally defined python function that works fine when inserting data into a redshift table: def _insert_data(table_name, values_list): insert_base_sql = f"INSERT INTO {table_name} VALUES" insert_sql = insert_base_sql +…

python postgresql amazon-redshift

asked Nov 27 '17 at 23:08

Scott Borden

1 answer

How to make projections with future dates using Redshift

I currently have a table called quantities with the following data:

+------+----------+----------+
| item | end_date | quantity |
+------+----------+----------+
| 1    | 26/11/17 | 100      |
+------+----------+----------+
| 2    | 28/11/17 | 300 …

sql date amazon-redshift future projection

asked Nov 26 '17 at 16:04

Henry

0 answers

Redshift - Load data which has newline in field

amazon-web-services amazon-redshift etl

asked Nov 22 '17 at 13:42

Nirmal Prabhu

2 answers

Group contiguous blocks for aggregation in SQL (Redshift)

I've got a table like this:

    id time activity
 1:  1    1        a
 2:  1    2        a
 3:  1    3        b
 4:  1    4        b
 5:  1    5        a
 6:  2    1        a
 7:  2    2        b
 8:  2    3        b
 9:  2    4        b
10:  2    5…

sql amazon-redshift

asked Nov 21 '17 at 22:11

Gregor Thomas
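
The grouping this question describes can be illustrated outside SQL. The following is a minimal Python sketch, not the asker's or any answerer's solution: it uses only the nine fully visible rows from the excerpt (the truncated tenth row is left out), and the output shape (start time, end time, row count per block) is an assumption, since the desired result is not visible in the excerpt. In SQL this kind of task is commonly approached with window functions, but the sketch below only shows the intended grouping.

```python
from itertools import groupby

# Rows (id, time, activity) copied from the visible part of the excerpt.
# The tenth row is truncated in the source and is deliberately omitted.
rows = [
    (1, 1, "a"), (1, 2, "a"), (1, 3, "b"), (1, 4, "b"), (1, 5, "a"),
    (2, 1, "a"), (2, 2, "b"), (2, 3, "b"), (2, 4, "b"),
]


def contiguous_blocks(rows):
    """Collapse consecutive rows that share (id, activity) into one block.

    Returns tuples of (id, activity, first_time, last_time, row_count).
    The output columns are an assumption made for illustration.
    """
    # Order by id, then time, so that "consecutive" is well defined.
    ordered = sorted(rows, key=lambda r: (r[0], r[1]))
    blocks = []
    # groupby starts a new group whenever the key changes, which is
    # exactly the "contiguous run" behaviour wanted here.
    for (id_, activity), run in groupby(ordered, key=lambda r: (r[0], r[2])):
        times = [r[1] for r in run]
        blocks.append((id_, activity, times[0], times[-1], len(times)))
    return blocks


blocks = contiguous_blocks(rows)
for block in blocks:
    print(block)
```

Note that activity "a" for id 1 yields two separate blocks (times 1 to 2 and time 5), which is what distinguishes this from a plain GROUP BY on id and activity.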

0 answers

Redshift DELETE using slow Hash Join while equivalent SELECT uses Merge Join

We are using the recommended method defined here for performing "upserts": http://docs.aws.amazon.com/redshift/latest/dg/merge-replacing-existing-rows.html It is taking almost two minutes to load a file of just 150 rows. Almost all of this time is…

amazon-redshift

asked Nov 20 '17 at 22:52

Jared Gommels

0 answers

Failed to run connection test on Redshift endpoint in AWS DMS service

I have created a Redshift endpoint in AWS DMS service. When I run the test connection I get the following error message: Error Details: [errType=CALL_SERVER_ERROR, status=0, errMessage=Failed executing command on Replication Server,…

amazon-redshift aws-dms

asked Nov 09 '17 at 17:53

Cyrus

2 answers

using regular expressions in redshift

This query works in mysql but I am not sure how to write the same query in redshift / postgresql. update customer_Details set customer_No = NULL WHERE customer_No NOT REGEXP '^[[:digit:]]{12}$'

sql regex amazon-redshift

asked Nov 09 '17 at 16:51

shantanuo
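
To make the intent of the MySQL pattern concrete, here is a small Python sketch of the same rule: keep customer_No only when it is exactly twelve digits, otherwise set it to NULL (None here). The function name cleaned_customer_no is hypothetical, and this illustrates the matching logic only; it is not a Redshift query. Redshift does document POSIX-style pattern operators (~ and !~), which is probably where an equivalent statement would start, but that should be checked against the Redshift documentation.

```python
import re

# Python equivalent of the MySQL pattern '^[[:digit:]]{12}$'.
# [0-9] is used instead of \d so that only ASCII digits match.
TWELVE_DIGITS = re.compile(r"[0-9]{12}")


def cleaned_customer_no(value):
    """Return value if it is exactly 12 digits, otherwise None (SQL NULL)."""
    # fullmatch anchors both ends, so no ^ or $ is needed, and a
    # trailing newline does not slip through as it could with $.
    if value is not None and TWELVE_DIGITS.fullmatch(value):
        return value
    return None


samples = ["123456789012", "12345", "12345678901a", None]
for sample in samples:
    print(repr(sample), "->", repr(cleaned_customer_no(sample)))
```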

2 answers

Filter data based on a condition in Redshift

I came across one more issue while resolving the previous problem: So, I have this data: For each route -> I want to get only those rows where ob exists in rb. Hence, this output: I know this also needs to be worked through a temp table. Earlier I…

sql amazon-redshift

asked Nov 09 '17 at 04:47

Rishabh Verma

2 answers

Extracting Time from Timestamp in SQL

I am using Redshift and am looking to extract the time from the timestamp. Here is the timestamp: 2017-10-31 23:30:00 and I would just like to get the time as 23:30:00 Is that possible?

sql amazon-redshift

asked Nov 02 '17 at 07:07

user8659376
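
As a reference for the expected result, the transformation asked about here can be sketched in Python using the timestamp from the question. This shows only the input and output the asker describes; the helper name time_part is hypothetical and the snippet says nothing about which Redshift function to use.

```python
from datetime import datetime


def time_part(timestamp_text):
    """Return the HH:MM:SS part of a 'YYYY-MM-DD HH:MM:SS' timestamp string."""
    parsed = datetime.strptime(timestamp_text, "%Y-%m-%d %H:%M:%S")
    return parsed.time().isoformat()


print(time_part("2017-10-31 23:30:00"))  # prints 23:30:00
```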

1 answer

Getting 0 rows while querying external table in redshift

We created the schema as follows: create external schema spectrum from data catalog database 'test' iam_role 'arn:aws:iam::20XXXXXXXXXXX:role/athenaaccess' create external database if not exists; and table as follows: create external table…

amazon-web-services amazon-redshift amazon-redshift-spectrum

asked Oct 31 '17 at 13:13

Harshinee.R

3 answers

Signature of the Redshift internal "identity" function

While working on a legacy Redshift database I discovered an unfamiliar pattern for default identity values for an autoincrement column. E.g.: create table sometable (row_id bigint default "identity"(24078855, 0, '1,1'::text), ... And surprisingly I…

amazon-redshift

asked Oct 31 '17 at 08:34

Boris Uvarov

2 answers

Redshift large 'in' clause best practices

We have a query in which a list of parameter values is provided in the "IN" clause of the query. Some time back this query failed to execute because the data in the "IN" clause had grown quite large, and hence the resulting query exceeded the 16 MB limit of the…

amazon-redshift

asked Oct 28 '17 at 08:50

Gagan Maheshwari

1 answer

Amazon Redshift Sum window function on group

Is there a way to use the sum window function to get the following results in green? I can get the total by using the following, but it's giving a running total; I am looking for a group total: sum(groupvolume) OVER ( PARTITION BY geo_group ORDER…

sql select sum amazon-redshift

asked Oct 26 '17 at 16:53

warrior_z
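
The difference between the running total the asker gets and the group total they want can be shown with a short Python sketch. The names geo_group and groupvolume come from the excerpt; the sample values are hypothetical, and the sketch assumes the truncated window expression orders rows within each geo_group and accumulates up to the current row. A group total corresponds to summing over the whole partition instead.

```python
from collections import defaultdict

# Hypothetical sample rows: (geo_group, groupvolume).
rows = [("emea", 10), ("emea", 20), ("apac", 5), ("apac", 15), ("apac", 30)]


def running_totals(rows):
    """Sum per group accumulated row by row, like an ordered window."""
    seen = defaultdict(int)
    out = []
    for group, volume in rows:
        seen[group] += volume
        out.append((group, volume, seen[group]))
    return out


def group_totals(rows):
    """Attach the full per-group sum to every row of that group."""
    totals = defaultdict(int)
    for group, volume in rows:
        totals[group] += volume
    return [(group, volume, totals[group]) for group, volume in rows]


print(running_totals(rows))
print(group_totals(rows))
```

In the first result the third column grows within each group; in the second it is constant per group, which matches the "group total" described in the question.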
