Questions tagged [amazon-redshift]

Amazon Redshift is a petabyte-scale data warehousing service using existing business intelligence tools to analyze the data. Redshift is a column-oriented MPP database based on ParAccel and ParAccel was itself based on PostgreSQL.

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. It is optimized for datasets ranging from a few hundred gigabytes to a petabyte or more. Redshift is a column-oriented database based on PostgreSQL 8.0.2

Source: Amazon Redshift

Although Redshift to some extent is based on PostgreSQL they are substantially different. Do not add the postgresql tag to questions involving Amazon Redshift

Related Tags

8534 questions
2
votes
1 answer

Is there an equivalent to Redshift's ACCEPTINVCHARS option for an INSERT INTO statement?

There are many online articles that talk about solving the following error: String contains invalid or unsupported UTF-8 codepoints. Bad UTF-8 hex sequence: a4 (error 3) Unfortunately they talk about solving this issue for the COPY command, however…
Adel Helal
  • 642
  • 1
  • 9
  • 20
2
votes
2 answers

Is there a way to change the datatype for a column without changing the order of the column?

I have a column where I want to change the data type. I currently am using Redshift. I know I can use the alter table statement to change the datatype, but this would change the order of the columns. Is there a way to change the datatype without…
user8659376
  • 369
  • 4
  • 8
  • 19
2
votes
1 answer

Azure C# App Service ODBC connection to Redshift

I have developed an Azure Api Service (Rest API) that needs to connect occasionally to a aws Redshift database. This is working fine on my development environment (once I installed the Redshift ODBC driver, and created an DSN entry) however fails…
Mark Ruse
  • 387
  • 1
  • 4
  • 12
2
votes
0 answers

Error on redshift UNLOAD command

I am trying to UNLOAD a Redshift table to an S3 bucket, but I am getting errors that I can't resolve. When using 's3://mybucket/' as the destination (which is the documented way to specify the destination), I have an error saying…
Thierry J.
  • 2,148
  • 16
  • 23
2
votes
1 answer

Are Redshift system tables immutable and well ordered?

Redshift system tables only story a few days of logging data - periodically backing up rows from these tables is a common practice to collect and maintain proper history. To find new rows added in to system logs I need to check against my backup…
Kirk Broadhurst
  • 27,836
  • 16
  • 104
  • 169
2
votes
0 answers

Redshift External tables via Hive metastore

I've a redshift DB setup and we do periodic archival of the data into S3. I would like to create redshift external tables on top of these archived files. AWS documentation suggests that this can be done either via athena or via hive metastore. Since…
skrisshnaswamy
  • 149
  • 1
  • 9
2
votes
3 answers

AWS Glue returning error while running job

I want to generate parquet file format from the data stored on aws redshift for which I have created connection and crawler to fetch the data information. All the database catalogs have been fetched successfully but while running job to fetch the…
Anant Goswami
  • 318
  • 3
  • 14
2
votes
2 answers

How to process data faster in python from redshift?

i am new to python .. i have my data in redshift and i want to process data faster in python. i am using python because i want to run various algorithms and also do various calculations on these data and this is not possible in redshift . i saw…
2
votes
3 answers

Division between data in rows - SQL

The data in my table looks like this: date, app, country, sales 2017-01-01,XYZ,US,10000 2017-01-01,XYZ,GB,2000 2017-01-02,XYZ,US,30000 2017-01-02,XYZ,GB,1000 I need to find, for each app on a daily basis, the ratio of US sales to GB sales, so…
Craig
  • 1,929
  • 5
  • 30
  • 51
2
votes
1 answer

[Amazon](500310) Invalid operation: Assert

I am using spark-redshift and querying redshift data using pyspark for processing. The query works fine if i run on redshift using workbench etc.But spark-redshift unloads data to s3 and then retrieves it and it is throwing the following error when…
2
votes
2 answers

How to optimize AWS DMS MySql Aurora to Redshift replication?

I've been using AWS DMS to perform ongoing replication from MySql Aurora to Redshift. However, the ongoing replication is causing constant 25-30% CPU load on the target. This is because it produces many small files on S3 and loads/processes them…
2
votes
1 answer

Creating Table in AWS Redshift with AWS Java SDK

I am trying to create a temporary table inside of AWS redshift using the java SDK. // Redshift JDBC 4.1 driver: com.amazon.redshift.jdbc41.Driver String command = "CREATE TABLE test (FirstName varchar(255));" …
cwiens
  • 21
  • 4
2
votes
0 answers

Redshift change user password from workbench/j

I have a weird problem when trying to change Redshift user password using SQL workbench/j. Here is the command I am running: alter user someUser password 'someNewPassword'; It returns response "ALTER USER someUser successful" However, when I try to…
Tofig Hasanov
  • 3,303
  • 10
  • 51
  • 81
2
votes
5 answers

is there is a way to automate the redshift vaccum process through udf?

I have more that 300+ table in redshift. Data is getting update daily basic just want to know can i create a udf in redshift to automate the vaccum process. I found a link automate using python but not that great python coder i am so looking for…
2
votes
1 answer

loading a pg_dump off of s3 into redshift

I'm trying to load a complete database dump into Redshift. Is there a single command to restore the data from a pg_dump living on s3 into Redshift? If not, what are the best steps for tackling this? Thanks
nkramer52
  • 53
  • 6