I am trying to execute a query with grouping on 26 columns. Data is stored in S3 in parquet format partitioned by day. Redshift Spectrum query is returning below error. I am not able to find any relevant documentation in aws regarding this.
Request ran out of memory in the S3 query layer
- Total Number of rows in table : 770 Million
- Total size of table in Parquet format : 45 GB
- Number of records in each partition : 4.2 Million
- Million Redshift configuration : Single node dc2.xlarge