I have Parquet files on S3 that need to be queried using S3 Select. The Parquet files are generated from JSON files with inner arrays. The S3 Select query can get the first array, but when I try to query the records in the inner array it fails to return the IDs, saying it's an invalid data source.

What I tried:

Looking up the Amazon documentation, which proved to be of no use
Trying multiple forms of the S3 Select query

JSON structure

  {
     "Array": [
        {
          "Id": "1"
        },
        {
          "Id": "2"
        }
     ]
   }

Query

  select s.Array[*].id from s3object s

I expect the query to return all the IDs, so it should return Id 1 and 2.

1 Answer

The following query returns the IDs in the array (note that the limit 5 clause caps the result at five records):

  select s.Id from S3Object[*].Array[*] s limit 5
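For anyone running this from code, here is a minimal sketch of how the query might be sent with boto3's select_object_content call. The helper names (build_select_request, parse_records, select_ids) and the bucket/key values are hypothetical and chosen only for illustration, and it assumes the object is a Parquet file whose output is requested as newline-delimited JSON. It has not been checked against the asker's data.

```python
import json


def build_select_request(bucket, key, expression):
    """Build keyword arguments for the S3 client's select_object_content call."""
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": expression,
        # Assumption: the object is Parquet; results come back as one JSON object per line.
        "InputSerialization": {"Parquet": {}},
        "OutputSerialization": {"JSON": {"RecordDelimiter": "\n"}},
    }


def parse_records(payload_chunks):
    """Join raw bytes from the Records events and decode one JSON object per line."""
    # Chunks are joined first because a record can be split across two events.
    text = b"".join(payload_chunks).decode("utf-8")
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def select_ids(bucket, key):
    """Run the query from this answer (without the limit) and return the decoded rows."""
    import boto3  # imported here so the helpers above can be used without boto3 installed

    client = boto3.client("s3")
    request = build_select_request(
        bucket, key, "select s.Id from S3Object[*].Array[*] s"
    )
    response = client.select_object_content(**request)
    # The response payload is an event stream; only Records events carry data.
    chunks = [
        event["Records"]["Payload"]
        for event in response["Payload"]
        if "Records" in event
    ]
    return parse_records(chunks)
```

Calling select_ids("my-bucket", "path/to/file.parquet") with your own bucket and key (both placeholders here) would then return a list of dictionaries, one per matched record, provided the query is accepted for your file.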

Dharman
Sandeep Rao