I have datasets in HDFS which is in parquet format with snappy as compression codec. As far as my research goes, currently Redshift accepts only plain text, json, avro formats with gzip, lzo compression codecs.
Alternatively, i am converting the parquet format to plain text and changing the snappy codec to gzip using a Pig script.
Is there currently a way to load data directly from parquet files to Redshift?