Is there any performance benefit resulting from the usage of using nested data types in the Parquet file format?
AFAIK Parquet files are usually created specifically for query services e.g. Athena, so the process which creates those might as well simply flatten the values - thereby allowing easier querying, simpler schema, and retaining the column statistics for each column.
What benefit is there to be gained by using nested data types e.g. struct
?