I have files with .snappy.parquet extension that I need to read into my Jupyter notebook, and convert it to pandas dataframe.
import numpy
import pyarrow.parquet as pq
filename = "part-00000-tid-2430471264870034304-5b82f32f-de64-40fb-86c0-fb7df2558985-1598426-1-c000.snappy.parquet"
df = pq.read_table(filename).to_pandas()
The error is:
ArrowNotImplementedError: lists with structs are not supported