Is it possible to use pandas to selectively read rows from Parquet files using the Parquet column index?
Similarly, when writing a pandas DataFrame to a Parquet file, for example with pd.DataFrame.to_parquet(), is it possible to specify the DataFrame column or index level to be used as the Parquet column index?
I am hoping that using the Parquet index can speed up reads and writes.
Currently using fastparquet 0.4.0, pandas 1.0.3, and Python 3.8.3.