I have a Parquet file with a simple schema of a few columns. I read it into Python using the code below:
from fastparquet import ParquetFile
pf = ParquetFile('inout_files.parquet')
This runs fine, but when I convert it to a pandas DataFrame using the code below, I get an error:
df = pf.to_pandas()
The error is:
NotImplementedError: Encoding 4
To find the source of the error, I ran df = pf.to_pandas(columns=col_to_retrieve),
adding the columns one at a time, and noticed that the error is raised by one column in which every cell holds a list of strings (e.g. ("a","b","c")).
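The per-column check described above can be sketched roughly as follows. This is only an illustration: find_failing_columns is a hypothetical helper name, and it assumes the object passed in exposes .columns and .to_pandas(columns=[...]) the way fastparquet's ParquetFile does.

```python
def find_failing_columns(pf):
    """Try to load each column on its own and collect the ones that fail.

    `pf` is assumed to expose `.columns` and `.to_pandas(columns=[...])`,
    as fastparquet's ParquetFile does. The helper name is hypothetical.
    """
    failing = {}
    for col in pf.columns:
        try:
            # Load a single column; only the exception matters here
            pf.to_pandas(columns=[col])
        except NotImplementedError as exc:
            # Keep the message (e.g. "Encoding 4") keyed by column name
            failing[col] = str(exc)
    return failing


# Usage (assumes fastparquet is installed and the file exists):
# from fastparquet import ParquetFile
# pf = ParquetFile('inout_files.parquet')
# print(find_failing_columns(pf))
```

Running this against the file should report only the list-of-strings column, with the other columns loading normally.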
How can I convert the file to pandas, given that it contains a column of type set(string)?