pyarrow: .parquet file that used to work perfectly is now unreadable.

The file was created with pandas a few days ago.

When I try to read the file:

import pandas as pd
pd.read_parquet(filename)

I get:

ArrowIOError: Corrupted file, smaller than file footer

What can cause this corruption? Is there a way to read the file?
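One way to narrow this down is to check the file's framing directly. The Parquet format places the 4-byte magic `PAR1` at the start of the file and ends the file with a 4-byte little-endian footer length followed by `PAR1` again. The sketch below assumes an unencrypted file, and `inspect_parquet_framing` is a hypothetical helper name, not part of pyarrow or pandas. It only reports whether the framing looks intact and does not recover any data.

```python
import os
import struct

PARQUET_MAGIC = b"PAR1"  # magic bytes at both ends of an unencrypted Parquet file


def inspect_parquet_framing(path):
    """Report file size, magic-byte checks, and the footer length the file declares.

    Hypothetical diagnostic helper; it reads only the first 4 and last 8 bytes.
    """
    size = os.path.getsize(path)
    report = {
        "size": size,
        "head_ok": False,
        "tail_ok": False,
        "footer_len": None,
        "footer_fits": False,
    }
    # Minimum framing: 4 (head magic) + 4 (footer length) + 4 (tail magic).
    if size < 12:
        return report
    with open(path, "rb") as f:
        report["head_ok"] = f.read(4) == PARQUET_MAGIC
        f.seek(-8, os.SEEK_END)
        tail = f.read(8)
    report["tail_ok"] = tail[4:] == PARQUET_MAGIC
    # Footer length is stored as an unsigned 32-bit little-endian integer.
    report["footer_len"] = struct.unpack("<I", tail[:4])[0]
    # The declared footer plus the 12 framing bytes must fit inside the file.
    report["footer_fits"] = report["footer_len"] + 12 <= size
    return report
```

If `tail_ok` is false, the end of the file is probably missing (for example, the write was interrupted or the copy was truncated). If both magic checks pass but `footer_fits` is false, the declared footer is larger than the file itself, which seems consistent with the error message above.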

Dror Hilman
  • Possible duplicate of [Pandas cannot read parquet files created in PySpark](https://stackoverflow.com/questions/54201799/pandas-cannot-read-parquet-files-created-in-pyspark) – Sundeep Pidugu Aug 08 '19 at 11:00
  • Can you please open a JIRA issue on https://issues.apache.org/jira/? – Wes McKinney Aug 08 '19 at 17:36

0 Answers