I am using PySpark 3.0.1 to generate parquet files.
When executing the following command
sparkDF.write.mode("overwrite").parquet(file_name)
The 9999-12-31 00:00:00.0000000
datetime is written as 1816-03-29 11:56:08.066277376
in the parquet file.
The 0001-01-01 00:00:00.0000000
datetime is written as 1754-08-29 04:43:41.128654848
in the parquet file.
In contrast, sparkDF.write.mode("overwrite").csv(file_name)
outputs the correct datetime value in CSV format.
Does anybody know what is going on? Thanks.