If anyone has experimented with the pandas-profiling package, I'd appreciate any insights on making it run faster. The output report is very neat and detailed, but generating it takes far too long, even with a moderately sized dataset: 9 columns and about 400K rows from the Kaggle bulldozers dataset took over 21 minutes (CPU only, no GPU). I'm wondering if it's worth investigating further.
df.shape
(401125, 9)
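import datetime
from pathlib import Path
import pandas_profiling  # importing this registers df.profile_report on DataFrames
start = datetime.datetime.now()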
profile = df.profile_report(title="Exploring Dataset")
profile.to_file(output_file=Path("./data_report.html"))
end = datetime.datetime.now()
print(end-start)
0:21:23.976324
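For anyone who wants to try the two obvious knobs, here is a rough sketch, not a verified fix: profile a random sample instead of all rows, and turn on the package's minimal mode, which skips some of the more expensive computations such as correlations. It assumes a pandas-profiling version that accepts `minimal=True` (2.4 or later, as far as I know). The helper names `sample_rows` and `build_report` and the 50,000-row cap are made up for illustration. I have not timed this on the bulldozers data.

```python
import time
from pathlib import Path


def sample_rows(df, max_rows=50_000, seed=0):
    """Return df unchanged if it is small enough, else a random sample of max_rows rows."""
    if len(df) <= max_rows:
        return df
    # Fixed seed so repeated runs profile the same rows.
    return df.sample(n=max_rows, random_state=seed)


def build_report(df, output_file="data_report.html", max_rows=50_000):
    """Profile a (possibly sampled) DataFrame in minimal mode; return elapsed seconds."""
    # Imported lazily so sample_rows can be used without pandas-profiling installed.
    from pandas_profiling import ProfileReport

    start = time.perf_counter()
    # Assumption: minimal=True is supported by the installed version.
    profile = ProfileReport(
        sample_rows(df, max_rows),
        title="Exploring Dataset",
        minimal=True,
    )
    profile.to_file(output_file=Path(output_file))
    return time.perf_counter() - start
```

Calling `build_report(df)` writes the HTML report and returns the elapsed time, so it can be compared directly against the 21-minute run. The trade-off is that statistics come from a sample, so rare values and exact counts may differ from the full dataset.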