Looking at documentation of awswrangler.s3.to_csv
or awswrangler.s3.to_parquet
, there is a dataset
parameter.
From testing, it looks like setting dataset=True
allows, among other things, to append new data to an already existing set. It also looks like when dataset=True
, I can't specify the file name and AWS autogenerates the names for the files which are added to the specified path
.
Apart from that, I can't find more information on what dataset
means. Is it just referring to the general concept or is there a specific meaning within the context of AWS? What exactly is dataset
and when should it be set to True
?