I want to read a selected list of Parquet files from AWS S3. I know how to read all files in a directory using *parquet or just one single file by specifying just that key. However I would like to read only a specific list of files based on some prior user input.
Is this possible?
The following code is from their API Docs but does not address my requirement:
import dask.dataframe as dd
df = dd.read_parquet('s3://bucket/path/to/data-*.parque')
(OR)
df = dd.read_parquet('s3://bucket/path/to/file.parque')
Is there a way to pass in a list of target files in the read_parquet parameters instead?