I am importing 4000+ csv files all with the same columns, columns=['Date', 'Datapint']
the importing the csv's to dask is pretty straight forward and is working fine for me.
file_paths = '/root/data/daily/'
df = dd.read_csv(file_paths+'*.csv',
delim_whitespace=True,
names=['Date','Datapoint'])
The task I am trying to achive is to be able to name the 'Datapoint'
column the filename of the .csv. I know you can set a column to the path using include_path_column = True
. But I am wondering if there is a simple way use that pathname as a column name with out having to run a separate step down the line.