I have two (or more) parallel text files stored in S3 - i.e. line 1 in first file corresponds to line 1 in second file etc. I want to read these files as columns into a single dask dataframe. What would be the best/easiest/fastest way to do it?
PS. I can read each of them into a separate dataframe, but then I cannot join them on index because dataframe index values seem to be neither unique nor monotonic. At the same time the correspondence of lines is defined by their position in each file.