In my code I read data as mltable
for the Azure SDK v2
. Then I convert that into pandas DataFrame
, do some work on the data and then I want to save it back as an mltable
in order to use that as a part of my pipeline.
Unfortunately I was unable to find any proper documentation with straightforward answer to that.
My code looks like this:
path = {
'file': f'azureml://subscriptions/{subscription_id}/resourcegroups/{resource_group}/workspaces/{workspace_name}/datastores/my_container/paths/raw_data/my_data.csv'
}
tbl = mltable.from_delimited_files(paths=[path])
df = tbl.to_pandas_dataframe()
At the same time I wonder if I should save it as a URI file
in the Data section in order to use that for my pipeline component? If so I would also be happy to get some inputs on how to convert pandas into URI file
. I found no examples in the official documentation.
Thank you in advance.