I have an xlsx
file File.xlsx
, it has more than 100 sheets.
I need to perform a certain function f()
on each sheet's data and finally return a list of each sheet's operated appended together.
I tried using pandas, reading each sheet's data one by one and then applying the function on it and appending in a list. Which is taking a lot of time. Need to the reduce the operation time.
How can I make the sheet wise execution in parallel? Can i use DASK aor anything else?
Note: need to jsonify each sheet's data so using df.to_json()