I'm experimenting with Dask by running a local cluster with four workers on my laptop.
I scatter a pandas DataFrame to all the workers, but when I submit a function that processes it, the dashboard shows that only one worker is actually used.
What am I missing?
Here is the code:
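from dask.distributed import Client
# Connect to the scheduler of the already running local cluster
client = Client('127.0.0.1:56947')
# Copy the whole DataFrame to every worker (df is a pandas DataFrame)
scattered_df = client.scatter(df, broadcast=True)
# Submit process_df with the scattered DataFrame as its argument
r = client.submit(process_df, scattered_df)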