I have read through Dask-ML documentation and I have googled around, but have 2 questions I would like some clarity on if anyone could assists:
- by saying "using a cluster of computers", does it mean processing is distributed across other people's computers who are connected to the network? If yes, would there be any privacy issues with this?
- would we be able to know how many computers are there in a cluster at any point in time?
Please point me to the right direction or if there are any link that I could read more about this. Thanks in advance.