I am curious about the cluster configuration environment in terms of the ML Training performance of H2O.
If there are three nodes, is there a performance difference between configuring a generic H2O Multi-node Cluster and configuring an H2O Spark Cluster based on Spark?
From our experiments, we conclude that there is no obvious performance difference between the two.
However, many of the H2O documents tell me that H2O Sparkling Water is more effective at ML Training.
- Reference
- H2O Multi-node Cluster: http://docs.h2o.ai/h2o/latest-stable/h2o-docs/starting-h2o.html#flatfile