By default, Cloud Dataproc runs a Hive Metastore local to the Dataproc cluster. This means:
The metastore is ephemeral with the cluster
It can be a pain to have multiple clusters using a single metastore
Is it possible to point Dataproc clusters to a single Hive metastore? Is it also possible to have the metastore live outside the cluster so running a cluster for a metastore is not required?