I want to use the GlueMetaStore with spark.sql.
For that I configured the EMR-Cluster (5.16) and set the following configurations:
{
"Classification":"hive-site",
"ConfigurationProperties":
{
"hive.metastore.client.factory.class":"com.amazonaws.glue.catalog.metastore.AWSGlueDataCatalogHiveClientFactory"
},
"Configurations":[]
},
{
"Classification":"spark-hive-site",
"ConfigurationProperties":
{
"hive.metastore.client.factory.class":"com.amazonaws.glue.catalog.metastore.AWSGlueDataCatalogHiveClientFactory"
},
"Configurations":[]
}
I used the spark-core to query the databases in JupyterHub, but I just get the default database which is empty. There should be a lot of more database when it works.
Do I need to enbaleHiveSupport or something simular to get the connection working, if yes how can I set it in JupterHub, because the context is already loaded?