Right now I can submit Spark jobs through Livy, which runs them with the spark-submit command. That command includes a --proxy-user livy parameter so that Livy can use impersonation when it runs spark-submit. However, I want to know how to do this without the --proxy-user parameter, because the company has thousands of Hadoop nodes and we are not allowed to create new users on them. (For testing, we had to create a livy user with adduser livy on all the worker nodes for the --proxy-user parameter to work; otherwise it would fail with an error saying the livy user was not found.)
I am currently submitting my Spark job via Livy through a REST API POST call, following this documentation (https://livy.incubator.apache.org/docs/latest/rest-api.html). The POST section of the documentation lists a proxyUser parameter, and that is what produces the --proxy-user parameter in the spark-submit command. I need a way to not use proxyUser, but I cannot find a way to disable it. If I don't specify the proxyUser parameter in the POST call, --proxy-user livy is still generated in the spark-submit command automatically, and I don't want that.
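For reference, here is a minimal sketch of the kind of POST call described above, with proxyUser left out of the body. It assumes the batch endpoint (POST /batches) from the linked documentation; the host name, jar path, and class name are placeholders, and the helper functions are hypothetical names used only for illustration.

```python
import json
import urllib.request

# Hypothetical Livy endpoint; 8998 is Livy's default port.
LIVY_URL = "http://livy-host:8998"


def build_batch_payload(file, class_name=None, args=None, proxy_user=None):
    """Build the JSON body for POST /batches, leaving out unset optional fields."""
    payload = {"file": file}
    if class_name is not None:
        payload["className"] = class_name
    if args:
        payload["args"] = list(args)
    # proxyUser is only sent when the caller explicitly asks for it.
    if proxy_user is not None:
        payload["proxyUser"] = proxy_user
    return payload


def build_batch_request(payload, base_url=LIVY_URL):
    """Create (but do not send) the POST request for the Livy batches endpoint."""
    return urllib.request.Request(
        url=f"{base_url}/batches",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


payload = build_batch_payload(
    file="hdfs:///path/to/app.jar",  # placeholder application jar
    class_name="com.example.Main",   # placeholder main class
)
request = build_batch_request(payload)
# To actually submit the job: urllib.request.urlopen(request)
```

Even when the body contains no proxyUser key, as in this sketch, the spark-submit command that Livy generates still ends up with --proxy-user livy.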
Does anyone know how to disable the proxyUser parameter?