Running a spark(scala) job on HDP cluster. However every time the job executes(both client and cluster mode) a parallel Tez session is also created and application is submitted to YARN. As part of spark job the, couple of sql jobs which are being executed on the cluster are done using 'SparkSession.spark.sql'.
The Tez session is created before stage 0, task 0 as per YARN logs. Trying to understand why 2 jobs are running on RM, everytime spark-submit is called.
Checked already - No explicit connections made to Hive.
Any leads would be appreciated.