I am trying to understand what the entries in my Spark UI signify.
Calling an action results in creation of a job. I am finding hard to understand
- How many of these jobs get created? and is that proportional to the number of micro-batches?
- What does the Duration column signify?
- What is the effect of setting the batch duration when instantiating the streaming context? Where is that visible in the Spark UI?
new StreamingContext(sparkSession.sparkContext, Seconds(50))