0

Working with Livy on the Hortonworks Sandbox we found the problem that on some cases the Livy API provide a result for the statement before the actual work has ended as indicated by the Spark monitoring tool. In some cases Livy provide a result in just less than 5 seconds for a job with normal execution time of more than 10 minutes. If we repeat the same task with the same initial conditions more than once on some cases we have seen this problem happen. Any idea?

yeiniel
  • 2,416
  • 15
  • 31

1 Answers1

0

Just check if you are using the same Livy session for multiple jobs or not. If same session is used then intermediate Spark DataSet is getting cached and results are retrieved faster.

dassum
  • 4,727
  • 2
  • 25
  • 38