My Spark Streaming
application has a cycle of 5 minutes. Since past 2 months, there is recurrent pattern seen on the Driver Node
. Presenting below snapshot of the performance metrics
as seen from ganglia
. The same behavior is seen in past weeks too.
- Setup : A
driver
with twoexecutors
(8G, 10 cores) onEMR
- Spark Version: 1.5.2
- GC :
CMS
for both driver & executor
Important: The application was started on Saturday as shown in the charts.
Can there be any explanation to this behavior? If I need to investigate, what observational pointers can I derive from this snapshot?