0

Given a default DataProc cluster, are there any configurations to avoid overloading the job queue with too many tasks on the YARN side?

For instance, if a spike of job submits occur, is there a way to force the cluster to honor a concurrency, such that the entire spike of jobs doesn't deplete/crash the YARN master?

deepelement
  • 2,457
  • 1
  • 25
  • 25

1 Answers1

2

As @igor-dvorzhak from Google mentioned, the resolution for this is https://stackoverflow.com/a/49693693/1195652

Ref: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.3/bk_system-admin-guide/content/setting_application_limits.html

deepelement
  • 2,457
  • 1
  • 25
  • 25