I have jobs running on job clusters, and I want to send their metrics to CloudWatch. I set up the CloudWatch agent by following this guide.
The problem is that I can't build useful metric dashboards or alarms, because every metric carries an InstanceId dimension, and InstanceId is different on every job run. The guide linked above includes an init script, and part of the JSON that configures the CloudWatch agent is:
{
  ...
  "append_dimensions": {
    "InstanceId": "${aws:InstanceId}"
  }
The documentation says that if I remove append_dimensions, the hostname becomes a dimension instead. But the hostname contains the IP address, which is also different on every run, so that is not much use either.
Does anyone have experience with monitoring and alerting for Databricks on AWS using CloudWatch? If so, how did you resolve this issue?
I would like to set a dimension that is specific to each executor and stays the same on every run.
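
To make the goal concrete, here is a minimal sketch of the kind of agent config I have in mind, generated from Python. It is not taken from the guide and I have not confirmed it on a job cluster. It assumes that the agent accepts custom key/value pairs in a per-plugin append_dimensions block, that omit_hostname in the agent section drops the hostname dimension, and that Databricks sets DB_IS_DRIVER for init scripts. JOB_NAME is a hypothetical environment variable that would have to be set in the job's cluster spec, and build_agent_config is just an illustrative helper name.

```python
import json
import os


def build_agent_config(job_name: str, role: str) -> dict:
    """Build a CloudWatch agent config whose dimensions do not change between runs."""
    # Custom dimensions chosen to be stable across job runs (illustrative names).
    stable_dims = {"JobName": job_name, "Role": role}
    return {
        # Assumption: omit_hostname stops the agent from adding the per-run hostname.
        "agent": {"omit_hostname": True},
        "metrics": {
            # No global append_dimensions here, so InstanceId is not attached.
            "metrics_collected": {
                "cpu": {
                    "measurement": ["cpu_usage_idle"],
                    "totalcpu": True,
                    "append_dimensions": dict(stable_dims),
                },
                "mem": {
                    "measurement": ["mem_used_percent"],
                    "append_dimensions": dict(stable_dims),
                },
            },
        },
    }


if __name__ == "__main__":
    # JOB_NAME is hypothetical: it must be set by whoever defines the job cluster.
    job_name = os.environ.get("JOB_NAME", "unknown-job")
    # Assumption: DB_IS_DRIVER is "TRUE" on the driver node inside an init script.
    is_driver = os.environ.get("DB_IS_DRIVER", "").upper() == "TRUE"
    role = "driver" if is_driver else "worker"
    print(json.dumps(build_agent_config(job_name, role), indent=2))
```

With this layout every worker reports under the same Role=worker dimension, so CloudWatch would treat all workers of a job as one metric. That is fine for job-level alarms, but a true per-executor dimension would still need some stable worker index, and I don't know of one that is available to the init script.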