I'm testing Bosun (open-source monitoring and alerting system by Stack Exchange) and I'm quite confused about how to monitor "boolean" metrics.
I would like to get alerted if some process is not running.
To collect the metric and I've tried 2 ways of doing it:
In the documentation of scollector I see that some processes can be configured I don't receive any related metric. Do I need any special configuration for enabling those processes checks?
I've created a custom collector to count those processes.
For getting alerted, I created the following rule:
alert test {
template = test
crit = avg(q("avg:myprocess.running{host=*}", "10m", "")) < 1
}
Is this the proper way of doing it or is there a better way?