Questions tagged [bosun]

Bosun is an open-source, MIT licensed, monitoring and alerting system by Stack Exchange. It has an expressive domain specific language for evaluating alerts and creating detailed notifications. It also lets you test your alerts against history for a faster development experience.

Documentation

76 questions
8
votes
1 answer

How to handle NaN in Bosun?

I have 2 metrics and try to find the difference of average value between them in percentage like 100*(m1+m2)/m1 but this obviously produces NaN if m1 turns to zero. How should I handle this case if I don't want to alert when the metrics turn to…
timurb
  • 5,405
  • 2
  • 22
  • 17
7
votes
1 answer

How do I format numbers within Bosun templates?

In a Bosun template, is it possible to format the output of an evaluated variable from the alert to less decimal places of precision? Simple Example: template test_template{ subject = test body = {{.Eval .Alert.Vars.average_runtime}}…
Jon
  • 83
  • 4
6
votes
2 answers

Bosun adding external collectors

What is the procedure to define new external collectors in bosun using scollector. Can we write python or shell scripts to collect data?
Sandesh
  • 428
  • 9
  • 22
5
votes
2 answers

Integrate graphite metrics with bosun

I am running Docker container for bosun. I want to integrate the graphite metrics with bosun. What are the configuration changes that need to be done for this?
Sandesh
  • 428
  • 9
  • 22
3
votes
1 answer

Collect data for Bosun from multiple endpoints

In the observability system we're building from scratch, we'd like to have a single scollector to collect data from all the web servers and send it to Bosun, instead of having an instance of scollector on each server. Do you know if there's a way to…
SirAnderson
  • 101
  • 2
  • 14
3
votes
1 answer

Configure scollector to log if a process is running

I'm trying to use Bosun to determine if certain processes are running and then eventually alert on if they are up or down. I'm probably misinterpreting the docs but I can't figure this out. Bosun is running just fine. I have the scollector running…
Paul Lemke
  • 5,494
  • 3
  • 47
  • 66
3
votes
1 answer

Bosun send alert specifc data via json post body

When using Bosun you can send a HTTP POST request using an alert notification with a specified json body like so: post = myurl body = {"foo": "bar"} I have an external application which listens to myurl and sends and email based on the context of…
Chris Edwards
  • 1,518
  • 2
  • 13
  • 23
3
votes
1 answer

Check if a process is running on Bosun

I'm testing Bosun (open-source monitoring and alerting system by Stack Exchange) and I'm quite confused about how to monitor "boolean" metrics. I would like to get alerted if some process is not running. To collect the metric and I've tried 2 ways…
Javier Segura
  • 658
  • 4
  • 13
2
votes
1 answer

Counter difference graph

Image that we have a time series with timestamps t_1, t_2 ... t_n. And a metric m with values m_1, m_2 ... m_n for the respective timestamps. Metric m is a counter so it will only increase. Then there is a second metric k that is calculated from two…
m3nthal
  • 413
  • 5
  • 14
2
votes
2 answers

Return 0 if results aren't found from elasticsearch query

I'm running bosun to alert against an elasticsearch data set. The scenario is that there's a number of cron jobs that do various things. If these execute successfully, they will log a success message. If they die / fail to run for whatever reason…
user101289
  • 9,888
  • 15
  • 81
  • 148
2
votes
1 answer

Bosun how to add series with different tags?

I'm trying to add 4 series using bosun expressions. They are from 1,2,3,4 weeks ago. I shifted them using shift() to have current time. But I can't add them since they have the shift=1w etc tags. How can I add these series together? Thank you edit:…
sgarg
  • 2,340
  • 5
  • 30
  • 42
2
votes
1 answer

Bosun HA and scalability

I have a minor bosun setup, and its collecting metrics from numerous services, and we are planning to scale these services on the cloud. This will mean more data coming into bosun and hence, the load/efficiency/scale of bosun is affected. I am…
2
votes
1 answer

Bosun: Lookup is giving integer when string is expected

I'm tweaking my bosun.conf to allow my os.cpu.high alert to use a lookup when determining which duration to to use depending on the host: lookup high_cpu { entry host=* { time = 5m } entry host=*graylog* { time = 1h …
2
votes
1 answer

Bosun: How to handle empty number sets with ungroup?

I'm trying to setup Bosun and Graphite to alert on error ratio, compiled from two different sources: API traffic and web app traffic. Here's what I have now: $web_rate = avg(graphite("sumSeries(collectd.*.statsd.web.*.rate)", "5m", "",…
Vince Gatto
  • 415
  • 3
  • 13
2
votes
1 answer

http post notifications are not being sent to server

I am trying to get an alert notification from bosun(using docker image) when my cpu usage is high at client vm using bosun's UI, it is showing up critical condition but not sending the notifications, also finding a way to debug the config file. My…
ell
  • 21
  • 2
1
2 3 4 5