I'm currently using munin to monitor a bunch of linux servers (as well as a few WinXP clients). However, munin does its data collection sequentially, and seems to be very susceptible to timing out when clients disconnect in the middle.
Are there any parallel versions?
Is there any way to handle the case of a disconnected client quicker?
Right now, many of my data collection tasks take longer than the 5 minutes until the next collection starts, leading to both warnings and missed data points.