It is a tower server machine located in datacenter. I have complete access to the machine.
I'm experiencing short dropouts, usually under 10 minutes a couple times a week or so. Unfortunately, last saturday, it has been off for like 5 minutes, on for 3, off for 10 and so on for about half an hour. I had to reboot the outlet since I simply couldn't afford to debug at the moment, there are websites I can't left inaccessible for long time. After like 40 minutes, when system booted up after hard-drive check, network was just fine again. Since that a day stable, then last night, another dropout 1-10 minutes long (I'm pinging from another machine every 10 minutes to get a status).
I have never found anything useful in logs, if I'm looking in right places. No load spikes either. I tried to get KVM connected several times during the dropout but it always go off before support can setup KVM. Only once I managed to get the access over KVM while dropout. I can confirm I couldn't reach the network but machine was working just fine. Unfortunately, it was too short to find out anything else.
Everytime, my housing provider isn't aware of any dropout on his behalf. I have several more machine there, they all run just fine. But still it could be misbehaving router or simply bad ethernet cable.
I need to find the cause of these dropouts because I can't afford much more website interruptions like that anymore.
Is there any nice tool (network monitor), I could use? I need something simple enough, so I could actually understand the log and point at specific cause.
Also, does it strike to you as software issue, machine HW issue or problem outside the machine, within the network? Is there even way to tell which one, if network goes offline just like that. For instance, I guess there won't be any preceding errors if it is a bad cable somewhere.