Problem
My Debian server seems to lose its network connection after a random amount of time (sometimes after an hour, sometimes after a week, sometimes something in between). The server is hosted in a datacenter, so I do not have console access unless I order a KVM. When I restart the server remotely and boot into a rescue system, the server regains network connectivity.
At first I was not sure whether the server froze or only the network connection was lost. I ran a script that pinged 1.1.1.1 every minute and logged the result to a file. I just checked that file and can now confirm that the script was still running but could not reach 1.1.1.1 before I restarted the server.
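For reference, a check like the one described can be sketched roughly as follows. This is not the exact script I ran, only a minimal Python sketch. It assumes the Linux iputils `ping` with its `-c` and `-W` options; the function names and the log file name are made up for illustration.

```python
import subprocess
import sys
import time
from datetime import datetime, timezone

HOST = "1.1.1.1"          # target mentioned in the post
INTERVAL_SECONDS = 60     # one check per minute


def ping_once(host: str, timeout_s: int = 5) -> bool:
    """Send one ICMP echo request; return True only if ping exits with 0."""
    try:
        result = subprocess.run(
            ["ping", "-c", "1", "-W", str(timeout_s), host],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s + 5,
        )
    except (subprocess.TimeoutExpired, OSError):
        # ping hung or is not installed: count it as a failed check
        return False
    return result.returncode == 0


def format_entry(when: datetime, host: str, ok: bool) -> str:
    """Build one log line: ISO timestamp, host, and ok/FAIL."""
    status = "ok" if ok else "FAIL"
    return f"{when.isoformat(timespec='seconds')} {host} {status}"


def run(log_path: str) -> None:
    """Check HOST once per interval and append every result to log_path."""
    while True:
        line = format_entry(datetime.now(timezone.utc), HOST, ping_once(HOST))
        with open(log_path, "a", encoding="utf-8") as fh:
            fh.write(line + "\n")
        time.sleep(INTERVAL_SECONDS)


# Start the loop only when a log file path is given on the command line.
if __name__ == "__main__" and len(sys.argv) == 2:
    run(sys.argv[1])
```

Started as `python3 ping_log.py ping_check.log` (both file names are hypothetical), it keeps appending one line per minute. Because the file is reopened for every entry, a line that is missing entirely points to a frozen machine, while a run of `FAIL` lines points to a lost connection on a machine that is still running.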
Troubleshooting workflow
- Restart server via remote reboot (feature of the datacenter)
- Reboot into rescue system (virtual KVM)
- Check logs in /var/log
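The log check in the last step could be narrowed down with something like the following. It is a rough sketch, not a definitive tool: the keyword list is only my guess at what network-related kernel or DHCP messages might contain, and the log files to scan are passed as arguments because their location depends on where the rescue system mounts the installed system.

```python
import sys

# Assumed search terms, matched case-insensitively; adjust as needed.
KEYWORDS = ("link is down", "netdev watchdog", "carrier", "dhcp")


def find_network_lines(lines, keywords=KEYWORDS):
    """Return the lines that contain at least one keyword."""
    lowered = tuple(k.lower() for k in keywords)
    return [
        line.rstrip("\n")
        for line in lines
        if any(k in line.lower() for k in lowered)
    ]


def main(paths):
    """Print every matching line, prefixed with the file it came from."""
    for path in paths:
        try:
            # errors="replace" keeps the scan going over odd bytes in logs
            with open(path, "r", encoding="utf-8", errors="replace") as fh:
                for hit in find_network_lines(fh):
                    print(f"{path}: {hit}")
        except OSError as exc:
            print(f"{path}: cannot read ({exc})", file=sys.stderr)


if __name__ == "__main__":
    main(sys.argv[1:])
```

A call might look like `python3 scan_logs.py /mnt/var/log/syslog /mnt/var/log/kern.log`, where the script name and the `/mnt` mount point are hypothetical. Comparing the timestamps of any hits with the first failed ping should show whether the system logged anything at the moment the connection dropped.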
Question
What can I do to determine the cause of this problem?