I know, the question is kinda generic, but I really can't be more specific because I simply have no idea what's going on:
It has now happened twice (once on our live server and once on our test server), that the response becomes completely unresponsive. I can't reach our website in any browser and I can't connect via SSH, all I get is Connection closed by remote host
. I have to do a hard reset using hetzner's web-tool to manage servers. That's the only way I can get the server to be reachable again. A soft reset (hetzner says is the equivalent of sending ctrl+alt+del) has no effect at all. After doing a hard reset I can SSH in again and the website is back up.
The server in question is an EX41-SSD from hetzner with Ubuntu 14.04.5. Our website runs using the LAMP stack, meaning Apache, MySQL and PHP.
The thing is, I have no real idea where to start looking for the source of this issue. I'm a web developer who's done some server admin stuff before, but nothing more than setting up machine, hardening them, setting up puppet. Never really had to investigate server outage and the likes.
The one thing I did find is a 3h gap in /var/log/syslog
. There is some output from a cron of ours, then a whole lot of "^@^@^" and then the output caused by the hard reset or rather the boot process following it.
Any idea where else I might look? Or what I can do, to make investigations in the future easier? Are there some monitoring tools I should install/setup?
Thank you anyway!