I'm have a domestic proxmox installation which was running fine for a month or two. Recently, I started to have issues with two of the Debian-based virtual machines: the one that I use as router and the one I use as NAS.
Apparently, these virtual machines cannot stand high CPU load (NAS crash when I try to do a rsync), or any load at all (router machine cannot even boot). They are like frozen. This apparently happens only to these two machines, I do NOT encounter the problem with the other.
This is an example of how they look like from the graphical interface.
My hardware is this: a CPU Ryzon 5 2600 with 16 Gb of RAM ECC.
How can I understand what is going on with the two virtual machines?
root@server:~# journalctl -r -p3
--- Logs begin at Mon 2019-03-18 20:13:55 CET, end at Mon 2019-03-18 20:42:01 CET.
Mar 18 20:18:34 server pvedaemon[3540]: <root@pam> end task UPID:server:0000115A:00006C05:5C8FEF0A:vncproxy:105:root@pam: Failed to run vncproxy.
Mar 18 20:18:34 server pvedaemon[4442]: Failed to run vncproxy.
Mar 18 20:18:34 server qm[4444]: VM 105 qmp command failed - VM 105 not running
Mar 18 20:18:31 server pvedaemon[3540]: <root@pam> end task UPID:server:00001114:00006AC3:5C8FEF07:vncproxy:105:root@pam: Failed to run vncproxy.
Mar 18 20:18:31 server pvedaemon[4372]: Failed to run vncproxy.
Mar 18 20:18:31 server qm[4374]: VM 105 qmp command failed - VM 105 not running
Mar 18 20:18:08 server pvedaemon[3541]: VM 105 qmp command failed - VM 105 qmp command 'guest-ping' failed - got timeout
Mar 18 20:18:04 server pvedaemon[3541]: VM 103 qmp command failed - VM 103 qmp command 'guest-ping' failed - got timeout
Mar 18 20:17:58 server pvedaemon[3539]: VM 105 qmp command failed - VM 105 qmp command 'guest-ping' failed - got timeout
Mar 18 20:14:10 server pveupdate[4787]: <root@pam> end task UPID:server:000013BA:0000080C:5C8FEE01:aptupdate::root@pam: command 'apt-get update' failed: exit code 100
Mar 18 20:14:10 server pveupdate[5050]: command 'apt-get update' failed: exit code 100
Mar 18 20:14:04 server iscsid[4331]: iSCSI daemon with pid=4333 started!
Mar 18 20:13:55 server kernel: Error: Driver 'pcspkr' is already registered, aborting...
Mar 18 20:13:55 server kernel: Couldn't get size: 0x800000000000000e
Mar 18 20:13:55 server kernel: ACPI Exception: AE_AML_OPERAND_TYPE, Could not execute arguments for [IOB2] (Region) (20170831/nsinit-426)
Mar 18 20:13:55 server kernel: ACPI Error: Needed [Integer/String/Buffer], found [Region] (ptrval) (20170831/exresop-424)
As you can see from the image, this happens to just two of the machines in the network. All of them are always Debian 9 machine from the same ISO image. There are no major differences between them.
Update
I wanted to attach to the terminal tty0 from the host and cause a crash of the VM. Router VM is no longer usable because it crashes at each reboot just after a few seconds. On the other hand, the VM nas
instead allow me to play for a while. For instance I can safely play with the command stress, but the VM crash when I try to run an apt upgrade
.
Nevertheless, on the serial terminal I am attached to from the host, there are NO error messages of any sort.