We noticed in some of our hardware that our switch (an integrated KSZ9477) stops responding, with the logs showing the following:
[ 9585.977985] ksz9477-switch 0-005f port1: Link is Down
[ 9585.983206] br0: port 2(port1) entered disabled state
[ 9585.993337] ksz9477-switch 0-005f port1: failed (err=-11) to del object (id=2)
[12664.128536] Microchip KSZ9477 dsa-0.0:01: Master/Slave resolution failed
When the bridge port goes down, it is impossible to stop/restart the systemd network. A reboot is the only workaround we have.
I searched for both "entered disabled state" and "Master/Slave resolution failed", and I see people reporting without seeing a common/cohesive root cause.
Any idea of why a switch port would enter into a disabled state, and what does "Master/Slave resolution failed" means?
Thanks.