I tried to get more information from the error log of an SAS disk by running the following command that prints values and descriptions of the SAS (SSP) Protocol Specific log page.
# smartctl -d scsi -l sasphy /dev/sg1
Protocol Specific port log page for SAS SSP
relative target port id = 1
generation code = 79
number of phys = 1
phy identifier = 0
attached device type: end device
attached reason: power on
reason: loss of dword synchronization <======================== (?)
negotiated logical link rate: phy enabled; 6 Gbps
attached initiator port: ssp=1 stp=1 smp=1
attached target port: ssp=0 stp=0 smp=0
SAS address = 0x5000...
attached SAS address = 0x5b8...
attached phy identifier = 6
Invalid DWORD count = 0
Running disparity error count = 0
Loss of DWORD synchronization = 24194 <======================== (?)
Phy reset problem = 0
...
relative target port id = 2
generation code = 79
number of phys = 1
phy identifier = 1
attached device type: no device attached
attached reason: unknown
reason: unknown
negotiated logical link rate: phy enabled; 1.5 Gbps
attached initiator port: ssp=0 stp=0 smp=0
attached target port: ssp=0 stp=0 smp=0
attached SAS address = 0x0
attached phy identifier = 0
...
From the above, I note that there is a high loss of DWORD synchronization, which, according to IBM, is an error that occurs when a PHY stops detecting an incoming stream of DWORDs. I tried to search for further information regarding this error but can't seem to find any.
How does loss of DWORD synchronization affect the health of an SAS disk? Do I need to worry about it? And at what threshold level should I be monitoring it?