I've been trying to improve our ceph recovery speed and every option I've come across in ceph documentation and on various forums seems to have no effect.
I've tried setting a combination of options found online at this point with no change in recovery speed. Current settings are set via:
for i in $(sudo ceph osd ls)
do
sudo ceph tell osd.$i injectargs --osd-max-backfills=7 \
--osd-recovery-max-active=50 \
--osd-recovery-op-priority=100 \
--osd-recovery-max-active-hdd=50 \
--osd-client-op-priority=3
done
No matter what recovery stays around:
io:
client: 857 MiB/s rd, 357 MiB/s wr, 748 op/s rd, 745 op/s wr
recovery: 53 MiB/s, 16 objects/s
Any help on how to get ceph to recover faster would be greatly appreciated.
Ceph Version: 15.2.9
OS: Ubuntu 20.04
Storage Type: SATA HDDs
Network: 2x10Gbps each node in LACP Teaming mode
Number of Nodes: 15
Disks per Node: 5 Nodes with 90x10TB Disks and 10 Nodes with 60x14TB Disks