This will not be a short question - but please bear with me :)
Back in 2012 I've set up svn synchronization from one of our datacenters in US to multiple mirrors around the world (3 to be exact). It generally worked without any problems all these years (it hosts test data, it's currently 1.2TB @ revision 573453).
Recently I needed to rebuild the master server performing the synchronization (without touching the mirrors) and here's where the problems started... Sometimes, for bigger revisions only, the synchronization from the new master to remote mirrors fails - we have 3 such mirrors and sometimes it fails to all of them and sometimes to just one or two... It never fails to 3 other mirrors within the same datacenter (svn versions 1.10.3, and 2x 1.7.5 (this will be important below ;) )).
When I did an strace svnsync...
it looked like the svnsync process got a "Connection reset by peer" error when reading some data way into the synchronization of the "bigger/problematic" revision and then it exited after closing the http connections (it always proceeds for a good while before the failure occurs).
I was not able to find anything obvious in any of the logs (not that I'm too competent in this area ;) - managing these is a 4th level side job).
SVN is served with apache mod_svn on all systems - it fails the same regardless if I use http or https as the source protocol - the targets are all http atp.
Once this fails it will fail time and again (to 1,2 or 3 mirrors) and you can attempt it dozens of times and it will fail more or less at the same place (not exactly but close) but...
Here's the kicker...
It will always work, from execution nr 1 and without any fail so far, when the old master is used as a middleman by executing the exact same svnsync command on it...
The old server uses:
bash-4.1$ svnsync --version
svnsync, version 1.7.5 (r1336830)
compiled May 15 2012, 17:55:12
The new server has:
bash-4.4$ svnsync --version
svnsync, version 1.10.2 (r1835932)
compiled Feb 10 2021, 09:25:28 on x86_64-redhat-linux-gnu
Could using the older svnsync process cause a different protocol/mode to be selected for the synchronization between the same two repositories that does not have some problem that exists when a matching svnsync is used to synchronize to an "older" repository?
svnsync 1.10.2 SYNC REMOTE FROM 1.10.2 TO 1.7.5 == FAIL (sometimes after a while/longer period of time pushing data)
svnsync 1.7.5 SYNC REMOTE FROM 1.10.2 TO 1.7.5 == SUCCESS (each and every time)
What logs could I check and what logging can I enable to try and narrow it down?
ps. Since this is a "corporate" environment I tried to make sure no "strange network things" ("transparent caches and accelerators and such") are enabled/active and the network teams think they are not ;)
pps. All these servers are VMs run on one or another vmware solution.