I have a developer running automated tests that spike above 50% utilization during setup, as seen in the image below.
He is arguing that the tests are unstable and difficult to troubleshoot, so downsizing is inappropriate. Ignoring the obvious response of "you need to fix your tests," is that even a fair comment? It looks like it only spikes during setup, so if I cut this from an m5.xlarge to an m5.large, will stability actually change?
We have dozens of these being spun up at any given time, so the costs are becoming significant.
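
For reference, here is the back-of-the-envelope reasoning I'm working from, as a rough sketch rather than a real capacity model. It assumes the spike in the graph is CPU utilization, that setup is CPU-bound, and that the load scales linearly with vCPU count (4 vCPUs on an m5.xlarge, 2 on an m5.large). The function name `projected_utilization` is just something I made up for illustration.

```python
# vCPU counts for the two instance sizes in question.
VCPUS = {"m5.xlarge": 4, "m5.large": 2}


def projected_utilization(current_pct, current_type, target_type):
    """Project peak CPU utilization after a resize.

    Assumption: the work is CPU-bound and scales linearly with vCPU count,
    so halving the vCPUs doubles the utilization percentage.
    """
    scale = VCPUS[current_type] / VCPUS[target_type]
    return current_pct * scale


peak = projected_utilization(50.0, "m5.xlarge", "m5.large")
# Anything at or above 100% means the smaller instance would be pinned
# for the duration of the spike.
saturated = peak >= 100.0
print(f"projected peak: {min(peak, 100.0):.0f}% (saturated: {saturated})")
```

By that estimate, anything above 50% on the xlarge would pin the large at 100% during setup, which I would expect to slow setup down rather than necessarily break it. This ignores memory, which also halves (16 GiB to 8 GiB).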