For a project, I have created two Kubernetes clusters on GKE:
Cluster 1: 10 containers in one Pod
Cluster 2: 10 containers in 10 different Pods
All containers are connected and constitute an application.
I would like to generate some load and observe how the Vertical Pod Autoscaler (VPA) autoscales the containers.
So far, using the "Auto" mode, I have noticed that the VPA changes the values only once, at the beginning, and not while I generate load. The Upper Bound is also so high that no further change is ever needed.
Would you suggest that I:
1) use Auto or Recommendation mode?
and
2) create 1 or 2 replicas of my application?
Also, 2 of the 10 containers are MySQL and MongoDB. So if I have to create 2 replicas, I should use StatefulSets or operators, right?
Thank you very much!