I currently have an API service running on AWS Lambda, but it's getting expensive due to high AWS costs. My aim is to replace 95 % of the usage with a cluster of servers and still handle peaks with Lambda. So, I would need a hybrid setup where the load balancer sends the requests to Lambda when the cluster can't handle them.
I don't have much experience with this kind of setup, so I'm asking for suggestions. Something that comes to mind is a load balancer + Kubernetes + Lambda, but I'm not sure if that's possible.
That's where you come in. Can you suggest something that would make this possible? As said, the aim is to reduce costs, so I would use free software where possible (and pay for the hardware, i.e. servers).
Thanks.