I have developed and deployed a JSP/Servlet Java web app on Google App Engine. The app makes HTTP requests to a third-party website to retrieve data via REST services and shows that data to users who access it via "appspot". It does not store any data on Google Cloud.
I have to stick with the free plan of Google Cloud because of a limited budget. The free plan allows up to 28 frontend instance hours per day. According to the Google App Engine docs:
"Instance usage is billed by instance uptime, at a given hourly rate. Billable time starts when an instance starts, and ends fifteen minutes after it shuts down."
I am wondering:
Can an instance handle multiple requests from the client side within the first fifteen minutes of its uptime?
What is the maximum number of requests an instance can handle during its uptime?
How does that relate to the "Frontend Instance Hours" (28 hrs) quota? What is the best way to estimate how many requests (users) my web app can handle per day?
In my particular case, do requests sent from the web app to the third-party website count toward the quota?
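To make the question concrete, here is a minimal sketch of how I currently read the billing rule quoted above. It is only my interpretation, not something taken from the App Engine docs: the class and method names are made up for illustration, the 5-minute uptime is a hypothetical value, and I am assuming that only one instance runs at a time and that each start/shutdown cycle is billed as uptime plus the fifteen-minute tail.

```java
// Hypothetical sketch of my reading of the billing rule; not an App Engine API.
class InstanceHoursSketch {
    // Free quota from the question: 28 frontend instance hours, in minutes.
    static final long FREE_QUOTA_MINUTES = 28 * 60;
    // Tail from the quoted docs: billing ends fifteen minutes after shutdown.
    static final long BILLING_TAIL_MINUTES = 15;

    // Billable minutes for one start/shutdown cycle (assumed: uptime + tail).
    static long billableMinutes(long uptimeMinutes) {
        return uptimeMinutes + BILLING_TAIL_MINUTES;
    }

    // How many such cycles fit in the free quota (assumed: one instance at a time).
    static long cyclesWithinQuota(long uptimeMinutes) {
        return FREE_QUOTA_MINUTES / billableMinutes(uptimeMinutes);
    }

    public static void main(String[] args) {
        long uptime = 5; // hypothetical: instance serves requests for 5 minutes, then shuts down
        System.out.println("Billable minutes per cycle: " + billableMinutes(uptime));
        System.out.println("Cycles within free quota: " + cyclesWithinQuota(uptime));
    }
}
```

If this reading is right, the quota limits how long instances stay up rather than how many requests they serve, which is why I am asking how many requests fit into a single instance's uptime.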