0

I develop API using Django for AI model but it's prediction is slow and i have lots of request which take time to execute through AI API, I need help to handle multiple Request at same time

Ayaz Khan
  • 104
  • 1
  • 9

1 Answers1

2

If your AI model is stateless meaning new request can be processed independent of the previous requests, then you can run multiple instances of your AI model. You can use a Deployment with multiple replicas. Then, use a Service to load-balance between the instances.

Emruz Hossain
  • 4,764
  • 18
  • 26