I am developing an image-processing framework based on the YOLOv7 deep-learning model. A real-time machine produces a live stream of 20,000 images per second, and I want to process these 20,000 frames in as little time as possible on my current GPU architecture. Is it possible to increase the processing FPS with open-source tools or techniques, such as multithreading or Kafka?
Any suggestions or ideas would be appreciated.
Note: I am using an NGINX server with Docker and Kubernetes.
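
To make the multithreading idea concrete, here is a rough sketch of the kind of pipeline I have in mind: one thread pushes frames onto a bounded queue, and another groups them into batches so that the model is called once per batch instead of once per frame. Everything here is an assumption for illustration: `fake_infer` is a stand-in for a batched YOLOv7 forward pass (not the real API), and `BATCH_SIZE`, `run_pipeline`, and the queue size are hypothetical names and values.

```python
import queue
import threading

BATCH_SIZE = 64   # hypothetical value; would need tuning to GPU memory
SENTINEL = None   # marks the end of the frame stream


def fake_infer(batch):
    """Stand-in for a batched model call (assumption, not the YOLOv7 API).

    Returns one dummy result per frame: the frame's length in bytes.
    """
    return [len(frame) for frame in batch]


def producer(frames, q):
    """Push incoming frames onto the queue, then signal end of stream."""
    for frame in frames:
        q.put(frame)  # blocks when the queue is full (back-pressure)
    q.put(SENTINEL)


def consumer(q, results, batch_size):
    """Collect frames into batches and run inference once per batch."""
    batch = []
    while True:
        frame = q.get()
        if frame is SENTINEL:
            break
        batch.append(frame)
        if len(batch) == batch_size:
            results.extend(fake_infer(batch))
            batch = []
    if batch:  # flush the final, partially filled batch
        results.extend(fake_infer(batch))


def run_pipeline(frames, batch_size=BATCH_SIZE, max_queue=1024):
    """Run one producer and one consumer thread; return per-frame results."""
    q = queue.Queue(maxsize=max_queue)
    results = []
    t_prod = threading.Thread(target=producer, args=(frames, q))
    t_cons = threading.Thread(target=consumer, args=(q, results, batch_size))
    t_prod.start()
    t_cons.start()
    t_prod.join()
    t_cons.join()
    return results
```

With a single consumer the results come back in frame order. Whether this is enough at 20,000 frames per second, or whether the queue should be replaced by something like Kafka with several consumers, is exactly what I am unsure about.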