I have a FastAPI endpoint on a cluster with addess 0.0.0.0:8084/predict
. And I want to send a request to this endpoint from a React App which is locally hosted on my computer. I have a Personal access token for the workspace but dont know how to send the request using PAT. If anyone knows, please mention how can I hit http://0.0.0.0:8084/predict
endpoint in a cluster so that I can do realtime prediction with low latency. Mentioning a URL template to hit would be a great help.
PS. My current requirement does not allow me to use Model serving feature of Databricks.
My current requirement does not allow me to use Model serving feature of Databricks. So, I have no clue about this.