I need to build an image similarity engine in Azure. I use Spotify's Annoy library to build a vector search DB. The vector DB files are roughly 10-15 MB.

The functionality is as follows: the user uploads an image, and embeddings for it are generated using a pretrained model. These embeddings are used to find approximate nearest neighbours (ANNs) in the vector file. The ANN search returns the names of the similar images, which are then shown to the end user.

So far I have created an Azure Function that bundles the static vector DB file with the function's code files. The web interface is hosted in a Storage container along with the images (19,000 in total).

My concern is that the static vector DB file is large, so every time the function is created it will cause performance as well as pricing issues.

I am new to cloud/Azure. Also, can I do the same using Azure ML Studio?

halfer
Lakshay Dulani

1 Answer

SentenceTransformers provides models that embed images and text into the same vector space. This makes it possible to find similar images as well as to implement image search. A query can be defined using either text or a reference image.
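To illustrate what a shared vector space buys you, here is a minimal sketch that ranks images by cosine similarity against either kind of query. The 3-dimensional vectors, file names, and the `rank` helper are invented for the example; in practice the vectors would come from a CLIP-style model, and this exact brute-force ranking would be replaced by an ANN index for larger collections.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank(query, catalogue, k=2):
    """Return the k image names whose vectors are most similar to query."""
    scored = sorted(
        catalogue.items(),
        key=lambda item: cosine(query, item[1]),
        reverse=True,
    )
    return [name for name, _ in scored[:k]]


# Toy image embeddings (assumption: real ones come from the model).
image_vectors = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "forest.jpg": [0.1, 0.9, 0.1],
    "city.jpg": [0.0, 0.2, 0.9],
}

# Pretend embeddings of a text query and of a reference image.
text_query = [0.8, 0.2, 0.1]
image_query = [0.0, 0.1, 1.0]

by_text = rank(text_query, image_vectors)    # beach.jpg ranks first
by_image = rank(image_query, image_vectors)  # city.jpg ranks first
```

Because text and image embeddings live in the same space, the same `rank` function serves both query types without any special casing.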

Here is a sample of visual search using OpenAI CLIP: https://github.com/retkowsky/visual_search_openai_clip

or

https://github.com/Azure-Samples/miyagi/tree/main/ancillary/experiments/langchain/qdrant_miyagi_example

Ramprasad