I am trying to use Vision API to extract the caption for an image
Looks like the Vision API does not provide the caption about the image. It provides the list of objects in the Image.
Looks like IBM Caption Generator provides the Caption for the image.