
I am looking for a search engine, or just the underlying technology, that can comprehend natural-language semantics (to some level, of course) in order to find corresponding images.

The other way around (an image as input, a description as output) would work as well.

For example: software that can distinguish between two men hugging and two men wrestling Greco-Roman style. Or, given as input a photo of a desk with a laptop, a cup of coffee, a Rubik's cube and a Star Wars toy, it would yield the output: programmer desk.

Meir Elad

2 Answers


I came across Metamind.io through my work; it uses some deep neural network wizardry to recognize and label images correctly.

You might get good results from their pre-trained classifier, or you can train your own.

Glorfindel
Jiby

Very late to the party here, but I thought this might help others searching for the same thing.

In terms of basic image analysis, this area has moved forward in leaps and bounds recently, and a lot of providers now offer this functionality. The quality varies quite a bit and depends on how well trained the provider's models are and how large a corpus they were trained on. A couple of examples I have worked with are IBM and Clarifai, but it's a booming area.

What they won't give you is the type of context you are after. Not yet, anyway. They are unlikely to differentiate between two men hugging and two men wrestling (hey, who can always tell the difference as a human anyway?). They may, however, pick out a desk, a cup of coffee, a book, etc.
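To illustrate the gap between object labels and context, here is a minimal sketch of how a list of detected labels could be mapped to a higher-level description such as "programmer desk". It assumes you already have labels from some tagging service; the `SCENE_PROFILES` table and the `guess_scene` function are hypothetical names made up for this example, not part of any provider's API, and the Jaccard overlap is just one simple scoring choice.

```python
# Hypothetical scene profiles: each scene lists objects that suggest it.
# These are illustrative values, not output from any real service.
SCENE_PROFILES = {
    "programmer desk": {"desk", "laptop", "coffee cup", "rubik's cube", "toy"},
    "kitchen table": {"table", "plate", "coffee cup", "fork"},
    "school desk": {"desk", "book", "pencil", "notebook"},
}


def guess_scene(detected_labels, profiles=SCENE_PROFILES):
    """Return (scene, score) for the profile with the highest Jaccard overlap.

    Returns (None, 0.0) when no profile shares any label with the input.
    """
    detected = {label.lower() for label in detected_labels}
    best_scene, best_score = None, 0.0
    for scene, objects in profiles.items():
        union = detected | objects
        # Jaccard similarity: shared labels divided by all labels seen.
        score = len(detected & objects) / len(union) if union else 0.0
        if score > best_score:
            best_scene, best_score = scene, score
    return best_scene, best_score


# Labels as an image tagger might return them for the desk photo.
labels = ["Desk", "Laptop", "Coffee cup", "Rubik's cube", "Toy"]
print(guess_scene(labels))
```

A lookup like this only works for scenes you have profiled in advance and cannot separate hugging from wrestling, since both would produce the same object labels (two men).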

Video search and contextualisation is another challenge entirely, and it is in its infancy. There is at least one company making big inroads in this area (full disclosure: I work there). Movida Labs analyses and indexes many factors in a video to provide a lot of context, so in your example it could very likely tell that this was a video of two men wrestling. I have to admit that this is not because of some sort of breakthrough in technology (although it is very advanced) but because the video in its entirety provides that context.
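As a rough illustration of why a whole video carries more context than a single frame, the sketch below aggregates noisy per-frame labels by majority vote. This is not a description of how Movida Labs or any other product works; `label_video` and the `min_share` threshold are hypothetical, and the per-frame labels are assumed to come from some frame-level classifier.

```python
from collections import Counter


def label_video(frame_labels, min_share=0.5):
    """Return the most common per-frame label, or None if it is not dominant.

    frame_labels: list of labels, one per sampled frame.
    min_share: minimum fraction of frames the winning label must cover.
    """
    counts = Counter(frame_labels)
    if not counts:
        return None
    label, hits = counts.most_common(1)[0]
    return label if hits / len(frame_labels) >= min_share else None


# Individual frames are ambiguous, but the sequence as a whole is not.
frames = ["hugging", "wrestling", "wrestling", "wrestling", "hugging", "wrestling"]
print(label_video(frames))
```

Any single frame here could be mistaken for a hug, while the vote over all frames settles on wrestling.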

kbcool
  • I saw the demo that your company offers; it is very limited, so I could barely see the "entities" labeling, let alone the semantic information. – Meir Elad Aug 30 '16 at 10:56
  • @meirElad: Unfortunately stackoverflow doesn't have a way of PMing other users but visit our site and fill in the contact form and we can further discuss your needs. I am more than happy to help if you can give some further examples of exactly what you are after. It would also be good to know how you felt it was limited given the huge amount of context we can extract from a video. – kbcool Aug 31 '16 at 06:19