1

I need to index all the annotations and features generated after GATE processing into SOLR. I need to search upon annotations as well as features. What is the best way to do this? I would prefer moving processing to hadoop.

I am using behemoth at the moment, but it does not index the annotations and features, it indexes only the text.

Can someone please help me out?

madzie
  • 47
  • 1
  • 9
  • 2
    Take a look at http://www.findwise.com/blog/how-to-index-and-search-xml-content-in-solr/ for some ideas – arun Mar 31 '13 at 23:24

1 Answers1

2

See answer on the DigitalPebble mailing list where you should ask questions about Behemoth https://groups.google.com/d/msg/digitalpebble/o_6xHtt1rpk/ZdKa6JKcRGIJ

Julien Nioche
  • 4,772
  • 1
  • 22
  • 28
  • Thanks! I had asked this question here before asking on the mailing list. I managed to solve it in the same manner just before you replied. Thanks again. – madzie Apr 14 '13 at 14:35