1

I've installed tika with solr , and it's working well for arabic pdf , is there any tutorial to make this happen , I've seen a similar question to this and the solution was to include ICU4J.jar , but I don't now what does it mean

sel_space
  • 187
  • 2
  • 12
  • 1
    What's the question? You say "*it's working well for arabic pdf*" so I'm not sure what isn't working and what you need help with? – Gagravarr Apr 18 '12 at 15:55
  • It's Working for other document format such as doc , odt etc ... but for pdf it doesn't extract arabic well , I think they have found a solution here http://stackoverflow.com/questions/7834401/solr-for-arabic , but I'm newbie with Java . – sel_space Apr 18 '12 at 16:10

1 Answers1

1

ICU4J can be downloaded here: http://site.icu-project.org/download

Walter Underwood
  • 1,201
  • 9
  • 11
  • THank for your response , but how to install it ?? – sel_space Apr 21 '12 at 18:38
  • WEB-INF/lib is the standard place for additional libraries (jar files) in a web application (like Solr). If you are running the Solr war file, then look for a shared library directory for your servlet container (probably Tomcat or Jetty). – Walter Underwood Apr 23 '12 at 15:40
  • unfortunatly I'm a PHP programer and I'm not using tomcat , instead I'm using apache2 , with apachesolr as a server. Any detailed Howto will be so much apriciated, Thanks for your reply – sel_space Apr 26 '12 at 11:44
  • Apache Solr is a web application written in Java. It is installed in a servlet container, usually Jetty or Tomcat. If you are going to use Solr, you will need to learn the basics of configuring Java webapps. – Walter Underwood May 03 '12 at 18:37
  • Hi sel_space, have you been able to get this working? Did you understand how you can include ICU4J? I am struggling with this as well.... – perpetual_dream Nov 27 '12 at 21:35