I've installed Tika with Solr, and it's working well for arabic pdf. Is there any tutorial to make this happen? I've seen a similar question to this one, and the solution there was to include ICU4J.jar, but I don't know what that means.
What's the question? You say "*it's working well for arabic pdf*", so I'm not sure what isn't working and what you need help with. – Gagravarr Apr 18 '12 at 15:55
It's working for other document formats such as doc, odt, etc., but for PDF it doesn't extract Arabic well. I think they found a solution here: http://stackoverflow.com/questions/7834401/solr-for-arabic but I'm a newbie with Java. – sel_space Apr 18 '12 at 16:10
1 Answer
ICU4J can be downloaded here: http://site.icu-project.org/download

Walter Underwood
WEB-INF/lib is the standard place for additional libraries (jar files) in a web application (like Solr). If you are running the Solr war file, then look for a shared library directory for your servlet container (probably Tomcat or Jetty). – Walter Underwood Apr 23 '12 at 15:40
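After copying the ICU4J jar into WEB-INF/lib (or the container's shared library directory), one way to confirm that the jar is actually being picked up is to ask the class loader for a class that lives inside it. The following is only a minimal sketch: the `ClasspathCheck` class and its `isOnClasspath` helper are hypothetical names made up for illustration, and `com.ibm.icu.text.Normalizer2` is assumed here as a convenient ICU4J class to probe for.

```java
// Hypothetical helper (not part of Solr, Tika, or ICU4J): reports whether a
// class is visible to the current thread's class loader.
final class ClasspathCheck {

    // Assumption: this class ships in the ICU4J jar; any other ICU4J class would do.
    static final String ICU4J_PROBE_CLASS = "com.ibm.icu.text.Normalizer2";

    static boolean isOnClasspath(String className) {
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        try {
            // 'false' means: look the class up without running its static initializers.
            Class.forName(className, false, loader);
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Class is missing, or it was found but could not be linked.
            return false;
        }
    }

    public static void main(String[] args) {
        // Probe the class named on the command line, or the ICU4J one by default.
        String name = args.length > 0 ? args[0] : ICU4J_PROBE_CLASS;
        System.out.println(name + (isOnClasspath(name) ? " is visible" : " is NOT visible"));
    }
}
```

Run standalone, this only checks the classpath of the JVM it was started in, for example `java -cp icu4j.jar:. ClasspathCheck` (the jar file name is a placeholder for whatever version was downloaded). It does not by itself prove that Solr's webapp class loader sees the jar; for that, the same check would have to run inside the webapp.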
Unfortunately I'm a PHP programmer and I'm not using Tomcat; instead I'm using apache2, with apachesolr as a server. Any detailed how-to would be much appreciated. Thanks for your reply. – sel_space Apr 26 '12 at 11:44
Apache Solr is a web application written in Java. It is installed in a servlet container, usually Jetty or Tomcat. If you are going to use Solr, you will need to learn the basics of configuring Java webapps. – Walter Underwood May 03 '12 at 18:37
Hi sel_space, have you been able to get this working? Did you understand how you can include ICU4J? I am struggling with this as well.... – perpetual_dream Nov 27 '12 at 21:35