I am testing Apache Tika Server (v2.4.1) and I see that it fails for large documents with an error of:
Error 500 Server ErrorHTTP ERROR 500 Server Error
URI:/rmeta/form/text STATUS:500 MESSAGE:Server Error SERVLET:- CAUSED BY:org.apache.cxf.interceptor.Fault: Java heap space CAUSED BY:java.lang.OutOfMemoryError: Java heap space
Powered by Jetty:// 9.4.46.v20220331
I am providing it these options and it fails for a PDF file of 340MB in size:
-spawnChild -JXmx6g
So, I can't figure out why this happens. Does the heap setting apply only to the parent process and not to the spawned child process?
Am I missing something or I simply need more heap space?
From the looks of it, the child process fails while using only 2GB of RAM, so pretty far away from the 6GB limit specified...