We're using Solr and Tika to search external documents such as PDFs and Docs. However, this gives us only the raw text, without the formatting. We would also like to get the formatting and metadata, such as captions and bullets. Is there any way to do this?
Thank you, Moshe