How to Remove Mask or Corrupted Image from PDF?

Asked Feb 14 '17 at 12:44

Active Feb 20 '17 at 09:07

I am working on a Ruby on Rails application that extracts text and images from PDF files. During extraction, a few of the images come out corrupted.

Is there any way to identify the corrupted images after extraction? Does anyone know why they get corrupted?

I am using the pdftohtml and pdftotext utilities (from poppler) on Ubuntu.
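To make the "identify after extraction" part concrete, here is a minimal sketch of one possible check, assuming the extracted files are PNG or JPEG files on disk. The helper name `image_status` is hypothetical and not part of poppler or Rails. It only compares the first and last bytes of each file against the standard PNG and JPEG markers, so it can flag truncated or mislabeled files but cannot tell whether the pixel data itself decodes correctly.

```ruby
# Hypothetical helper (not part of poppler or Rails): checks whether a file
# starts and ends with the byte markers expected for PNG or JPEG.
PNG_SIGNATURE = "\x89PNG\r\n\x1A\n".b       # 8-byte PNG file signature
PNG_TRAILER   = "IEND\xAE\x42\x60\x82".b    # IEND chunk type plus its CRC
JPEG_SOI      = "\xFF\xD8".b                # JPEG start-of-image marker
JPEG_EOI      = "\xFF\xD9".b                # JPEG end-of-image marker

# Returns :ok, :truncated, :empty, or :unknown_format for the file at path.
def image_status(path)
  data = File.binread(path)
  return :empty if data.empty?

  if data.start_with?(PNG_SIGNATURE)
    data.end_with?(PNG_TRAILER) ? :ok : :truncated
  elsif data.start_with?(JPEG_SOI)
    # Assumption: nothing follows the end-of-image marker. Files with
    # trailing padding would be reported as :truncated by this check.
    data.end_with?(JPEG_EOI) ? :ok : :truncated
  else
    :unknown_format
  end
end
```

Calling `image_status` on each extracted file and collecting the paths that do not return `:ok` would give a list of suspect images to inspect by hand. A file that passes may still render incorrectly, so this is a first filter rather than a full validation.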

Thanks in advance.

edited Feb 20 '17 at 09:07

forchetan01

asked Feb 14 '17 at 12:44

sam

0 Answers