I am working on a tool which will identify those PDF files (Scanned Documents in pdf forms) which are not readable for human eye i mean they can be blur or not clear (Less DPI). This tool is needed because there are millions of files and Its very difficult for us to open file one by one and make sure if its readable/clear What i have tried: I used spire.pdf library, using this library i am extracting images from pdf and check DPI of each image and compare its dpi with 150 dpi, The 150 DPI is the required and standard DPI for us. So i extract images from pdf and compare each image's dpi with standard if image's dpi is less than standard i mark that "Not clear Image" What i am facing problem after the above solution There are images in PDF of which DPI is less than 150 but they are clear, and there are some images of witch DPI is Good and tool is mark it Good but they are blur or not clear.
Have any of you worked on same requirement or if you have any suggestion i will grateful.