I have a set of PDFs, for which some pages have had partial contents redacted through Adobe Acrobat. I would like to programmatically iterate through each page and determine whether the page contains redacted content, preferably using Python (note that I'm not having trouble iterating through the PDF pages, just determining presence of redacted content).
I've used PyMuPDF's getText() function to check for any "ghost" indicators in the PDF's text layer of redacted space, but there doesn't seem to be any clues. I'm wondering if there's any other data hiding in the PDF that I could extract that would point to a redaction layer.