cannot write mode PA as PNG

Question

pdf_file=fitz.open(r"C:\Users\user\Downloads\example.pdf")
for page_index in range(len(pdf_file)):
            page=pdf_file[page_index]
            print(page.get_pixmap())

OSError: cannot write mode PA as PNG

How i can get images from pdf file ?

I try to get images from pdf file

Please show your actual code, not just a small part of it. We can't tell what `pdf_file` is from this. — AKX, Feb 23 '23 at 10:49
You might want to consider using pypdf: https://pypdf.readthedocs.io/en/latest/user/extract-images.html — Martin Thoma, Feb 23 '23 at 23:03

score 1 · Accepted Answer · answered Feb 23 '23 at 11:00

The documentation for the PyMuPDF library you're using has an explicit section on extracting images from PDF documents, with this example code (which is a bit too long to include here, and under the GPL anyway).

It simplifies to something like

import fitz

doc = fitz.open(filename)
seen_xrefs = set()
for page_num in range(doc.page_count):
    for img in doc.get_page_images(page_num):
        xref = img[0]
        if xref in seen_xrefs:
            continue
        image = doc.extract_image(xref)
        imgfile = f"img{xref:05d}.{image['ext']}"
        with open(imgfile, "wb") as fout:
            fout.write(image["image"])
        seen_xrefs.add(xref)
        print(f"Page {page_num}: {imgfile} ({image['width']} x {image['height']}")

when not taking masks and color spaces into account.

@YAŞAREMREDOĞRU I'm sorry, I don't understand what you mean with that. — AKX, Feb 23 '23 at 12:54

cannot write mode PA as PNG

1 Answers1