Python PyMuPDF looping next pages

Question

I'm using below code to open a PDF file and convert into an image file as output. Now, i'm trying to figure it out how can I loop the next page and convert it as same output file. Any help is much appreciated!

# display image on the canvas
def openFile(self, _value=False):
    global fileImg, output


    path = os.path.dirname(ustr(self.filePath)) if self.filePath else '.'
    fileImg = QFileDialog.getOpenFileName(self, '%s - Choose file' % __appname__, path)

    # convert PDF to image file
    pdffile = fileImg
    doc = fitz.open(pdffile)
    page = doc.loadPage(0)
    pix = page.getPixmap(matrix=fitz.Matrix(100 / 72, 100 / 72))
    output = "output.png"
    pix.writePNG(output)

score 2 · Accepted Answer · answered Aug 19 '20 at 10:01

2

You can simply loop over the doc object to get the next pages.

doc = fitz.open(file_name)  # open document
for page in doc:  # iterate through the pages
    pix = page.getPixmap(...)  # render page to an image
    pix.writePNG("page-%i.png" % page.number)  # store image as a PNG

check the PyMuPDF documentation for more information.

answered Aug 19 '20 at 10:01

Vishal Singh

6,014
2
17
33

what parameter do we pass in this part `pix = page.getPixmap(...) `? – YasserKhalil Feb 28 '22 at 14:54
1

@YasserKhalil try taking a look at the original documentation. – Vishal Singh Feb 28 '22 at 15:12

score 0 · Answer 2 · answered Aug 19 '20 at 06:13

You can use minecart and use this snippet to split pdf into images

import minecart
from PIL import Image    
file =open('Yourdoc.pdf','rb')
    doc = minecart.Document(file)
    page=doc.iter_pages()
    pageref=[]
    for j,i in enumerate( page):
        im = i.images[0].as_pil()
        im.save(f"folderlocation/{j}.jpg")

Python PyMuPDF looping next pages

2 Answers2