I want to split a long PDF document into many parts, e.g. part 1 comprising pages 3-14, part 2 comprising pages 15-19, part 3 comprising pages 20-27, using PyPDF2.
I coded a loop that takes the relevant pages out of the original PDF and saves them as a new doc, for each part. The only problem is that part 2 still includes all the pages from part 1, and part 3 still includes the pages from parts 1 & 2.
I assume I somehow have to 'reset' output = PdfFileWriter(), but if I put it into the while loop I get a long error message.
output = PdfFileWriter()
input = PdfFileReader(open("%s" % pdf, "rb"))
current_row = 2
i = sheet.cell(row = current_row, column = 4).value
j = sheet.cell(row = current_row, column = 5).value
org = sheet.cell(row = current_row, column = 1).value
n = sheet.cell(row = current_row, column = 7).value
while i > 0:
while i <= j:
p = i-1
output.addPage(input.getPage(p))
i += 1
print(i, p, j)
print org
outputStream = file("%s_%s_%s.pdf" % (mysheet, n, org), "wb")
output.write(outputStream)
outputStream.close()
current_row += 1
i = sheet.cell(row = current_row, column = 4).value
j = sheet.cell(row = current_row, column = 5).value
org = sheet.cell(row = current_row, column = 1).value
n = sheet.cell(row = current_row, column = 7).value