Take this PDF as an example. I can extract the table of contents (TOC) with dumppdf.py -T 1707.09725.pdf:
<outlines>
<outline level="1" title="1 Introduction">
<dest>
<list size="5">
<ref id="513"/>
<literal>XYZ</literal>
<number>99.213</number>
<number>742.911</number>
<null/>
</list>
</dest>
<pageno>14</pageno>
</outline>
<outline level="1" title="2 Convolutional Neural Networks">
<dest>
<list size="5">
<ref id="554"/>
<literal>XYZ</literal>
<number>99.213</number>
<number>742.911</number>
<null/>
</list>
</dest>
<pageno>16</pageno>
</outline>
...
Can I do something similar with PyPDF2?
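
For reference, here is a rough sketch of what I have in mind. It assumes a recent PyPDF2 release (2.x or 3.x), where PdfReader exposes the bookmarks as a nested list through the outline property and get_destination_page_number returns a zero-based page index. The helper names walk_outline and read_toc are my own, not part of PyPDF2, and I have not checked that the page numbers line up with the pageno values dumppdf.py reports.

```python
from typing import Any, Iterator, List, Tuple


def walk_outline(outline: List[Any], level: int = 1) -> Iterator[Tuple[int, Any]]:
    """Yield (level, item) pairs from a nested outline list.

    In the nested-list layout assumed here, the children of an entry
    appear as a sub-list right after that entry, so each nested list
    is one level deeper than the list that contains it.
    """
    for item in outline:
        if isinstance(item, list):
            yield from walk_outline(item, level + 1)
        else:
            yield level, item


def read_toc(path: str) -> List[Tuple[int, str, int]]:
    """Return (level, title, page) tuples for every outline entry in a PDF."""
    # Imported here so walk_outline can be used without PyPDF2 installed.
    from PyPDF2 import PdfReader

    reader = PdfReader(path)
    toc = []
    for level, dest in walk_outline(reader.outline):
        # get_destination_page_number gives a zero-based index;
        # add 1 to get a human-readable page number.
        page = reader.get_destination_page_number(dest) + 1
        toc.append((level, dest.title, page))
    return toc
```

Calling read_toc("1707.09725.pdf") and printing each tuple should then give one line per TOC entry, with the nesting depth in the first field. Older PyPDF2 versions use PdfFileReader and getOutlines() instead, so the names would need adjusting there.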