I want to know how to extract particular table column from pdf file in python.
My code so far
import tabula.io as tb
from tabula.io import read_pdf
dfs = tb.read_pdf(pdf_path, pages='all')
print (len(dfs)) [It displays 73]
I am able to access individual table column by doing print (dfs[2]['Section ID']) I want to know how can I search particular column in all data frame using for loop.
I want to do something like this
for i in range(len(dfs)):
if (dfs[i][2]) == 'Section ID ' //(This gives invalid syntax)
print dfs[i]