Suppose I have a pdf file containing the following table info
Trainer: Giannis
Pokedex: Incomplete
Name | Type | Weight | Height | Color |
---|---|---|---|---|
Pikachu | Electric | 6.0 kg | 0.4 m | Yellow |
Bulbasaur | Grass/Poison | 6.9 kg | 0.7 m | Green |
Charizard | Fire/Flying | 90.5 kg | 1.7 m | Orange |
Jigglypuff | Normal/Fairy | 5.5 kg | 0.5 m | Pink |
Gyarados | Water/Flying | 235.0 kg | 6.5 m | Blue |
I am using the Form Parser to extract the table information.
If I know that the table columns will always be [Name, Type, ... , Color]
is there a way to pass this info to the FormParser
processor to help it better determine the header rows?
Thank u in advance for your time!