Aws Textract Tables and Rawtext in a single document

Asked Oct 10 '22 at 13:36

Active Oct 11 '22 at 05:33

Viewed 254 times

I am using Amazon Textract to extract data from pdf files to an s3 location. My documents consist of Tables and Paragraphs. when I parse through the extract i get the raw text of all data however the table data is distorted. Below is the actual image. The data is read line or word by word but not the contents on the Table within or inside a block. Is it possible to get the block data in raw text as if i choose tables it ignores rest of the stuff not in table

enter image description here

edited Oct 11 '22 at 05:33

asked Oct 10 '22 at 13:36

Barry

Aws Textract Tables and Rawtext in a single document

0 Answers0