Questions tagged [amazon-textract]

Amazon Textract enables document text detection and analysis in applications. The Amazon Textract Text Detection API can detect text in a variety of documents including financial reports, medical records, and tax forms. For documents with structured data, you can use the Amazon Textract Document Analysis API to detect linked text, tables, option buttons (radio buttons), and check boxes.

Amazon Textract documentation

226 questions
0
votes
1 answer

Is the AWS Textract API configurable?

I want to extract structured text from PDF contracts with AWS Textract. Is the service configurable? For example, can I set the minimum vertical blank space the system uses to split paragraphs? Thank you!
0
votes
2 answers

Textract async read PDF

From the Textract documentation: Documents for synchronous operations can be in PNG or JPEG format. Documents for asynchronous operations can also be in PDF format. I have a Node.js application where I use async Textract to read a PDF file. My code…
AGoranov
  • 2,114
  • 3
  • 15
  • 27
0
votes
1 answer

Filtering out data returned by AWS Textract function

I have extracted the data returned by an AWS Textract function. The response has the following structure: { "AnalyzeDocumentModelVersion": "string", "Blocks": [ { "BlockType": "string", …
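For illustration only: a minimal Python sketch of one way to filter such a response, assuming it has already been parsed into a dict with the `Blocks` / `BlockType` shape quoted above. The helper names (`filter_blocks`, `line_texts`) and the sample data are hypothetical, not part of any SDK.

```python
def filter_blocks(response, block_type):
    """Return the blocks of a Textract-style response whose BlockType matches."""
    return [
        block
        for block in response.get("Blocks", [])
        if block.get("BlockType") == block_type
    ]


def line_texts(response):
    """Return the Text of every LINE block, in response order."""
    return [b["Text"] for b in filter_blocks(response, "LINE") if "Text" in b]


# Hand-written sample (not real Textract output), trimmed to the fields used here.
sample_response = {
    "Blocks": [
        {"BlockType": "PAGE", "Id": "p1"},
        {"BlockType": "LINE", "Id": "l1", "Text": "Invoice 42"},
        {"BlockType": "WORD", "Id": "w1", "Text": "Invoice"},
        {"BlockType": "WORD", "Id": "w2", "Text": "42"},
        {"BlockType": "LINE", "Id": "l2", "Text": "Total due"},
    ]
}

print(line_texts(sample_response))
```

Filtering on `BlockType` first keeps the rest of the processing independent of how many block kinds the response contains.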
0
votes
1 answer

Given the JobId from the AWS Textract StartDocumentAnalysis method, how do I retrieve the extracted text?

I have this code here- const _ = require("lodash"); const aws = require("aws-sdk"); const config = require("./config"); aws.config.update({ accessKeyId: config.awsAccesskeyID, secretAccessKey: config.awsSecretAccessKey, region:…
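The question uses the Node.js SDK; as a rough sketch in Python, the same flow with boto3 might look like the following. It assumes `client` is a Textract client (for example `boto3.client("textract")`) and relies on the `GetDocumentAnalysis` operation, which accepts `JobId` and `NextToken` and returns `JobStatus`, `Blocks`, and an optional `NextToken`. The function name, polling interval, and poll limit are hypothetical choices.

```python
import time


def fetch_analysis_blocks(client, job_id, poll_seconds=5.0, max_polls=60):
    """Poll a StartDocumentAnalysis job, then collect the blocks from all result pages."""
    for _ in range(max_polls):
        page = client.get_document_analysis(JobId=job_id)
        status = page["JobStatus"]
        if status != "IN_PROGRESS":
            break
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"job {job_id} still in progress after {max_polls} polls")

    if status not in ("SUCCEEDED", "PARTIAL_SUCCESS"):
        raise RuntimeError(f"job {job_id} ended with status {status}")

    blocks = list(page.get("Blocks", []))
    # Results are paginated: keep requesting pages while a NextToken is returned.
    while page.get("NextToken"):
        page = client.get_document_analysis(JobId=job_id, NextToken=page["NextToken"])
        blocks.extend(page.get("Blocks", []))
    return blocks


# Usage (requires AWS credentials and a real job ID, so it is left commented out):
# import boto3
# blocks = fetch_analysis_blocks(boto3.client("textract"), "your-job-id")
# text = [b["Text"] for b in blocks if b["BlockType"] == "LINE"]
```

Polling is the simplest option for a sketch; a production setup would more likely react to the job's completion notification instead of sleeping in a loop.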
0
votes
1 answer

Is there any user interface for viewing AWS Textract OCR output side by side with the source document?

AWS Textract outputs a JSON result after you pass it a document. I'm interested in using this output to view the source document (say, a PDF) and the output side by side. Is this supported by any tool right now? For example,…
Judy007
  • 5,484
  • 4
  • 46
  • 68
0
votes
1 answer

AWS Textract: get checked checkbox values from a PDF document in C#

I am using Amazon Textract to extract the text from a PDF document (refer link), and it's working fine. I need to get the checked items of the checkboxes from the same PDF document. How do I get the checkbox selection from a PDF document? Please shed some light.
Ash
  • 469
  • 1
  • 8
  • 23
0
votes
2 answers

Parse / extract a table from a messy .csv file?

I am parsing an image (PNG) with Amazon Textract and extracting the tables. Here is an example of such a CSV when I open it with open(file_name, "r") and read its lines: ['Table: Table_1\n', '\n', 'Test Name ,Result ,Flag ,Reference Range ,Lab…
SteveS
  • 3,789
  • 5
  • 30
  • 64
0
votes
1 answer

OCR from game scoreboard

I want to get scoreboard/leaderboard data from images of different games, such as Battlefield and Rocket League. For that, I used the Amazon Textract API to get scoreboard data from the image. But the Amazon Textract API sometimes gives me a wrong result…
0
votes
0 answers

AWS Textract not picking up radio buttons

According to AWS Textract's documentation, it is capable of picking up checkboxes and radio buttons. However, I'm uploading literally the simplest example I can think of: a clean sheet with clearly spaced radio button options: I tried another version,…
Yu Chen
  • 6,540
  • 6
  • 51
  • 86
0
votes
1 answer

Can Microsoft Computer Vision extract font information?

I am extracting text using Microsoft Computer Vision, and it returns a JSON response. Can Microsoft Computer Vision extract font information along with the JSON response? Or, for that matter, does any OCR technology return…
0
votes
1 answer

AWS Textract table extraction splits integers containing a comma into a separate column

I would like to use AWS Textract in Python to convert my image into tables and download them as CSV. So, I followed the documentation and example code from AWS here:…
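One plausible cause of this symptom (an assumption, since the excerpt is truncated) is that cell text is joined with bare commas, so a value like "1,234" becomes two fields. A minimal Python sketch of a workaround is to let the standard `csv` module quote such cells. The `rows_to_csv` helper and the sample rows are hypothetical.

```python
import csv
import io


def rows_to_csv(rows):
    """Serialize rows of cell strings to CSV text, quoting cells that contain commas."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, quoting=csv.QUOTE_MINIMAL)
    writer.writerows(rows)
    return buffer.getvalue()


# Hand-written sample table; the second row has a thousands separator in one cell.
rows = [["Item", "Amount"], ["Widgets", "1,234"]]

csv_text = rows_to_csv(rows)
parsed = list(csv.reader(io.StringIO(csv_text)))
print(parsed)
```

Reading the text back with `csv.reader` keeps "1,234" in a single column because the writer wrapped it in quotes.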
0
votes
1 answer

Textract: Response status code does not indicate success: 400 (Bad Request) + IAM keys not found

I have pushed my Textract code to a staging server, and now I am receiving an error. It works on a development system, and I can't understand why this is happening. I am using .NET Core 3.0 and following the code sample provided here. …
Varinder
  • 1,780
  • 2
  • 11
  • 18
0
votes
1 answer

Amazon Textract - How to define my key-value pairs

I have tried Textract and I can see that it extracts a few interesting key-value pairs. I have an image dataset in which each image is annotated with a set of domain-specific key-value pairs that differ from what Textract found. Is there any way to make Textract…
Rams
  • 63
  • 3
0
votes
0 answers

Use Google Vision Python client from a FileMaker Pro (NOT ON A SERVER) solution built for historical research?

I'm what Cornelis van Lit from Digital Orientalist calls a "centaur," or a scholar who devotes time to developing software solutions to humanities research problems. I've run into a problem that I have NOT BEEN able to solve by searching Stack Overflow…
0
votes
4 answers

Amazon Textract - How to extract key-value pairs

I am using the Amazon Textract .NET SDK to extract text from images. It returns a list of blocks as part of the response. I need to extract key-value pairs from the extracted text. I guess we need to iterate through the list of blocks, check for…
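The question targets the .NET SDK; as a language-neutral illustration, here is a minimal Python sketch of the block traversal. It assumes the blocks come from a form analysis, where `KEY_VALUE_SET` blocks carry `EntityTypes` of `KEY` or `VALUE`, a key links to its value through a `VALUE` relationship, and both link to `WORD` or `SELECTION_ELEMENT` children through `CHILD` relationships. The function names and sample blocks are hypothetical.

```python
def _related_ids(block, relationship_type):
    """Collect the Ids listed under relationships of the given type."""
    ids = []
    for relationship in block.get("Relationships") or []:
        if relationship.get("Type") == relationship_type:
            ids.extend(relationship.get("Ids", []))
    return ids


def _block_text(block, by_id):
    """Join the text of a block's WORD children; checkboxes yield their SelectionStatus."""
    parts = []
    for child_id in _related_ids(block, "CHILD"):
        child = by_id.get(child_id)
        if child is None:
            continue
        if child.get("BlockType") == "WORD":
            parts.append(child["Text"])
        elif child.get("BlockType") == "SELECTION_ELEMENT":
            parts.append(child["SelectionStatus"])
    return " ".join(parts)


def key_value_pairs(blocks):
    """Map each form key's text to the text of its linked value block(s)."""
    by_id = {block["Id"]: block for block in blocks}
    pairs = {}
    for block in blocks:
        if block.get("BlockType") != "KEY_VALUE_SET":
            continue
        if "KEY" not in block.get("EntityTypes", []):
            continue
        values = [
            _block_text(by_id[value_id], by_id)
            for value_id in _related_ids(block, "VALUE")
            if value_id in by_id
        ]
        pairs[_block_text(block, by_id)] = " ".join(values)
    return pairs


# Hand-written sample blocks (not real Textract output), trimmed to the fields used.
sample_blocks = [
    {"Id": "k1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
     "Relationships": [{"Type": "VALUE", "Ids": ["v1"]}, {"Type": "CHILD", "Ids": ["w1"]}]},
    {"Id": "v1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
     "Relationships": [{"Type": "CHILD", "Ids": ["w2", "w3"]}]},
    {"Id": "k2", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
     "Relationships": [{"Type": "VALUE", "Ids": ["v2"]}, {"Type": "CHILD", "Ids": ["w4"]}]},
    {"Id": "v2", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
     "Relationships": [{"Type": "CHILD", "Ids": ["s1"]}]},
    {"Id": "w1", "BlockType": "WORD", "Text": "Name:"},
    {"Id": "w2", "BlockType": "WORD", "Text": "Jane"},
    {"Id": "w3", "BlockType": "WORD", "Text": "Doe"},
    {"Id": "w4", "BlockType": "WORD", "Text": "Subscribed"},
    {"Id": "s1", "BlockType": "SELECTION_ELEMENT", "SelectionStatus": "SELECTED"},
]

print(key_value_pairs(sample_blocks))
```

Indexing the blocks by `Id` once keeps the relationship lookups cheap, and the same traversal translates directly to the block list returned by the .NET SDK.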