Amazon Textract enables document text detection and analysis in applications. The Amazon Textract Text Detection API can detect text in a variety of documents including financial reports, medical records, and tax forms. For documents with structured data, you can use the Amazon Textract Document Analysis API to detect linked text, tables, option buttons (radio buttons), and check boxes.
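As a rough illustration of the two APIs described above, here is a minimal Python sketch, assuming the boto3 SDK and configured AWS credentials. The helper names `build_textract_request`, `extract_lines`, and `run_textract` are hypothetical and are not part of any AWS library. Only `detect_document_text` and `analyze_document` are boto3 Textract client operations.

```python
from typing import Any, Dict, List, Optional


def build_textract_request(
    document_bytes: bytes,
    feature_types: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Build keyword arguments for a synchronous Textract call (hypothetical helper).

    Without feature types the dict suits detect_document_text; with feature
    types such as "TABLES" or "FORMS" it suits analyze_document.
    """
    request: Dict[str, Any] = {"Document": {"Bytes": document_bytes}}
    if feature_types:
        request["FeatureTypes"] = list(feature_types)
    return request


def extract_lines(response: Dict[str, Any]) -> List[str]:
    """Collect the text of every LINE block from a Textract response dict."""
    return [
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == "LINE"
    ]


def run_textract(
    document_bytes: bytes,
    feature_types: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Call Textract; requires boto3 and valid AWS credentials."""
    import boto3  # imported lazily so the helpers above work without the SDK

    client = boto3.client("textract")
    request = build_textract_request(document_bytes, feature_types)
    if feature_types:
        # Document Analysis API: tables, forms, and other linked structure
        return client.analyze_document(**request)
    # Text Detection API: plain lines and words
    return client.detect_document_text(**request)
```

For example, `extract_lines(run_textract(image_bytes))` would return the detected lines of text, while `run_textract(image_bytes, ["TABLES", "FORMS"])` would request structured analysis.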
Questions tagged [amazon-textract]
226 questions
0 votes, 1 answer
Amazon Textract: How to select 'Raw text' option
We are trying to integrate the Amazon Textract API into our Node.js application. We are having trouble with the FeatureType parameter when processing an image. We need to achieve the option below via the API:
We cannot find the option in the AWS JavaScript…

Robert
0 votes, 0 answers
TypeError: a bytes-like object is required, not 'SimpleUploadedFile'
I am receiving a base64 image via a JSON post action into my Django REST Framework API. Once I have the image, I am using https://aws.amazon.com/textract/ to get the fields and related values from the image. I would like to send the image to…

Etienne
0 votes, 0 answers
SSL validation failed [SSL: WRONG_VERSION_NUMBER] error in AWS
I am trying to access AWS S3 or Textract on my company's remote computer (VDI), which also uses some kind of proxy. I get these errors when using the remote computer, but everything works fine from my personal computer. I have to contact…

Hariharan Kannan
0 votes, 1 answer
Is there a way to show a PDF in its original structure in the human review custom entity labelling in AWS SageMaker?
I have modified this sample to read PDFs in tabular format. I would like to keep the tabular structure of the original pdf when doing the human review process. I notice the custom worker task template uses the crowd-entity-annotation element which…

Luis
0 votes, 1 answer
Endpoint is weird Amazon Textract Python
I'm trying to use Textract in Python. I got the code from this URL: https://github.com/aws-samples/amazon-textract-code-samples/blob/c8f34ca25113100730e0f4db3f6f316b0cff44d6/python/02-detect-text-s3.py.
I only changed s3BucketName and documentName…
user15256253
0 votes, 2 answers
Amazon Textract without using Amazon S3
I want to extract information from PDFs using Amazon Textract (as in How to use the Amazon Textract with PDF files). All the answers and the AWS documentation require the input to be Amazon S3 objects.
Can I use Textract without uploading the PDFs…

jkortner
0 votes, 1 answer
How to extract text from an image with a variety of noisy texts and numbers?
I have an image here:
I need to extract the meter readings from this image which is "0005053" at the centre.
I have tried pytesseract as follows:
import pytesseract
from PIL import Image
text =…

vaisxn
0 votes, 1 answer
How to read from an Amazon S3 Bucket and call AWS services
I am able to call AWS Textract to read an image from my local path. How can I integrate this Textract code with my S3 bucket code so that it reads an image uploaded to an S3 bucket?
Working Textract code that reads images from a local path:
package…

Alan Chu
0 votes, 1 answer
How to use AWS Textract in React/NextJS
Hey, I'm hoping someone can help me tidy this up or even just point me in the right direction. I'm hoping I'm at least close or on the right track.
Firstly I am using this.
My code is below.
UPDATE: Error: err TypeError: input.filter is not a…

MomasVII
0 votes, 0 answers
Error: ENOENT: no such file or directory, open 'C:\Users\\.aws\credentials'
Running node index.js scan helloworld gives the above error:
Error: ENOENT: no such file or directory, open 'C:\Users.aws\credentials'
The credentials.txt file is in the above folder path,
and its contents are already in place. This issue is…

Srikanth Bejjenky
0 votes, 1 answer
How to rearrange and map data from one CSV to another CSV using python
I have extracted table data from an image containing multiple tables using Amazon Textract, and I am trying to map all the extracted data into an output template CSV.
However, there are multiple tables in the extracted input CSV file, which are listed one below…

NKJ
0 votes, 2 answers
Export all table data from PDF to Excel using Amazon Textract
I am looking to extract PDF data to Excel/CSV using Amazon Textract. How can we supply the input PDF from a local folder?
The PDF has multiple tables, and we need to extract all the tables from their respective pages and export the data to…

Manz
0 votes, 3 answers
Why am I getting the error "UnhandledPromiseRejectionWarning: TypeError: Cannot read property 'forEach' of undefined"?
I have an index.js, gettext.js and extract.js.
index.js
app.post("/anaData", async function(req, res) {
const someData = await docExtract(req.body['file_name'])
const data = await getData(someData)
.....
....
}
gettext.js
async function…

elementory
0 votes, 1 answer
AWS: S3 bucket policy with Textract
I want to give AWS Textract access to my specific S3 bucket, but I am struggling to find the ideal bucket policy.
After some research, I found that the aws:CalledVia condition key may be appropriate, but I don't know how to write the policy.
Would…

Deep
0 votes, 1 answer
AWS Textract to create searchable PDF - looking for python code
I would like to extract handwritten text from a scanned image using, say, Amazon Textract, and then create a searchable PDF from the output, that is, convert the image into a PDF with a text layer.
Amazon has provided a blog…

jim70