Questions tagged [cloud-document-ai]

This tag is for the Document AI product within Google Cloud Platform.

200 questions
0 votes
1 answer

Running Google Cloud Document AI sample code in Python returns error 503

I am trying the example from the Google repo (https://github.com/googleapis/python-documentai/blob/HEAD/samples/snippets/quickstart_sample.py) and I get an error: metadata=[('x-goog-request-params',…
mommomonthewind
0 votes
1 answer

Give Document AI direct access to local files

I know that Document AI can be called from a Python environment on a local system. In that process, the local file must first be uploaded to a GCS bucket so that Document AI can access it from there. Is there any way by which we can…
0 votes
2 answers

Translate language in image preserving structure

We are looking to translate images found in PDF documents from different languages to English. They are scanned images and often contain tables or some other structure. We would like to translate them to English but preserve the structure of the document…
0 votes
0 answers

Google Cloud client libraries for Document AI APIs in Laravel hosted on Microsoft Azure

I am using the v1 API to extract table coordinates in a Laravel web app, but I have to generate new tokens after they expire. curl -X POST \ -H "Authorization: Bearer "$(gcloud auth application-default print-access-token) \ -H "Content-Type:…
Pankaj S Y
0 votes
1 answer

Document AI v1beta3 API coordinate mapping to PDF coordinates

I parsed a PDF file using API version v1beta3 and got the coordinates of a table, as shown below. "normalizedVertices": [ { "x": 0.6894705, "y": 0.016400337 }, { …
Pankaj S Y
0 votes
1 answer

Using Document AI on a PDF with embedded text still produces text errors

Using the Document AI Processor to extract text from PDFs (English, German, Italian) works quite well, but the OCR sometimes makes mistakes. Especially in situations where the "word" is not a word from a dictionary, but has problems with part numbers…
mooose
0 votes
0 answers

Adding authentication when calling GCP Document AI

I need to call GCP Document AI, specifically the OCR processor, from Swift using a base64-encoded image. However, when I send my request using the code below, it always times out due to the missing authorization header. Here's the link to…
0 votes
1 answer

Document AI Invoice Processor - inconsistent bounding polygons for extracted entities

I use the Document AI Invoice Processor for processing scanned invoices. I am using the Java client libraries. I recently noticed that there is an inconsistency in the bounding polygons of the extracted entities when the input image is rotated 90deg…
antk
0 votes
3 answers

How can I split a PDF in Google Cloud Storage?

I have a single PDF and would like to create a separate PDF for each of its pages. How can I do so without downloading anything locally? I know that Document AI has a file splitting module (which would actually identify different files.…
0 votes
1 answer

Python - Google Cloud Document AI API - Not reading the whole .pdf file

I am trying to read a PDF stored in GCS in Python using the Google Document AI API and return the text from the PDF as a string. I do not want the parser to read tables and images, as I am only interested in text. Below is the code I am using to parse the…
0 votes
1 answer

Exception in SynchronizationContext when calling OperationFuture outside of callstack

I am using Google's DocumentAI SDK, but this error seems to stem from the gRPC SDK. I am calling an asynchronous operation in DocumentAI which returns an OperationFuture. When I call the method OperationFuture.get() within the call stack frame where…
PentaKon
0 votes
1 answer

GCP Document AI Example Not Working - Receiving INVALID_ARGUMENT: Request contains an invalid argument

The error is in the batchProcessDocuments: { code: 3, details: 'Request contains an invalid argument.', metadata: Metadata { internalRepr: Map { 'grpc-server-stats-bin' => [Array] }, options: {} }, note: 'Exception occurred in…
0 votes
1 answer

Permission denied when invoking Document AI v1beta3 from Cloud Function

I'm trying to call Document AI v1beta3 from a Cloud Function with the following code snippet: client = documentai.DocumentProcessorServiceClient() input_doc = documentai.types.Document( content=file_to_be_extract, …
imationyj
0 votes
2 answers

How can I send a PDF document to Google Cloud Document AI using the v1beta3 API?

I have been successful in sending a PDF file stored in GCS to the Document AI v1beta2 API. But in the v1beta3 API, the file approach is no longer supported. It requires me to send the content in the JSON. Here is the documentation I am following:…
David
0 votes
2 answers

Google Document AI doesn't parse form

I am working with the form parser in Google Document AI. When I send the request: curl -X POST -H "Authorization: Bearer "$(gcloud auth application-default print-access-token) -H "Content-Type: application/json; charset=utf-8" -d @request.json…