Questions tagged [cloud-document-ai]

This tag is for the Document AI product within Google Cloud Platform.

200 questions
2
votes
1 answer

What are advanced_ocr_options[] for Google Cloud Vision?

I'm trying to find any information on the advanced_ocr_options[] under textdetectionparams of ImageContext. https://cloud.google.com/vision/docs/reference/rpc/google.cloud.vision.v1#textdetectionparams Google has several pages in their documentation…
2
votes
1 answer

Does Google Document AI support the Thai language?

For a Thai-language document, I am trying to extract the text and key-value pairs using Google Document AI. In the result, I can see that the Thai text is not retained. Is there any parameter that needs to be passed for Thai characters to…
2
votes
2 answers

Does Document AI not detect the shopping receipt?

I can successfully extract the details of the shopping receipt (products, store name, etc.) with the expense parser in Document AI. I wrote the store name and product name in handwriting on normal paper and uploaded it, but it still took out the…
codinges
2
votes
1 answer

Google DocumentAI -> ValueError: Protocol message Document has no "file" field

In my script, I have the following: response = requests.get(list_url[0], allow_redirects=True) s = io.BytesIO() s.write(response.content) s.seek(0) mimetype="application/octet-stream" document = {'file': s.read(), 'mime': mimetype} request =…
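A hedged sketch of one likely cause: the error says the Document message has no field named "file", and in the Python client the raw bytes and MIME type fields are named content and mime_type. The helper name build_raw_document is hypothetical, and whether the asker's client version accepts a plain dict in the request is an assumption.

```python
def build_raw_document(data: bytes, mime_type: str) -> dict:
    """Build a document payload using the field names `content` and `mime_type`.

    Hypothetical helper for illustration; it only renames the keys used in
    the question ('file' -> 'content', 'mime' -> 'mime_type').
    """
    if not isinstance(data, (bytes, bytearray)):
        raise TypeError("data must be raw bytes, e.g. response.content")
    return {"content": bytes(data), "mime_type": mime_type}


# Example with placeholder bytes; "application/pdf" is an assumed MIME type.
document = build_raw_document(b"%PDF-1.4", "application/pdf")
```

With requests, response.content is already bytes, so the io.BytesIO round trip in the question should not be needed.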
2
votes
2 answers

Document AI - Improving batch process time for a single document?

I'm working on a GCP Document AI project. First, let me say this - the OCR works fine :-). I'm curious whether it can be improved. Currently, I have a Python module that gets the OCR done for a TIFF…
Kris
2
votes
1 answer

Tables recognition using Google Vision API

I use the OCR function (DOCUMENT_TEXT_DETECTION) of the Google Vision API to process various medical documents. Some of them contain tables. According to Google's documentation, there is a special BlockType for tables. However, I can't get this sort of…
Dmitrii
2
votes
1 answer

How can I convert a "google.cloud.documentai_v1.types.document" object to JSON?

I am using Google Cloud Document AI's Invoice Parser. The API response is a google.cloud.documentai_v1.types.Document object. I tried the approaches below to convert it to JSON, but none of them works: json.dumps(), but it gives…
kushagra
2
votes
1 answer

Permission denied when invoking Document AI v1beta3 from Cloud SDK interactive Python (IPython) while following the GCP tutorial for Form_Parser

I am following the tutorial at https://codelabs.developers.google.com/codelabs/docai-form-parser-v3-python#7 and followed all the steps it specifies. I used the Cloud SDK for development, as specified in the tutorial, but then the code…
Code99
2
votes
1 answer

DocumentError "Permission 'documentai.processors.processOnline' denied on resource 'my processors' (or it may not exist)"

I'm trying to implement Document AI with Node.js and I'm stuck trying to run DocumentProcessorServiceClient(). I got this error: Error: 7 PERMISSION_DENIED: Permission 'documentai.processors.processOnline' denied on resource…
2
votes
1 answer

FixedCredentialsProvider gives unauthorized exception when calling Google Cloud service

I am trying to call Google Cloud DocumentAI through a Google service account. I have the JSON key that was generated for it, and I load it into my application via the FixedCredentialsProvider and a GoogleCredentials object since it's not possible to…
PentaKon
1
vote
1 answer

Laravel: Expect String Error for Google DocAI - but variable is string

I am trying to use the DocAI API to parse invoices. I am using Laravel, but I don't know why the string validation doesn't work correctly: my variable is a string but doesn't pass the checkstring function, which returns: InvalidArgumentException …
1
vote
1 answer

How to extract different sections of a PDF with Document AI

I want to be able to show a list of different sections of the PDF file like what is shown on the. I'm calling the processor through the REST API via Flutter Web. I tried getting the entities from the API response using fieldMask but got nothing for the…
1
vote
1 answer

Can I get an already recognized document through Google OCR or do I have to request recognition again?

The page https://cloud.google.com/document-ai/docs/process-documents-client-libraries describes how to send a document for recognition and receive various data about it. But the question is: can I get a document that I have already sent for recognition by…
Kronchik X
1
vote
1 answer

One Processor For Multiple Documents

Is it possible to use one Custom Document Extractor for different documents? Training and testing data would be provided for each document, but would this affect the overall efficiency of the processor? Is it recommended to create a processor per…
sk2023
1
vote
1 answer

Document AI - Converting the normalized_vertices to the original scale of the document

I am using the Google Cloud Document AI service. I have custom-built some processors for "form data extraction" using the "Custom Entity Extractor", which processes PDF documents. I annotated the dataset and completed training my model. Now I am able…
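A minimal sketch of the conversion named in the title, assuming that normalized vertices are fractions in [0, 1] of the page width and height, and that the page size in pixels is known (for example, from the page dimension reported in the response). The function name denormalize_vertices is hypothetical and not part of the Document AI client.

```python
from typing import List, Tuple


def denormalize_vertices(
    normalized: List[Tuple[float, float]],
    page_width: float,
    page_height: float,
) -> List[Tuple[float, float]]:
    """Scale (x, y) pairs from the [0, 1] range to page coordinates.

    Assumption: x is a fraction of the page width and y is a fraction of
    the page height, both measured from the top-left corner.
    """
    return [(x * page_width, y * page_height) for x, y in normalized]


# Example: a box on a page assumed to be 1000 x 2000 pixels.
box = denormalize_vertices(
    [(0.5, 0.25), (0.75, 0.25), (0.75, 0.5), (0.5, 0.5)], 1000, 2000
)
```

If the PDF was rendered at a different resolution than the one the processor reports, the result would need one more scaling step by the ratio of the two sizes.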