
I need to train a custom document classifier through the Document AI API, but I couldn't find anything related in the documentation or code samples. In particular, I was able to define a new custom processor, but I don't know how to define my training and test sets.

The code sample closest to what I need is this: Train a processor version. It uses the class documentai.BatchDocumentsInputConfig to define the training and test sets, but I don't understand how I can use it to associate a label with my documents on Cloud Storage.

I suspect that this functionality is not available via the API. Can you help me? Thanks

1 Answer


The code sample you linked requires the Document JSON files in Google Cloud Storage to be labeled already.
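As a quick sanity check before training, something like the following could tell you whether a given Document JSON file carries any labels. This is only a sketch: it assumes the labels are stored as entries in the top-level `entities` array, each with a `type` field, and the helper names `document_labels` and `is_labeled` are made up for illustration.

```python
import json
from typing import List


def document_labels(document_json: str) -> List[str]:
    """Return the entity types found in one Document JSON string.

    Assumption: labels live in the top-level "entities" array and
    each labeled entity has a non-empty "type" field.
    """
    document = json.loads(document_json)
    return [
        entity["type"]
        for entity in document.get("entities", [])
        if entity.get("type")
    ]


def is_labeled(document_json: str) -> bool:
    """True if the Document JSON contains at least one typed entity."""
    return bool(document_labels(document_json))


# Hypothetical example content, not taken from a real export.
sample = '{"text": "Invoice 123", "entities": [{"type": "invoice"}]}'
print(document_labels(sample))
print(is_labeled('{"text": "no labels here"}'))
```

You would read each file from your bucket (or a local copy) and pass its contents to `is_labeled`; files that come back unlabeled are probably not usable as training or test data.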

There is no public API for explicitly labeling documents. The recommended process is to use the Cloud Console to create the labeled data, then use the training API to trigger the training process.
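For the second step, a minimal sketch along the lines of the "Train a processor version" sample might look like this. It assumes a recent `google-cloud-documentai` release that exposes `TrainProcessorVersionRequest`, and that both Cloud Storage prefixes contain Document JSON files already labeled in the Console. The function names and all argument values are placeholders of mine.

```python
def processor_name(project_id: str, location: str, processor_id: str) -> str:
    """Build the full resource name of a processor."""
    return f"projects/{project_id}/locations/{location}/processors/{processor_id}"


def train_from_labeled_data(
    project_id: str,
    location: str,
    processor_id: str,
    display_name: str,
    train_uri: str,
    test_uri: str,
):
    """Start training a new processor version from labeled Document JSON files."""
    # Imported here so processor_name() works without the client library.
    from google.api_core.client_options import ClientOptions
    from google.cloud import documentai

    client = documentai.DocumentProcessorServiceClient(
        client_options=ClientOptions(
            api_endpoint=f"{location}-documentai.googleapis.com"
        )
    )

    # Each prefix points at a folder of labeled Document JSON files.
    input_data = documentai.TrainProcessorVersionRequest.InputData(
        training_documents=documentai.BatchDocumentsInputConfig(
            gcs_prefix=documentai.GcsPrefix(gcs_uri_prefix=train_uri)
        ),
        test_documents=documentai.BatchDocumentsInputConfig(
            gcs_prefix=documentai.GcsPrefix(gcs_uri_prefix=test_uri)
        ),
    )

    request = documentai.TrainProcessorVersionRequest(
        parent=processor_name(project_id, location, processor_id),
        processor_version=documentai.ProcessorVersion(display_name=display_name),
        input_data=input_data,
    )

    # Returns a long-running operation; training continues server-side.
    return client.train_processor_version(request=request)
```

Note that `BatchDocumentsInputConfig` only says where the files are. The label itself travels inside each Document JSON file, which is why the labeling has to happen beforehand.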

Holt Skinner