0

When a batch gets created documents should get separated automatically without using separator sheet or Barcode separator.

How can I classify documents for Invoice and supporting document. In our project we get many invoices with supporting document so the scanning person has to insert the separator sheets manually, so to avoid this we want to automatically classify the supporting documents.

Manish Sharma
  • 61
  • 1
  • 2
  • 9

2 Answers2

1

In general the concept would be that you would enable separation in the project and then train your classes with examples to be used for the layout or content classifiers.

However, as I'm sure you've seen, the obstacle with invoices is that they are different enough between vendors that it would not reliably classify all to an Invoice class. Similarly with "Supporting Documents" which are likely to be very different from each other, so unfortunately there isn't a completely easy answer without separator sheets (or barcode stickers affixed to supporting docs).

What you might want to do is write code in the one of the separation events like Document_AfterSeparate event. Despite the name, the document has not yet been split at this point, but the classifiers have run. See Scripting Help topic "Server Script Events Sequence > Document Separation > Standard Document Separation" for more detail. Setting the SplitPage property on the CDocPage (pXDoc.CDoc.Pages.ItemByIndex(lPage).SplitPage) will allow you to use your own logic to determine which pages to separate.

For example if you know that you will always have single page invoices, you can split on the first page and classify accordingly. Or you can try to search for something that indicates the end of the invoice like "Total" or other characteristics. There is an example of how you can use locators to help separation in the Scripting Help topic "Script Samples > Use Locator Results for Standard Document Separation". The example uses a Barcode Locator, but the same concept works if you wanted to try it with a Format Locator or anything else.

Stephen Klancher
  • 1,374
  • 15
  • 24
0

Without Separator sheets you will need a smart classification software like Kofax Transformation Module (KTM). Its kind of expensive. you will need to verify the cost saving and ROI.

  • We have KTM with us how can we achieve this please guide – Manish Sharma Dec 30 '14 at 13:46
  • Dear Manish. Kofax always recommends that a certified processional is to make the configurations (otherwise they wont support it). In the certification training you will find that when you add KTM server in the KC workflow you will have the full range of classification and extraction options available inside KTM. – Marcos Souza Dec 30 '14 at 13:55
  • Dear Marcos, I know how to classify the documents using barcode separator and using Separator sheets but I have requirement where we want to do separation without using separator sheets and Bar code separation. – Manish Sharma Dec 31 '14 at 11:32
  • Dear Manish, Working with KTM is a totally different thing. You have to create your project inside KTM Project Builder, create the classes , feed reference document samples for document separation and classification inside each class. Once you see it working inside the builder you have to Synchronize the project with Kofax Capture. Thhe synchronization creates the classes and indexes inside the KC structure for production. As you see is not quite simple procedure and requires a certified professional. – Marcos Souza Dec 31 '14 at 13:21
  • Your KC workflow will be something like this SCAN -> KTM Server -> KTM Validation -> Export. You will have to remove the Recognition Server and Validation, because KTM is a different solution. – Marcos Souza Dec 31 '14 at 13:23