
Question


Perform OmniPage OCR in Pre-Classification if page doesn't have text

asked on January 22, 2021

I have a QuickFields session that identifies the first page of a document using barcode and pattern matching tokens, so I run OmniPage OCR in the pre-classification process. The files are processed using the Laserfiche Capture Engine (the documents are already in Laserfiche). Some of the files already have OCR text, so running OCR on them again is redundant and slow.

How can I check whether a page already has text and skip OCR for that page? If I can do that, the session should run faster than it does right now.


Replies

replied on January 22, 2021

In my experience, logic that either retrieves the existing text or OCRs the entire page will not give you much of a performance improvement.

From what I have seen, it works best to set it up like this:

Has text: process the document.

Has no text: just do post-processing. The post-processing would use remote OCR, which should speed up the QF session dramatically, because you will not be running OCR in that session and barcode reading should be extremely fast. The post-processing should include a WF that sends the OCRed document back to the QF source folder, so the session can retrieve its text and use it instead of running OCR.

I'm not sure about the whole flow of your QF session, but I would say that OCRing the entire page is probably the reason the PPM is so low.

 

***This would also apply to Zone OCR, but it really depends on the size of your zone.

 

BTW, if I'm wrong, please tell me that I'm wrong, lmao :) This is just my $0.02.

replied on January 22, 2021

Hi Ethan,

I will change the process to use a workflow on the staging folder (Inbox). If the document has text on all pages, the workflow will move it to the Inbox\A folder; if not, to the Inbox\B folder. From those two folders, I will run two separate sessions (one with OCR in pre-classification and the other without). Because we are using QF Agent, I will automate the process using both QF sessions.
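
The routing rule can be sketched in Python as follows. This only illustrates the decision logic and does not use the Laserfiche SDK or Workflow activities; the `Document` class, its `page_texts` field, and both function names are hypothetical, and the sketch assumes the text already stored for each page is available as a string (empty when the page has no text).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Document:
    """Hypothetical stand-in for a Laserfiche document entry."""
    name: str
    # Assumption: one string per page, "" when the page has no stored text.
    page_texts: List[str]


def has_text_on_all_pages(doc: Document) -> bool:
    """Return True only if the document has pages and every page has non-blank text."""
    return bool(doc.page_texts) and all(text.strip() for text in doc.page_texts)


def target_folder(doc: Document) -> str:
    """Pick the staging subfolder: A skips OCR, B runs OCR in pre-classification."""
    return r"Inbox\A" if has_text_on_all_pages(doc) else r"Inbox\B"


if __name__ == "__main__":
    docs = [
        Document("already_ocred", ["Invoice 123", "Page two text"]),
        Document("partly_ocred", ["Invoice 456", "   "]),
    ]
    for doc in docs:
        print(doc.name, "->", target_folder(doc))
```

A document with even one blank page goes to Inbox\B, so the session reading Inbox\A never needs to run OCR.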

Thanks for your input.

Pedro
