You are viewing limited content. For full access, please sign in.

Question

Question

Inaccurate document count with the extract text option for PDF using Universal Capture in Quickfields without the Zone OCR and Validation module in the QF license

asked on June 19, 2014

 I have a very unique situation and I would like to know if this is a bug, intended as working or a license issue. We have a Quickfields session that uses Universal Capture to read PDF's with extract text option and keep each file as a separate document enabled. We are capturing tokens (pattern matching) by using the name of the document. It is a very straight forward and basic QF session. We have two workstations that uses the session. The session is not being shared and is stored locally on each workstation. Station A is where the Quickfields session was created from. The current modules licensed on Station A is Zone OCR and Real Time lookup. Station B is the other Quickfields station and the only module licensed is Real Time Lookup.

 

There are 20 PDF's that the session will read and thus 20 documents should be created in Quickfields. Station A works fine every time and we never had an issue. However Station B, was not creating the same number of documents. The same 20 documents would be processed by Station B, but only 16 documents were identified. We could process the same 20 documents on Station B again, but the next time it, Quickfields would identify 14 instead.In order to fix that, I had to uncheck the option to extract text. After the option is disabled, the number of identified documents in Quickfields was correct. Before I tried disabling the extract option, I upgraded Quickfields to the latest version (9.0.1.481) and still did not work correctly. Unchecking the extract text option was the way to fix it.

 

So, my question is Zone OCR needed to use Extract Text option for PDF using Universal Capture? The only difference between the two workstations is one is licensed for Zone OCR is working correctly and the other one is not which is not working correctly.

 

Thanks for taking the time to read this.smiley

0 0

Answer

APPROVED ANSWER
replied on June 24, 2014

You can still open a support case, something is going on and we'll need to look into it more closely to be able to give you a good reason as to why it is happening.

0 0

Replies

replied on June 19, 2014

Zone OCR is not necessary to extract text from a PDF, the only interaction it has with it is it can use the extracted text instead of having to run the OCR on the image to read it.

To know why they're failing I'd need more information. What are the conditions? In the output pane what is it saying as the failing condition? Are you zone OCRing in that condition?

0 0
replied on June 20, 2014 Show version history

There are no conditions for first page identification. No, the output pane does not give any information. No, I am not using ZoneOCR in that condition. Since I am using the 'separate document' option, there is no need to identify the document.

0 0
replied on June 23, 2014

Are you keeping the PDF after using it? It may be worth opening a support case to look into this.

0 0
replied on June 24, 2014

We are moving the PDF's to another folder after Quickfields processes them. The only difference between the two workstations is the license which is why it falls out of the realm of support which why I am posting here. Of course, the workaround is to add the OCR process to the QF session.

0 0
APPROVED ANSWER
replied on June 24, 2014

You can still open a support case, something is going on and we'll need to look into it more closely to be able to give you a good reason as to why it is happening.

0 0
You are not allowed to follow up in this post.

Sign in to reply to this post.