PDF Zone OCR Not Correct Compared to Sample

asked on February 17, 2015

I have been working on a QF Session (9.0.1.481) that will process a multipage PDF using Universal Capture. I have the "retrieve files" set for PDF files and the "Document Content" set to "Generate images for each page" (nothing else selected).

In the class I added a new sample page, but had to save a page from the actual PDF as a tiff since the Sample Pages does not support adding a PDF. Under "First Page Identification" I added an Zone OCR process that looks to see if the word Transcript is contained in the text.

Under "Page Processing" I have a Zone OCR that has 5 OCR regions.

On the sample page, it classifies and processes the document correctly without any errors, but when I try it with a PDF, it fails by not classifying the document. After manually assigning a document class it fails to grab any of the correct values for the 5 OCR regions.

Here is what the top of the page looks like that it grabs the OCR from:

Here is the result when running the PDF through:

It seems to be trying the Zone OCR in the wrong places because it is grabbing the students address for the DOB and the Student Name for the Student Number.

Any ideas on how to correct this?

0 0

SELECTED ANSWER

replied on February 17, 2015

Do the sample and the scanned image have the same resolution?

0 0

View 1 previous reply

replied on February 17, 2015

Nevermind. I played with exporting it as a .tiff file and changing out the sample image in Quick Fields until the zonal OCR fields looked close to where it was pulling values. After that, I adjusted the fields and then it worked like a charm. Thanks Miruna!

0 0

replied on December 11, 2015

Blake, I am having the same issue here. What do you mean by exporting it as a .tiff? Where are you exporting it from?

0 0

replied on December 11, 2015

You can either scan it through Quick Fields and have it generate pages and the pull the image out of the session's temp folder (C:\ProgramData\Laserfiche\Quick Fields\Files) or import it into LF and generate pages, then export the image.

0 0

Question

Question

PDF Zone OCR Not Correct Compared to Sample

Answer

Replies

Sign in to reply to this post.