
Question

Snapshot Question

asked on August 12, 2014

What's the difference between "Obtain text from print job" and "Perform OCR on the images created from the print job"? When should you use one vs. the other? I have a client on 9.1.1.548 who is having issues with OCR. If we change the setting to "Obtain text from print job", it works, but I want to make sure we're not going to be missing anything.

 

Thanks!

Answer

SELECTED ANSWER
replied on August 12, 2014

“Obtain text” retrieves text directly from the file being printed, via the text layer in the file. It works well for electronic documents such as those created with Microsoft or Adobe applications. It is a good option unless the file has no text layer, as with PDFs that are not searchable; in that case OCR should be used. The advantage of “Obtain text” is that it is much faster and requires fewer resources than OCR. Additionally, because the text is pulled directly from the file instead of being read from the page image, it is much more accurate. Consider making “Obtain text” the default; if text turns out to be missing from a document after printing, the document can be OCR’ed in the Client.
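To make that recommendation concrete, here is a minimal Python sketch of the "try the text layer first, fall back to OCR" logic. This is not Laserfiche code: `extract_text`, `obtain_text`, `run_ocr`, and `min_chars` are hypothetical names used only for illustration, and the two callables stand in for whatever actually reads the text layer or runs OCR.

```python
from typing import Callable, Optional, Tuple


def extract_text(
    obtain_text: Callable[[], Optional[str]],
    run_ocr: Callable[[], str],
    min_chars: int = 1,
) -> Tuple[str, str]:
    """Return (text, method), preferring the text layer over OCR.

    obtain_text: hypothetical callable that returns the text layer,
                 or None/"" when the print job carries no text.
    run_ocr:     hypothetical callable that OCRs the page images.
    min_chars:   assumed threshold below which the text layer is
                 treated as missing.
    """
    text = obtain_text() or ""
    if len(text.strip()) >= min_chars:
        # Text layer present: fast and accurate, so OCR is skipped.
        return text, "obtain_text"
    # No usable text layer (e.g. a non-searchable PDF): fall back to OCR.
    return run_ocr(), "ocr"


# Electronic document with a text layer: OCR is never invoked.
from_word = extract_text(lambda: "Invoice 1001", lambda: "ocr output")

# Scanned, non-searchable PDF: no text layer, so OCR is used.
from_scan = extract_text(lambda: None, lambda: "ocr output")
```

The point of the sketch is only the ordering: the cheap, accurate path runs first, and OCR is reserved for documents where no text comes back.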


Replies

replied on August 12, 2014

Going out on a limb, I'm guessing "Obtain text from print job" applies when printing from a supported application, such as Adobe Reader or Internet Explorer, where the text can be obtained electronically as opposed to being 'read' from the page. Let's say you print from CAD or Sage or something similar that doesn't output a text layer with the print job. In that case you actually need to OCR the image.

I'm guessing obtain from print job is also much more accurate.

 

I'm totally guessing here, but that is my assumption of how it works.
