You are viewing limited content. For full access, please sign in.

Question

Question

Help Understanding OCR

asked one day ago Show version history

Hi all, 

 

Finally got my OCR up and running, got the DCC workers, etc. Everything runs fine in my workflow, at the moment I am just using the OCR Scheduler.  I check in the web client. The documents don't seem to be affected. 

 

Is my expectation wrong. I have a PDF in a repository folder which is just canned document. After the OCR task, I expected to go into that document and find it converted to text?

Have been reading through the docs but maybe I've missed something where it tells me clearly what the OCR Scheduler task is doing (apart from sending it to the scheduler)

Thanks

 

0 0

Answer

SELECTED ANSWER
replied one day ago

Your wording may be off, but I wanted to clarify that OCRing a document does not convert it to text, it extracts or reads text from a document and creates a text file that is associated with the original document. You would see that text in the text pane when you open the original document.

0 0
replied one day ago Show version history

Everything Blake said and also Schedule OCR does not process PDFs, you'd want to use a Schedule PDF Page Generation activity first to make image pages that can be OCRed. 

0 0
replied one day ago

Thanks Miruna, I will use the Page Generation Activity

0 0

Replies

replied one day ago

Thank you, very much appreciated, that makes much more sense. I was thrown a request and had to dive into OCR as quickly as possible. I think I'll look deeper into some training to get my head around files in LF repository.

 

Forgive me, one last question I was looking for the text pane, and have found the help file for it.

I found the help file here

https://doc.laserfiche.com/laserfiche.documentation/11/userguide/en-us/Subsystems/client_wa/Content/Document_Viewer/text_Pane.htm

And found this post here when I couldn't find the text pane

https://answers.laserfiche.com/questions/117497/View-text-pane-in-Laserfiche-Cloud-Web-Client

Has any of that changed since 2017, I guess I was looking for an easy way to identify if a doc had that text file extract.

I'm on this version (not completely up to date.

0 0
You are not allowed to follow up in this post.

Sign in to reply to this post.