You are viewing limited content. For full access, please sign in.

Question

Question

Import: OCR Failed

asked on November 17, 2021

We have a folder for an automated process that drops a multipage tiff and relate xml template into a monitored import folder. Right now OCR is set to true for the import process.

Sometimes we see that the import process fails and the event log error is just that OCR failed. If i turn ocr to false and import it will import fine, and running the OCR engine from laserfiche after works fine as well.

The latest example would be a 741 page greyscale tiff document with some b/w images and handwriting that shows up lighter due to the writing medium used. 

 

Is this just something Import shouldnt be OCR'ing and saved for a workflow process? or is there something within import i can do to help prevent these errors?

0 0

Answer

SELECTED ANSWER
replied on November 17, 2021 Show version history

I usually prefer doing OCR after the document is in the repository using DCC since you can avoid import errors and you get more flexibility on how to handle things.

However, something you could do is look at your OCR settings. Once thing I've found is that certain documents can trigger errors for certain OCR options.

Despeckle seems to cause a lot more errors than any other settings with OCR so I rarely use it at all, and my first OCR attempt typically uses the following settings.

 

In my workflows, if/when an OCR job fails with my "standard" settings, which is rare, I have it retry with "minimum" settings, and after 4 years I haven't really had any errors with the minimum settings.

If you really prefer to use Import Agent, I suppose you could have a second profile with the "minimum" OCR settings monitoring a different folder.

Then, instead of sending it to an IAError folder in the first profile, you could send it to this "failover" folder to try again with the lighter OCR settings.

If it still fails with the second profile, then you send it to the IAError folder.

0 0
replied on November 18, 2021

Awesome, Thank you for the response! I've been leaning towards a workflow solution and will push for that. Either way I am definitely implementing a failover system as you suggested.

0 0

Replies

You are not allowed to reply in this post.
You are not allowed to follow up in this post.

Sign in to reply to this post.