Question

Do I need to re-OCR documents that have been OCR'd by another program prior to import?

asked on May 9, 2019

I have a number of PDFs that have been OCR'd and cleaned up in ABBYY FineReader, and I'm wondering if there's a way for Laserfiche to retain and use that OCR. Right now it looks like it's discarding that information when it generates Laserfiche pages, which means I have to re-OCR the documents after import.

Replies

replied on May 9, 2019

I'm not sure that ABBYY FineReader is able to export its OCR data. I know that ABBYY FlexiCapture is able to export into Laserfiche using the add-on from ABBYY.

replied on May 10, 2019

I'm not sure I understand, as the file is exported as a PDF with searchable text that can be read by Adobe Reader or my web browser. But I'll look into FlexiCapture. If that solves my problem, great!

replied on May 10, 2019

You need to set the option to generate/extract text on import. Laserfiche will then pull the text from the e-doc's text layer.

In the client, go to Tools - Options - New Documents - Settings and make sure the checkbox is checked for "Generate searchable text".

On that same tab, you can also set the "When importing PDFs" option to "Generate Laserfiche Pages".

If you already have those settings in place and it is still not working properly, check the PDF settings under Tools - Options - Generate Text - Advanced Settings.
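
If text still isn't coming through, it may be worth confirming that a given PDF actually has a text layer before you import it. Below is a rough Python sketch of one way to check, using only the standard library. It is a heuristic, not anything Laserfiche or ABBYY provides: the function name `has_text_layer` is made up for this example, and it assumes that a PDF with searchable text references at least one font (`/Font`), either in plain bytes or inside a Flate-compressed stream, while a scan-only PDF references none.

```python
import re
import zlib

# Raw bytes between the "stream" and "endstream" keywords of a PDF object.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)\r?\n?endstream", re.DOTALL)


def has_text_layer(pdf_bytes: bytes) -> bool:
    """Heuristic: return True if the PDF appears to reference any font.

    Assumption: searchable (OCR'd) PDFs carry font resources for the
    hidden text, while image-only scans do not. This does not prove
    the text is complete or accurate.
    """
    # Font dictionaries stored uncompressed in the file body.
    if b"/Font" in pdf_bytes:
        return True
    # Font dictionaries may also sit inside Flate-compressed streams.
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            data = zlib.decompress(match.group(1))
        except zlib.error:
            continue  # not Flate data (for example a JPEG image), skip it
        if b"/Font" in data:
            return True
    return False
```

To try it on a file, read it in binary mode and pass the bytes in, for example `has_text_layer(open("scan.pdf", "rb").read())`, where `scan.pdf` is a placeholder name. A `False` result suggests the file is image-only and would need OCR after import.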

replied on July 15, 2019

I thought this solution was working fine for me until I had a document where the OCR was incomplete due to a poor image in the original. I realized Laserfiche was importing only the text layer underneath the document, not both the image layer and the OCR layer. Is there any way to work around this so that I retain all the information in the original document?
