I have a number of PDFs that have been OCR'd and cleaned up through ABBYY Fine Reader, and I'm wondering if there's a way that Laserfiche can retain and utilize that OCR? Right now it looks like it's discarding that information when it generates Laserfiche pages, which means I have to re-OCR the documents after import.
Question
Question
Do I need to re-OCR documents that have been OCR'd by another program prior to import?
Replies
I'm not sure that ABBYY Fine Reader is able to export it's OCR data. I know that ABBYY Flexicapture is able to export into Laserfiche using the Add-on from ABBYY.
You need to set the option to generate/extract text on import then it will pull the text from the e-doc text layer.
In the client, go to Tools - Options - New Documents - Settings and make sure the checkbox is checked for "Generate searchable text".
On that same tab, you can also set the option When importing PDFs Generate Laserfiche Pages.
If you already have those settings set and it is still not working properly, check the Tools - Options - Generate Text - Advanced Settings for PDFs
I thought this solution was working for me alright, until I had a document where the OCR was incomplete due to a poor image in the original. I realized Laserfiche was only importing the text layer underneath the document, and not both the image layer and the OCR layer. Is there any way to work around this so that I retain all the information in the original document?