I am working on a workflow to ensure that the whole repository is thoroughly OCRed, but I have come across a question about the search I am using after reading the definitions in the manual:
The manual states:
Advanced search syntax can be used to search for documents by their OCR status (whether searchable text has been generated for them) using the following syntax. All advanced search types can be customized with advanced search operators and wildcards.
- {LF:OCR=All/Some/None}
This seems to indicate that Laserfiche keeps track of which pages in each document it has applied its own OCR process to, i.e. it is not just looking for pages that have ASCII/plain text associated with them, which could of course have come from another scanning and OCR application (such as EzeScan).
Is the search looking for pages that have been OCRed by Laserfiche's scheduled or manual OCR process, or is it only classifying documents by whether all, some, or none of their pages have text present?
If it is truly a flag that reliably records whether Laserfiche has successfully OCRed the pages in a document, it would remove the need to have workflow set flags on documents to record whether they have been OCRed. Without such a flag there is an ambiguity: a document in which only some pages have text may in fact be fully OCRed, because blank or photographic pages genuinely contain no text. In that case ALL pages were successfully OCRed, yet the result is a document with TEXT on only SOME pages.
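To make the ambiguity concrete, here is a minimal Python sketch of the two possible interpretations. It is only a model of my question, not Laserfiche's actual implementation or API: the function names and the per-page "processed" flag are hypothetical.

```python
from typing import List


def status_by_text_presence(page_texts: List[str]) -> str:
    """Interpretation A (hypothetical): classify purely by whether pages carry text."""
    pages_with_text = sum(1 for text in page_texts if text.strip())
    if pages_with_text == 0:
        return "None"
    if pages_with_text == len(page_texts):
        return "All"
    return "Some"


def status_by_processed_flag(page_processed: List[bool]) -> str:
    """Interpretation B (hypothetical): classify by a per-page 'OCR was run' flag."""
    pages_done = sum(1 for done in page_processed if done)
    if pages_done == 0:
        return "None"
    if pages_done == len(page_processed):
        return "All"
    return "Some"


# A three-page document whose middle page is blank.
# Every page has been through OCR, but only two pages yielded text.
page_texts = ["Invoice 123", "", "Total due: 45.00"]
page_processed = [True, True, True]

text_status = status_by_text_presence(page_texts)
flag_status = status_by_processed_flag(page_processed)
print(text_status, flag_status)
```

Under interpretation A this fully processed document is reported as "Some", while under interpretation B it is reported as "All". Which of the two the {LF:OCR=...} search actually does is what I am trying to find out.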
Best
W