When a fillable PDF form is imported into Laserfiche through Web Access, the OCR results do not contain the information entered in the fillable fields. If the same form is imported into Laserfiche using the thick client, the OCR results do contain the fillable information. Also, if you OCR the document inside the thick client after it has been imported with Web Access, the fillable information is there.
Question
Importing a fillable PDF through Web Access: the fillable fields are not OCRed
Answer
If you are using Web Access, it currently cannot OCR (attempt to convert the image of the text to text) upon import; instead, it extracts the text already associated with the PDF. According to the help files, the customer is seeing the expected behavior:
Note: When text is extracted directly from a PDF form, only the standardized form text will be included. The text input by the user in the form's fields will not be included in the extracted text. This text can be generated by OCRing the image pages created from PDFs rather than extracting the text from the electronic document.
If you want to perform OCR on the image pages in Web Access, you must have Distributed Computing Cluster.
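To illustrate the difference the help note describes, here is a toy Python sketch. It does not call any Laserfiche API and does not parse a real PDF; the class `FillablePdfPage` and the functions `extract_text` and `ocr_rendered_image` are hypothetical names made up for this example. It only models the idea that form labels live in the page content, user input lives in the form fields, and both are visible once the page is rendered to an image.

```python
from dataclasses import dataclass, field


@dataclass
class FillablePdfPage:
    """Toy stand-in for one page of a fillable PDF (hypothetical, not a real parser)."""
    static_text: list                                  # labels stored in the page content
    field_values: dict = field(default_factory=dict)   # user input stored in form fields


def extract_text(page):
    """Mimic direct text extraction: only the standardized form text comes back."""
    return "\n".join(page.static_text)


def ocr_rendered_image(page):
    """Mimic OCR of the rendered page image: labels and filled-in values are both visible."""
    lines = []
    for label in page.static_text:
        value = page.field_values.get(label, "")
        lines.append(f"{label} {value}".strip())
    return "\n".join(lines)


# Sample data invented for the illustration.
page = FillablePdfPage(
    static_text=["Name:", "City:"],
    field_values={"Name:": "Jane Doe", "City:": "Long Beach"},
)

extracted = extract_text(page)      # what import-time text extraction would see
ocred = ocr_rendered_image(page)    # what OCR of the image pages would see

print(extracted)
print(ocred)
```

In this model the extracted text holds only the labels, while the OCR result also holds the entered values, which matches the behavior reported in the question.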
Replies
How are you importing the PDF into LF exactly?
The customer says that she clicks Import and then selects the file. I didn't think it would even OCR if you do that. I am getting ready to test on my VAR kit.