asked on June 22, 2017

We have some PDF files in Laserfiche which do not have searchable text.  Now we want to add it.  In the LF Client, I highlight a document and select "Generate Searchable Text".  I click "Options", then "Advanced Settings for PDFs".

When I have "Use native text extraction" selected, for some PDFs no text is produced.  I don't know why, but I gather that not all PDF files are created equal somehow.  So now I am trying the "Use an alternative method to generate text" feature.

I find that with either "OCR existing pages" or "Use PDF IFilter text extraction", text is produced, but only when "Generate images and text for PDF files without a text stream" is also selected.

The problem with this is that I end up with a PDF document, a searchable text file, and a TIFF page for each page of the PDF document.  I want to have just the original PDF document plus a searchable text file.

Using the LF Client, is there a way to generate the searchable text file without creating all the TIFF pages?  Or, barring that, is there a way to remove the TIFF pages after the fact, so just the PDF and searchable text file remain?

Thanks.

 
