Question

Laserfiche Cloud - Automated Text Extraction

asked on September 27, 2021

Hello,

I have a question about the Automated Text Extraction feature built into Laserfiche Cloud: the one that runs during off-hours to automatically extract text from all documents in the repository that don't have searchable text.

Does this feature only extract text from electronic documents, or does it also OCR text from scanned images?

Thanks,

Nareg

Answer

SELECTED ANSWER
replied on September 27, 2021

I think you have that backwards. Laserfiche Cloud has behind-the-scenes OCR for image pages. It does not extract text from electronic documents.

replied on September 27, 2021

Oops. So the 'Automated Text Extraction' feature listed in the Laserfiche Cloud Package Tiers FAQ Guide refers to OCR'ing images, not text extraction from electronic documents?

replied on September 27, 2021

Correct.

replied on September 27, 2021

Got it, thanks Miruna.

replied on September 28, 2021

A few notes to elaborate on this, since this is a common point of confusion:

The client applications support automated client-side text extraction, i.e., pulling existing text from electronic documents such as Office or PDF files and placing it into what Laserfiche calls "text pages". Text pages enable certain actions, such as using pattern matching to capture specific information within the text.

However, if your use case is simply searchability, it's worth noting that text from documents that have an iFilter (which includes Office documents and some, but not all, PDFs) is automatically added to the search index through server-side processing, even if text pages aren't created.

replied on October 1, 2021

Thanks Tessa, that's definitely good to know. In this particular case, it's for searchability.
