You are viewing limited content. For full access, please sign in.

Question

Question

OCR or Grab Text Stream and Generate one txt file for a multiple page document

asked on May 12, 2015

Is is possible to OCR or extract text from a multi page document and have the resulting txt stream be just one text file?  Thanks for the help.

0 0

Replies

replied on May 15, 2015

Yes, it is possible to extract and combine text from a multi-page document into a single text file upon export.  By going into Laserfiche's options (Tools > Options > Export > Text), you can choose to include all pages into one text file:

I hope this helps!

0 0
replied on May 16, 2015

Hi Madison,

Thank you for the reply, but I need this functionality while importing into Laserfiche or Generating text.  Manually exporting a text file does not work for this scenario.

 

0 0
replied on May 18, 2015

Lance - I would think you could accomplish this in a Workflow script activity but I am not sure I completely understand your desired end result.

  • When do you need access to the concatenated text stream?  i.e. Can you use Workflow to create the text stream after import into Laserfiche?
  • What is the final destination for the concatenated page text?  Are you wanting to do a one-time write of the concatenated text out to a file for later use or are you going to dynamically manipulate the concatenated text while it is in a memory stream? 
  • Do you need to determine which page a portion of the concatenated text came from or can the text be one contiguous stream?  i.e.  Do you need some type of 'page-break' character inserted into the stream on a document page break?
0 0
You are not allowed to follow up in this post.

Sign in to reply to this post.