You are viewing limited content. For full access, please sign in.

Discussion

Discussion

Split documents into single pages before running an ID process?

posted on June 2, 2021

I realize I may be asking for the moon with this one, but...

Is there a way to separate document pages from a pdf (scanned tif files within a single pdf) into all single page documents before running them through a separate identification process? Reason is, due to scan quality the ID processes do not always work, and it's easier to handle the unidentified pages one by one if they are separated.

We are also considering one session to simple split documents into pages, and then using the Capture Engine to reprocess these as single page documents. TIA -

 

0 0
replied on June 3, 2021 Show version history

Hi Bill, that can be done. There are likely some open source PDF manipulation tools that do this but I've been using a paid version of AsposePDF. I can help you out with a workflow activity. 

Distributed Computing Cluster (DCC) v11 can convert the PDF to a multipage TIFF. TIFFs are much easier to split without third-party tools.

-Ben

 

 

1 0
replied on June 3, 2021

Thanks, Ben - 

Your note reminded me of OmniFormat, a practically free ($10) pdf to tiff conversion utility that I have used in many settings for years.

It runs in an unattended manner, and does a great job.

 

1 0
You are not allowed to follow up in this post.

Sign in to reply to this post.