Question

Capture profile not working on .pdf converted from .docx

asked on October 7

Hello, I am trying to run OCR on files that originate as .docx and are converted to .pdf using the "Update Word Document" activity. The conversion from .docx to .pdf succeeds, but the workflow then seems to fail at the step where I use Run Capture Profile to extract metadata. However, when I save the same .docx file as a .pdf in Windows (outside of Laserfiche) and upload that .pdf, the OCR step executes as intended, so I don't think the problem is with the OCR itself. I suspect the workflow is missing a step that would make it treat the converted .pdf (from .docx) as a "new" file to run through the OCR step. I've tried several strategies, such as adding a delay activity, and moving the converted .pdf into a separate folder with a trigger set up to run the OCR on the folder move. None of these seem to work. I'd appreciate any ideas!

Replies

replied on October 8

Capture profiles need image pages. If you are not generating image pages when you import the PDF, then there's nothing for the profile to work with. Whether OCR happens is irrelevant for this specific activity.
