Question

"retrieve document text" not retrieving text

asked on November 28, 2018

I have a Forms process where an end user uploads a document, which is stored in the repository. A WF process then runs OCR on the uploaded document, and a second WF process picks it up and attempts to retrieve the text. I can see the text if I open the document before the second WF process runs, but no text is retrieved in the workflow process. When OCR is run on the document after I drag and drop it into the repository, the second WF retrieves the text properly. What might I be doing wrong?

Replies

replied on November 28, 2018

What triggers the second workflow?  Are you certain it is happening after OCR is complete?  Do you see any messages to explain why the retrieve text activity failed?

replied on November 29, 2018

The first workflow runs the OCR and then moves the documents. I assumed that once the file had been moved, the OCR would have completed; the 'Activities' tab of the workflow process gives me that impression. So the second workflow is triggered after the document is moved to the second folder.

I verified that the text was not being gathered by putting a 'Track Token' activity after the 'Retrieve Document Text' activity; the token is blank after the document is processed.

replied on November 29, 2018

OK, so I just resolved my issue with the help of a reply that was posted yesterday but was deleted before I had a chance to respond. So thank you, whoever that was. I put a 1-minute delay on the second WF. It appears the second process was grabbing the document before the OCR had truly completed.

replied on December 3, 2018

OCR is not synchronous: Workflow just schedules documents for OCR with DCC. Depending on the load on your DCC, a 1-minute delay may not be enough to guarantee that OCR has completed. Your second workflow should instead be triggered by the change made to the document when the text pages are added.
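
To illustrate why waiting on the actual condition (text is present) is more robust than a fixed delay, here is a generic Python sketch. It is not Laserfiche or Workflow API code: `wait_for_text` and the `get_text` callable are hypothetical names standing in for "read the document's text", and the timeout values are arbitrary. In Workflow itself, the event-based starting rule described above plays this role.

```python
import time
from typing import Callable, Optional


def wait_for_text(get_text: Callable[[], str],
                  timeout_s: float = 300.0,
                  interval_s: float = 5.0,
                  sleep: Callable[[float], None] = time.sleep,
                  clock: Callable[[], float] = time.monotonic) -> Optional[str]:
    """Poll get_text() until it returns non-empty text or timeout_s elapses.

    get_text is a hypothetical stand-in for whatever reads the document text.
    Returns the text, or None if nothing appeared before the deadline.
    """
    deadline = clock() + timeout_s
    while True:
        text = get_text()
        if text.strip():
            return text          # OCR output is present; safe to continue
        if clock() >= deadline:
            return None          # give up instead of waiting forever
        sleep(interval_s)        # wait before checking again
```

Unlike a flat 1-minute pause, this finishes as soon as the text exists and keeps waiting (up to the timeout) when the OCR queue is busy.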

replied on November 28, 2018

What steps occur in your process between scheduling the OCR and having the document get picked up by the next workflow?

The reason I ask is that the OCR activity in workflow schedules the OCR, but it is basically a "set it and forget it" type of activity so the workflow moves on immediately.

As a result, if your next workflow/activity is initiated right after the Schedule OCR activity, it would almost certainly touch the document before the OCR process is complete.
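
The "set it and forget it" behavior can be reproduced with a small simulation. This is only a sketch of the timing problem, assuming nothing about how Workflow or DCC are implemented: `FakeDocument` and `schedule_ocr` are made-up names, and the background thread merely imitates an OCR job that finishes later.

```python
import threading
import time


class FakeDocument:
    """Stand-in for a repository document; not a Laserfiche type."""

    def __init__(self):
        self.text = ""


def schedule_ocr(doc, ocr_seconds):
    """Queue simulated OCR on a background thread and return immediately."""
    def run():
        time.sleep(ocr_seconds)          # simulated OCR work
        doc.text = "recognized text"     # text pages appear only now

    worker = threading.Thread(target=run)
    worker.start()
    return worker                        # caller is free to move on


doc = FakeDocument()
job = schedule_ocr(doc, ocr_seconds=0.2)

# Reading right after scheduling sees no text, as in the original question.
text_right_after_scheduling = doc.text

# Reading after the simulated OCR has finished sees the text.
job.join()
text_after_completion = doc.text
```

The first read comes back empty even though the scheduling call has already "completed", which matches what the blank token showed.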
