I have a Forms process where a document is uploaded by an end user and stored in the repository. Then a WF process runs OCR on the uploaded document. Then a second WF process picks it up and attempts to retrieve the text. I can see the text if I open the document before the second WF process runs but there is no text retrieved in the workflow process. When OCR is ran on the document after I drag and drop it into the repository, the second WF properly retrieves the text. What might I be doing wrong?
Question
Question
"retrieve document text" not retrieving text
Replies
What triggers the second workflow? Are you certain it is happening after OCR is complete? Do you see any messages to explain why the retrieve text activity failed?
The first workflow processes the OCR and then moves the documents. I guess I assume one the file has been moved, the OCR should have completed. The 'Activities' tab of the workflow process gives me that impression. So the second workflow is triggered after the document is moved to the second folder.
The way I verified that the text was not being gathered is I put a 'Track Token' after the 'Retrieve Document Text' token and it is blank after the document is processed.
Ok, so I just resolved my issue with the help a posted yesterday but then was deleted before I had a chance to reply. So thank you whom ever that was. I put a 1 min delay on the second WF. It appears as though the second process was grabbing it before the OCR was truly completed.
OCR is not synchronous, Workflow just schedules documents for OCR with DCC. Depending on the load on your DCC, 1 minute delay may not be enough to guarantee OCR completed. Your second workflow should be triggered by changes to the document when the text pages are added.