You are viewing limited content. For full access, please sign in.

Question

Question

Best approach to workflow OCR overnight

asked on June 17, 2015

Hi all,

 

I have a site which is ingesting large amounts of documents via Ezescan, whereby upon scanning only the first, cover page is OCR'd and zone recognised to construct basic metadata, file name and destination in the repository.  

Some docs are 1000+ pages and so we are not doing the full text search OCR on upload.

I want to run a workflow when the business is closed overnight to crawl the repository for documents that need full text search OCR and do it then.

Nothing fancy, no moving of the docs, no metadata work or anything, just ensuring that in a reasonable timeframe after ingestion the documents will be full-text searchable.

This is a Rio 9.2 site.

Any suggestions would be appreciated.

Will

0 0

Answer

SELECTED ANSWER
replied on June 17, 2015

That is exactly what the Distributed Processing is for.  Once set up, it can scan the repository and schedule documents to be OCRed by configured workstations.  You can set up many workstations to distribute the load and speed up the OCRing of the entire batch.

2 0

Replies

replied on June 17, 2015 Show version history

Thanks Bert,

I knew the DCC is part of Rio and I could leverage it but having only one server does it work fine overnight on the main box - and does it do this task as a built in functionality or do you need to call it with workflow.  Sorry for the Noob questions.

I really appreciate your reply.

Will

[edit - reading the fine documentation you linked to now - thank you]

0 0
replied on June 18, 2015

You can have a DCC installation with only one worker node. I wouldn't recommend putting it on the Workflow Server though as OCR is CPU intensive so it may affect the performance of the Workflow Server.

DCC does not generate OCR tasks by itself, they need to be sent through Workflow or from Web Access.

1 0
replied on August 11, 2016

Hi Miruna,

 

I would like to know how the OCR tasks would be sent through Workflow? Do you have any document which further explain the process? Thank you

 

Kind Regards,

Ketsia

0 0
replied on August 12, 2016

See the help file.

0 0
replied on June 22, 2015

Thats fantastic - thanks.  Getting across this now.

0 0
You are not allowed to follow up in this post.

Sign in to reply to this post.