Question

OCR workflow without using DCC

asked on August 11, 2015

Hi!

I am importing a large number of documents (30K+) into Avante 9.2.

I was hoping to import them all without OCR'ing and then run the OCR process through a workflow to be able to spread out the processing time over multiple nights.

Everything I read seems to say that workflows require DCC for this type of operation.

Is there any other way to do this without DCC?

There is a variety of documents: some require OCR, while for others it would be a waste of time (scans of handwritten forms, etc.).

I can set up a custom search like ({LF:OCR=none}),

then select a chunk (the number of files depends on how long OCR takes, so I will have to see) and click OCR.

This seems to work, but it is fairly hands-on.
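The chunking described above could be planned ahead of time. Here is a minimal Python sketch, assuming the search results have been exported as a plain list of entry IDs. The IDs and the per-night count are hypothetical, and the code does not call any Laserfiche API; it only splits a list into nightly batches.

```python
from itertools import islice

def nightly_chunks(entry_ids, per_night):
    """Yield successive lists of at most per_night entry IDs."""
    if per_night <= 0:
        raise ValueError("per_night must be positive")
    it = iter(entry_ids)
    while True:
        chunk = list(islice(it, per_night))
        if not chunk:
            return
        yield chunk

# Hypothetical backlog: 30,000 un-OCR'd entries, 4,000 handled per night.
ids = range(1, 30001)
chunks = list(nightly_chunks(ids, 4000))
print(len(chunks), "nights; last night has", len(chunks[-1]), "entries")
```

Each chunk would still have to be selected and OCR'd by hand (or by whatever tool ends up doing the work), so this only removes the guesswork about where each night's batch starts and ends.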

Can anyone suggest a better way?

 

Thanks

Replies

replied on August 11, 2015

Hi Mark,

I don't think there is any "out of the box" functionality to do this within Workflow. It might be possible through an SDK script. That said, why don't you want to use DCC? I believe it ships with both Avante and Rio for free.

What if you installed DCC on an old PC or a spare piece of hardware? That way it could run 24 hours a day, not just overnight, without impacting any other infrastructure.

Hope this helps!

replied on August 11, 2015

OCR is very CPU-intensive, so for performance reasons we do not have built-in activities that run OCR on the Workflow server. You could use a script, as Chris mentioned, but I wouldn't recommend it for the same reason.

replied on August 12, 2015

Hi!

For some reason I was sure I had been told DCC was an option in Avante. Apart from the potential cost, I was hoping to avoid the time and complexity of setting up DCC. This is a really small and simple install, and I have a limited amount of time to spend on the setup. I've never set up DCC before, so I didn't know how long it would take. Also, once the initial backlog is OCR'd, DCC would probably not be needed, as there aren't a huge number of docs coming in on a daily basis.

I think I'm just going to import and OCR a small section to see how long it's all likely to take. I'll decide then whether DCC is needed.
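For what it's worth, the extrapolation from a small test run is simple arithmetic. Below is a rough Python sketch, assuming OCR time scales linearly with document count (it may not, given the mix of document types). The sample size, timing, and nightly window are made-up numbers, not measurements.

```python
def estimate_nights(total_docs, sample_docs, sample_minutes, window_minutes):
    """Extrapolate from a timed sample to a nightly batch size and night count.

    Returns (docs_per_night, nights). All inputs are positive integers.
    """
    if min(total_docs, sample_docs, sample_minutes, window_minutes) <= 0:
        raise ValueError("all inputs must be positive")
    # Docs that fit in one nightly window at the sampled rate (rounded down).
    docs_per_night = (window_minutes * sample_docs) // sample_minutes
    if docs_per_night < 1:
        raise ValueError("a single document does not fit in the window")
    # Ceiling division: a partial final night still counts as a night.
    nights = -(-total_docs // docs_per_night)
    return docs_per_night, nights

# Hypothetical pilot: 500 docs took 60 minutes; 8-hour (480-minute) window.
per_night, nights = estimate_nights(30000, 500, 60, 480)
print(per_night, "docs per night,", nights, "nights")
```

If the estimated number of nights is too high, that would be the point to look at DCC on a spare machine instead.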

Thanks for the help.

replied on August 12, 2015

Hi Mark,

Yes, Laserfiche Avante includes the preview of DCC. As far as installation and configuration go, you don't need to set it up as a cluster if that's not useful for you. You can just install everything on one machine (a one-node installation, basically) to run OCR while the backlog is being processed, and then uninstall it when you're done.
