You are viewing limited content. For full access, please sign in.

Question

Question

Schedule OCR proper search query syntax

asked on July 6, 2016 Show version history

Hi All,

I'm trying to setup and test one of these schedule OCR processes for the first time and I'm hitting a snag. I have the DCC setup and I have the workflow prepared. It does appear that the Schedule OCR process is correctly connected (since I am seeing it's tallying jobs run in DCC on the web admin console, none have been successful though with the syntaxes I have tried to use.) I'm not sure what's supposed to be put in the search query area on the search repository step. This will be setup to look for documents that have been scanned or placed in during the day that need OCR'd. This will run overnight sometime. I'm sure this is done just about the same way in all of these so I just need to know what should go there?

search repository settings.PNG
schedule OCR settings.PNG
0 0

Answer

SELECTED ANSWER
replied on July 6, 2016

I would recommend your search syntax be similar to this:

({LF:AssociatedPages="Y"} & {LF:OCR=none})&{LF:pagecount > 0}&{LF:totalimagesize > 0}

What you will want to do is search for documents that have no OCR or some OCR (you'll have to add this to the above search) and you will want documents that have images.

2 0
replied on July 13, 2016

ok, so I was able to set this up properly and with the testing I did with just the scheduler it worked fine. I was able to pull OCR from non OCR'd documents. the other portion of this is setting up workers though. I have read some information on this regarding scripting etc......some of it's a little high level for me right now so I'm not sure if anyone has some truncated way to explain the information and the steps involved here. also, how many should typically be setup?

0 0
replied on July 14, 2016

If the licensing is allowed I would start with 2-3 worker machines and increase the amount if needed. You would install the other machines as workers and then within the Web Administration console you can register the worker machines from there. 

1 0
replied on July 14, 2016

ok, I was reading something on scripting to get DCC to run off hours but I think that is if you are not using workflow or something else entirely? the workflow I have made should kick everything off just fine based off the starting rule time I create. this is the way I take it.

thanks for the information!

0 0

Replies

You are not allowed to reply in this post.
You are not allowed to follow up in this post.

Sign in to reply to this post.