The OCR engine is overloaded and timing out on a single document after 50 minutes of processing, although the timeout reported in the log is 600000 milliseconds (10 minutes). The error in Event Viewer is:
Task ID: 1302.1302.2, Task Type: OCR.OCR, Host Name: HOST Context Message: Running OCR exceeded its timeout of 600000 milliseconds. Running OCR exceeded its timeout of 600000 milliseconds.
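As a quick sanity check on the numbers, here is the logged timeout converted to minutes and compared with the 50 minutes the document ran. This is only arithmetic on the two figures in this post. The variable names are my own, and the idea that the gap might correspond to several timeout windows (for example, retries) is an assumption I have not confirmed.

```python
# Values taken from this post; names are illustrative, not product settings.
LOGGED_TIMEOUT_MS = 600_000   # timeout reported in the Event Viewer message
OBSERVED_MINUTES = 50         # how long the single document appeared to run


def ms_to_minutes(ms: int) -> float:
    """Convert milliseconds to minutes."""
    return ms / 60_000


timeout_minutes = ms_to_minutes(LOGGED_TIMEOUT_MS)

# How many timeout windows fit into the observed duration.
# Assumption: this ratio is only a hint; it does not prove retries occurred.
windows = OBSERVED_MINUTES / timeout_minutes

print(f"Logged timeout: {timeout_minutes:.0f} minutes")
print(f"Observed time spans {windows:.0f} timeout windows")
```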
The documents being OCRed are difficult: each has a repeated text watermark and a laurel border. The workflow that sends documents to the scheduler is set to OCR in standard mode, and I'm not sure whether that is part of the problem. The worker machine is allowed to run two tasks concurrently, and it looks like each task is only allowed to use 25% of the CPU.
The machine has 12 GB of memory and 4 E5-2699 cores @ 2.30 GHz. That seems like it should be enough, but maybe not if each task is limited to 25% of the CPU. Right now it shows 75% CPU usage across three processes, with 4 GB of memory in use.
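To make the CPU figures concrete, here is a rough sketch of what a 25% per-task limit would mean on this machine. It assumes the 25% is a share of total CPU across all four cores, so each task gets roughly one core; I have not confirmed that this is how the limit is actually enforced. All names are hypothetical.

```python
# Figures from this post; names are illustrative, not real configuration keys.
CORES = 4
PER_TASK_CPU_FRACTION = 0.25      # apparent cap per OCR task
CONFIGURED_CONCURRENT_TASKS = 2   # what the worker is set to allow
OBSERVED_PROCESSES = 3            # what the machine currently shows


def cores_per_task(cores: int, fraction: float) -> float:
    """Cores' worth of CPU one task gets, assuming the cap applies to total CPU."""
    return cores * fraction


def total_cpu_percent(processes: int, fraction: float) -> float:
    """Total CPU percentage used if every process runs at its cap."""
    return processes * fraction * 100


print(f"Cores per task: {cores_per_task(CORES, PER_TASK_CPU_FRACTION):.0f}")
print(f"Expected with configured tasks: "
      f"{total_cpu_percent(CONFIGURED_CONCURRENT_TASKS, PER_TASK_CPU_FRACTION):.0f}%")
print(f"Expected with observed processes: "
      f"{total_cpu_percent(OBSERVED_PROCESSES, PER_TASK_CPU_FRACTION):.0f}%")
```

Under that assumption, three processes at their cap would account for the 75% I am seeing, and two configured tasks alone would only explain 50%.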
My questions are:
1) Is it possible to manually raise the CPU limit for a single OCR process?
2) Does this machine have enough resources to handle OCR tasks?
3) Is there anything I'm not thinking of that could be causing the OCR processes to time out?