I've set up LFDCC to handle OCR processing via workflow and webaccess. Installation seems to have gone very well. One thing I noticed on the WebAdmin DCC job reporting pages:
OcrEngineOptions:
  Decolumnize: False
  LanguageTag: en
  OptimizationMode: Balance
  OcrEntriesInSubFolders: False
  AutoOrient: False
  PerformImageCleanup: True
  SkipPagesThatAlreadyHaveText: False
ImageCleanupOptions:
  Deskew: True
  Despeckle: True
  SpeckleSizeInPixels: 2
  Rotate: False
  RotationAmountInDegrees: 0
  HorizontalLineRemoval: True
  VerticalLineRemoval: True
  LineRemovalCharProtection: True
Is the OCR engine committing the deskew/despeckle changes from image cleanup? And where are these OCR engine and image cleanup options coming from?