We have a client that has noticed some odd behavior in Import Agent. Even with the "OCR image files" turned off there is still extracted text on the PDFs they import. It seems that import agent uses the text stream of the PDF to create text by default.
They wanted to use DCC to OCR the pages after the documents are imported. Is there a way to disable text extraction from PDF text streams or is this not a resource intensive process?