I am processing a collection of PDF documents that were imported into the repository via Import Agent. I wasn't part of that effort, so I do not know the specifics of how the import was done. That said, I have about 1,000 PDFs that were not placed in the correct directory, and I need to process them. Our directory structure is by Year-Month, and I just need to put the docs in the correct folder (creating a folder using the case number as the name).
To get the date, I am trying to use the Retrieve Document Text workflow object and then apply pattern matching to the text. However, I receive the error given in the subject when I hit the first doc. I am able to open the PDF in the repository and it has 3 pages, so I am a bit baffled by this. I assume the activity needs a different type of page and, from other posts, it might be that generate text was not selected, so this won't work. If this is the case, please confirm or provide some guidance on resolving this issue.
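
To make the intended pattern-matching step concrete, here is a minimal sketch in Python (outside of Workflow, purely for illustration). It assumes the extracted text contains a date like 3/15/2021 and a label like "Case No. 2021-CV-0042"; both formats, the regular expressions, and the function name `target_folder` are hypothetical and would need to be adjusted to the real documents. It returns nothing when the document has no text, which is the situation I suspect I am hitting.

```python
import re
from datetime import datetime
from typing import Optional

# Assumed formats only; the real documents may look different.
DATE_RE = re.compile(r"\b(\d{1,2})/(\d{1,2})/(\d{4})\b")
CASE_RE = re.compile(
    r"Case\s+(?:No\.?|Number)\s*:?\s*([A-Za-z0-9-]+)", re.IGNORECASE
)


def target_folder(text: str) -> Optional[str]:
    """Return 'YYYY-MM/<case number>' or None if it cannot be determined."""
    # No extracted text at all (e.g. text was never generated for the PDF).
    if not text or not text.strip():
        return None

    date_match = DATE_RE.search(text)
    case_match = CASE_RE.search(text)
    if not date_match or not case_match:
        return None

    month, day, year = (int(g) for g in date_match.groups())
    try:
        doc_date = datetime(year, month, day)
    except ValueError:
        # Matched the pattern but is not a real calendar date.
        return None

    return f"{doc_date:%Y-%m}/{case_match.group(1)}"


if __name__ == "__main__":
    sample = "Case No. 2021-CV-0042\nFiled: 3/15/2021"
    print(target_folder(sample))
```

The first date found in the text is used, so a tighter pattern (for example, one anchored to a "Filed:" label) may be needed if the documents contain several dates.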