I have a simple task, but I need to perform it on a large number of documents (more than 1 million): remove the hyphens from the SSN field on each one. The documents all reside in a tree of subfolders under a single root-level folder. Here are the options I have considered so far.
1) Use a Search Repository activity to find all documents under the root-level folder. For each document, invoke a second workflow on it. That auxiliary workflow updates the document's SSN field.
2) Use a Search Repository activity to find all documents under the root-level folder. For each document, update the SSN field directly in the same workflow.
3) Use a Search Repository activity to find all folders under the root-level folder. For each folder, invoke a second workflow. That auxiliary workflow searches for all documents directly under the current folder (no subfolders) and updates the SSN field on them. I got this general idea from Laserfiche Presales for another large job.
4) Use Find Entries on the root-level folder. For each entry, if the entry is a document, update the SSN field; if it is a folder, invoke this same workflow on it. (idea from https://answers.laserfiche.com/questions/144601/Update-metadata-in-all-documents-in-folder-and-subfolders#144607)
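For reference, the per-document change itself is trivial in every option. Here is a minimal Python sketch of the transformation, just to illustrate what the update does to the field value. It is not Laserfiche Workflow code, and `strip_ssn_hyphens` is a hypothetical name I made up for the example.

```python
def strip_ssn_hyphens(ssn: str) -> str:
    """Return the SSN with every hyphen removed; all other characters are kept."""
    return ssn.replace("-", "")


# A value that already has no hyphens comes back unchanged, so a document
# like that would not actually need to be updated.
print(strip_ssn_hyphens("123-45-6789"))  # prints 123456789
print(strip_ssn_hyphens("123456789"))    # prints 123456789
```

So the cost per document should come from finding and saving the entry, not from the string operation.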
I am asking because I have been running option #1, and after 65 hours it has completed only 340,000 documents. I expected it to be faster, since it runs only one activity per document (Assign Field Values). Which of these four options would be best, and is there another strategy that would work better?
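To put numbers on the current rate, here is a rough back-of-the-envelope calculation in Python. The 340,000 documents and 65 hours are from my run; the total of exactly 1,000,000 is an assumption (the real count is somewhat higher), and the function name is only for illustration.

```python
def estimate_remaining_hours(docs_done: int, hours_elapsed: float, total_docs: int) -> float:
    """Estimate hours left, assuming the observed rate stays constant."""
    rate_per_hour = docs_done / hours_elapsed
    return (total_docs - docs_done) / rate_per_hour


docs_done = 340_000
hours_elapsed = 65
total_docs = 1_000_000  # assumption: the actual total is "more than 1 million"

rate_per_hour = docs_done / hours_elapsed
seconds_per_doc = hours_elapsed * 3600 / docs_done
remaining = estimate_remaining_hours(docs_done, hours_elapsed, total_docs)

print(f"{rate_per_hour:.0f} documents per hour")   # prints 5231 documents per hour
print(f"{seconds_per_doc:.2f} seconds per document")  # prints 0.69 seconds per document
print(f"{remaining:.1f} hours remaining")          # prints 126.2 hours remaining
```

At this rate the whole job would take roughly 8 days end to end, assuming the rate does not change as the run continues.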
Thanks