Greetings,
I'm looking to set up a Quick Fields session to redact SSNs from all documents in our repository. There are several issues to address in creating this session, and I was hoping to get some feedback on whether I'm on the right track.
1: Finding *all* SSNs
Because the SSNs are usually handwritten, I need a multi-pronged approach to find every instance. I am thinking of running at least three searches: one for "SSN", one for "Social Security Number", and one for the pattern "\d\d\d-\d\d-\d\d\d\d". Once I have found as many as possible, the next question is whether Auto-Annotation can be used on these three searches to produce the redactions I'm looking for. I assume the pattern-matching case, where the SSN is typed, would be the easiest. Is that correct?
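To make the three searches concrete, here is a rough Python sketch of what I mean, written outside Quick Fields purely for illustration. The names `SSN_PATTERN`, `LABEL_PATTERN`, and `find_ssn_candidates` are my own, and I'm assuming the OCR text is available as a plain string; this is not how Quick Fields itself is configured.

```python
import re

# Typed SSNs in the form ddd-dd-dddd (equivalent to \d\d\d-\d\d-\d\d\d\d,
# with word boundaries so longer digit runs are not matched).
SSN_PATTERN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")

# The two label searches, matched case-insensitively.
LABEL_PATTERN = re.compile(r"\b(?:SSN|Social Security Number)\b", re.IGNORECASE)


def find_ssn_candidates(text):
    """Return sorted (start, end, kind) spans for typed SSNs and SSN labels."""
    hits = [(m.start(), m.end(), "number") for m in SSN_PATTERN.finditer(text)]
    hits += [(m.start(), m.end(), "label") for m in LABEL_PATTERN.finditer(text)]
    return sorted(hits)


if __name__ == "__main__":
    sample = "Name: Jane Doe  SSN: 123-45-6789"
    for start, end, kind in find_ssn_candidates(sample):
        print(kind, repr(sample[start:end]))
```

A "label" hit with no "number" hit nearby is the case I expect for handwritten SSNs, which is where the zone question in item 2 comes in.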
2: SSNs appear in different places on the page, and not always on the first page
Using the three searches outlined above, I'm assuming that to redact the SSN that follows "SSN" or "Social Security Number" I'll need to define a zone to be redacted. Is that correct, or does the Auto-Annotation feature work on a different principle? If a zone does have to be defined, the fact that it will fall in various places on the page might be a problem. Is there a way to avoid creating a separate session for each placement of an SSN across the various documents?
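What I'm hoping for is a zone defined relative to wherever the label is found, rather than at a fixed page position. As a sketch of that idea only (the `Box` type, `zone_right_of_label`, and the padding value are hypothetical, and I don't know whether Quick Fields works this way internally):

```python
from dataclasses import dataclass


@dataclass
class Box:
    """A rectangle on the page, in whatever units the OCR engine reports."""
    left: float
    top: float
    width: float
    height: float


def zone_right_of_label(label: Box, zone_width: float, padding: float = 2.0) -> Box:
    """Return a redaction box that starts just past the label's right edge.

    The box follows the label, so it does not matter where on the page
    (or on which page) the label was found.
    """
    return Box(
        left=label.left + label.width + padding,
        top=label.top - padding,
        width=zone_width,
        height=label.height + 2 * padding,
    )


if __name__ == "__main__":
    label = Box(left=100, top=200, width=40, height=12)
    print(zone_right_of_label(label, zone_width=120))
```

If something like this is possible, one session could handle every placement, since the zone is computed from each hit instead of being drawn once.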
3: Verifying success of searches and redactions
What is the best way to verify that I have found at least 80% of the SSNs in the repository and redacted the correct information? I plan to run lots of test cases, but would like to avoid checking every document one by one.
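One approach I'm considering is to manually review a random sample and estimate the hit rate from that. A minimal sketch, assuming I can export a list of document IDs and record, for each sampled document, how many SSNs were present and how many were correctly redacted (the function names and the sample size are placeholders of my own):

```python
import random


def pick_review_sample(doc_ids, sample_size, seed=0):
    """Pick a reproducible random sample of document IDs for manual review."""
    rng = random.Random(seed)
    ids = list(doc_ids)
    return rng.sample(ids, min(sample_size, len(ids)))


def estimated_recall(reviewed):
    """Estimate the fraction of SSNs that were correctly redacted.

    reviewed: list of (ssns_present, ssns_redacted) counts, one pair per
    manually checked document. Returns None if no SSNs were present.
    """
    present = sum(p for p, _ in reviewed)
    redacted = sum(r for _, r in reviewed)
    return redacted / present if present else None


if __name__ == "__main__":
    sample = pick_review_sample(range(1000), sample_size=50)
    print(len(sample), "documents to review by hand")
    print(estimated_recall([(2, 2), (1, 0), (1, 1), (0, 0)]))
```

This would only give an estimate for the sample, so I'd still want to know whether there is a better built-in way to check the results.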
Any help or comments would be greatly appreciated.