asked one day ago

Hi! I'm working with Laserfiche Workflow 12 locally. I receive a very large PDF (sometimes over 1,000 pages) with invoices from multiple vendors and occasionally messy pages (I can't use fixed areas or templates).
What I've set up so far:

  • Retrieve the text from the document.
  • C# script that parses the text and generates lines like: INV=123456|PAGES=2-5. If a page doesn't have a SAP, I deduce the group by matching (PO, customer, or vendor + total) and inherit the SAP for the rest of the group.
  • Pattern matching + For Each to iterate each line and copy/move pages:
  • If there is a SAP → I create/use \Invoices\{SAP} and compile the final document.
  • If there are no clear references → I send it to \Invoices\Manual Review.

 

It works, but it's complex to maintain, and I want to simplify it, reduce the code, and make it design-agnostic. Do you recommend any recipes or patterns for the local platform to sort by page and then rebuild with Workflow, or a script-free flow to:

  • extract keys by page (SAP/PO/Customer/Vendor/Total) without fixed zones,
  • propagate keys detected on one page to the rest of the group,
  • handle ambiguities (repeated POs, "Total" on multiple pages),
  • and decide whether to copy or move pages in large batches?

If anyone has a checklist, example, or QF session/Workflow reference, I'd greatly appreciate it! 🙌

0 0