You are viewing limited content. For full access, please sign in.

Question

Question

Laserfiche Repository Field - Constraint for Values not Case Sensitive

asked on November 7, 2016 Show version history

Hello,

I need to define a field constraint so the user can enter values that are not case sensitive; for instance :

> Please OCR page [1-9]\d*

> This page is a packing slip

For the 2 formats above, the field constraint becomes:

[Pp][lL][eE][aA][sS][eE]\x20[Oo][Cc][Rr]\x20[pP][aA][gG][e]\x20[1-9]\d*|[Tt][hH][iI][sS]\x20......

Given the field has 6 additional formats, the constraint reaches actually the 400 characters limit so I'm stuck. Is there another mechanism - a sort of (?i) - so I could shorten the field constraint ?

Thanks in advance

0 0

Replies

replied on November 7, 2016

There is no way to set the whole regex as case insensitive. Why do you have your field setup with those types of values? Maybe it would work better as two fields where one of them is a list and the other is the optional numeric value.

0 0
replied on November 7, 2016 Show version history

Having 2 fields is definitely unacceptable for the current scenario. How about pushing the 400 characters limit?

0 0
replied on November 7, 2016

Can you please explain the current scenario in more detail? I'm hoping we can find an easier solution for you that avoids the need to create such complex constraints. 

0 0
replied on November 8, 2016

Hello Tessa,

Can you please tell me what information is unclear/missing in my request ?

0 0
replied on November 8, 2016

We would like to know why your fields are configured this way, what business problem is being solved.

0 0
replied on November 8, 2016

Document piles - composed of different document types, e.g. invoice, bill of lading, etc - are scanned; each page is OCRed and pattern matched to identify which document type it belongs to. So let say a pile has 500 pages; a workflow pattern matches each page as per its document type and ID, e.g. invoice #12345. That information is then stored in a multi-value field (one value per page). Afterwards, the user has to check if each page's information is okay and, if so, another workflow splits the pile into different Laserfiche documents based on consecutive pages, e.g. invoice #12345 runs from pages 9 to page 12, and so on. This issue is: if the workflow fails to pattern match a given page (or do so inappropriately), the user has to enter manually the information into the multi-value field. In doing so, I don't want to restrict the user in typing only lowercase or uppercase letters. For instance, 'Invoice 12345' and 'INVOICE 12345' need to be ok, not 'Invoice ABCDE'. So the constraint looks like ^[Ii][vV][oO][iI][cC][eE]\x20[1-9]\d*$. Given there are many document types, the multi-value field constraint exceeds the actual 400 characters limit.

Hope that helps ;-)

0 0
replied on November 8, 2016 Show version history

I see. Have you considered using Quick Fields for this scenario? This seems like exactly the kind of situation that Quick Fields was made for :)

Quick Fields would be able to run OCR (if needed), identify document types and split the pile into individual documents based on the pattern matching, and much more. It also has many tools to significantly speed up manual review so you could quickly fix any issues with the documents, including misidentified pages and OCR issues, as soon as they are scanned. 

This would avoid the constraint issues altogether, since there would be no need to record the problems in fields corresponding to each page. The problems could likely instead be fixed in-place by using Quick Fields.

1 0
replied on November 8, 2016

Like I said, the user may have to enter manually the information into the multi-value field when the workflow, or even QF as you suggest, fails to pattern match a given page (or do so inappropriately).

0 0
replied on November 8, 2016

I suppose what I'm confused about is: why do they have to manually enter a description of what to fix instead of just fixing the issue right then? If the fixes are automated based on the user's input, what sort of fixes are they (besides OCR as you mentioned)? Perhaps they are fixes that could be more directly triggered from Quick Fields. 

0 0
replied on November 8, 2016

As described, pattern matching is applied to documents that have been scanned in the first place. Scanning always introduces noise in the image, so OCRization is not perfect and pattern matching fails or yields inappropriate results. In such situations the user has to enter the value manually.

0 0
You are not allowed to follow up in this post.

Sign in to reply to this post.