Question

Remove pages with dark background

asked on November 30, 2022

Is there a way to remove pages that have a very dark background? These pages serve no purpose, and I would like to have them removed during scanning if possible.

Note: The person doing the scanning uses a multi-function printer that scans to a shared folder on the LF server; Import Agent then pulls the files into the repository to be processed by QF. I have asked if they can just avoid scanning these pages, but they scan in bulk, so it would be time-consuming for them to go through each packet and remove these pages.

 

 

Replies

replied on November 30, 2022

I'd say you're probably going to have more luck looking at the scanning software to see if it has any options for removing pages and such.

By the time the document gets to Laserfiche, it is a rendered image, so detecting this kind of thing becomes exceptionally difficult and far less effective than doing it at the source.

For example, we use Fujitsu scanners, and they have some really great features in the scanning software for situations exactly like this one.

replied on November 30, 2022

Thanks Jason, but they're using a multi-function printer/scanner to do the scanning, so the options aren't necessarily similar to what would be available on a PC with a dedicated scanner and drivers. I can check and see what's available when they scan, but I was hoping there was something available in QF to handle these types of images.

replied on December 1, 2022

That makes sense. I think the challenge you're going to have is that there's not really an automated way to detect this sort of thing.

It looks very obvious to us, but I've yet to encounter any image processing software with the contextual awareness to recognize things like this with enough accuracy to be left to run unsupervised.

The thing to consider is that a dark background could be totally legitimate. For example, if you scanned a dark photograph in black and white, you'd end up with something that has very similar properties.
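To illustrate why a simple darkness check can't tell these cases apart, here is a rough Python sketch. It is not anything built into Quick Fields or any scanner software; the function names, the threshold of 64, and the 80% cutoff are all made up for illustration, and the pages are just grids of grayscale values (0 = black, 255 = white) rather than real scans.

```python
def dark_fraction(pixels, threshold=64):
    """Share of pixels darker than `threshold` (0 = black, 255 = white)."""
    flat = [value for row in pixels for value in row]
    if not flat:
        return 0.0
    return sum(1 for value in flat if value < threshold) / len(flat)


def looks_dark(pixels, threshold=64, min_fraction=0.8):
    """Hypothetical rule: call a page 'dark' if most of its pixels are dark."""
    return dark_fraction(pixels, threshold) >= min_fraction


# Synthetic stand-ins for scanned pages (assumed data, not real scans).
unwanted_dark_page = [[10] * 100 for _ in range(100)]
dark_photo = [[30 + (x + y) % 20 for x in range(100)] for y in range(100)]
normal_page = [[245] * 100 for _ in range(100)]

print(looks_dark(unwanted_dark_page))  # True
print(looks_dark(dark_photo))          # True, even though this page is legitimate
print(looks_dark(normal_page))         # False
```

The unwanted page and the legitimate dark photograph both trip the rule, which is the false positive described above. Raising or lowering the thresholds only moves the problem around.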

You could try something like running OCR on the page and dropping the page if no text is detected, but there just aren't many options to automate this, even when looking at third-party software.
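As a rough sketch of the OCR idea, assuming you already have the OCR text for each page from whatever engine you use (the function name and the `min_chars` cutoff are hypothetical), the check itself could be as simple as this. It only lists candidate pages; it doesn't delete anything.

```python
def pages_without_text(ocr_text_by_page, min_chars=5):
    """Return 1-based page numbers whose OCR text has fewer than
    `min_chars` letters or digits. Stray punctuation from OCR noise
    is ignored so it doesn't count as real text."""
    flagged = []
    for number, text in enumerate(ocr_text_by_page, start=1):
        usable = sum(1 for ch in text if ch.isalnum())
        if usable < min_chars:
            flagged.append(number)
    return flagged


# Assumed sample OCR output for a four-page packet.
sample = [
    "Special Handling\nAccount 12345",
    "",
    " . ,  ~",
    "Invoice total: 40.00",
]
print(pages_without_text(sample))  # [2, 3]
```

Keep in mind that a dark form with printed text may still return some OCR text, and a legitimate photo page returns none, so the flagged list is better treated as a review queue than as a delete list.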

As good as our Fujitsu scanning software is, it's still not foolproof and may drop things we don't want dropped, so we don't let it drop pages without someone keeping an eye on things for QA.

Even if you find an option for this, I would not trust it to run without having someone spot-check the results for accuracy.

1 0
replied on December 1, 2022

Thanks for the feedback; I do think we'll just have to keep these pages for now. I asked the user if they could start using white paper instead of pink for the "Special Handling" form, and they are willing to do that, so the problem will go away once they start scanning with the new forms.

 
