You are viewing limited content. For full access, please sign in.

Question

Question

Scanning PDF files & Import Agent Issues?

asked on September 13

User is currently scanning tiff files.  Import Agent performs OCR on the file and imports the tiff.  They are moving to new scanners that will scan PDF files.  Import Agent generates pages, performs OCR on the file, and deletes the PDF file.

Has anyone experienced issues with image quality, corruption, or OCR scanning PDF files and using Import Agent?  The user is looking for any feedback on potential pitfalls by changing the process to using PDF files.

0 0

Replies

replied on September 13

Out of curiosity, is there are reason they aren't scanning directly into Laserfiche with LF Scanning?

0 0
replied on September 13

They are using networked multi-function devices to scan.  The current machines will be replaced with devices that scan color PDF files as opposed to the current set up of black and white tiffs.  They are looking for assurances that they will not experience issues with the change.

0 0
replied on September 13 Show version history

I've never seen any issues with quality when importing color PDF through the current versions, but there are a few other things to consider.

  1. Make sure monochrome imports is not enabled so the import profile will generate color page images.
  2. Make sure the import profiles are configured to pull in PDF files or all files regardless of extension.
  3. Color is an all-or-nothing setting, so every page of the documents will be generated in color; this will significantly increase document sizes compared to having only select pages in color.
  4. The conversion from PDF to TIFF-LZW pages will cause a substantial size increase compared to the source PDF.
  5. Check the LF and IA versions and run some tests because past versions did have bugs that caused conversion issues when importing PDFs generated by some scanning software; this seemed to be source-specific, so if your tests go well it probably wouldn't be an issue even in older versions.

I've never had a problem with corruption. OCR is resource-intensive, so in our environment we use a DCC and OCR Scheduling in Workflow, which also provides more settings/options, to handle OCR after import rather than doing it through the Import Agent profiles.

0 0
You are not allowed to follow up in this post.

Sign in to reply to this post.