Question

Automating Conversion of PDF to Laserfiche Image (TIFF)

asked on May 24, 2017

I can't believe that something so simple to do in Client or Web Access is such a nightmare to automate.

Using Laserfiche Client or Web Access, I can easily complete the following three steps for documents:

  1. Generate Pages
  2. Generate Searchable Text
  3. Delete Electronic Files

What simple solution can I use to automate this process without having to copy or move my file? I've also posed this question to our VAR. Thanks!

Replies

replied on May 24, 2017

Import Agent can do that.

replied on May 24, 2017

Great, I'll try it and update everyone later on my success (I hope). Thanks.

replied on May 24, 2017

To clarify ... the document is already in the repository ... I want to strip the PDF and replace it with a TIFF after it is already in the system. Will Import Agent still work?

replied on May 24, 2017

As the documents are already in the repository, I would use Quick Fields Agent.

replied on May 24, 2017

Tried that ... and came close. We're looking for more guidance from our VAR to see where we may have gone wrong. We may go back to Quick Fields Agent as our final solution.

replied on May 24, 2017

Where did you get stuck in Quick Fields?

Post Processing is where you go to delete the original.

Page Processing -> OmniPage OCR will get your text.

 

replied on May 24, 2017

I don't have access to Quick Fields on the server, so it's hard for me to explain. I was working with IS and he was "driving". I just know the end result was not placed in the folder where we wanted it. We are trying again tomorrow, so we'll see if we can get Quick Fields to work the way we want. Stay tuned!!!

replied on May 25, 2017

Erik, this was helpful but I need to know how to create the image that gets OCR'd. Can Quick Fields do that?

SELECTED ANSWER
replied on May 25, 2017

In the Laserfiche Capture Engine, Document Content:

These are the settings we use for PDFs.

Then you'll have images you can run OCR on.  It's a good idea to enable Auto Rotate in the OmniPage OCR process settings.

replied on May 30, 2017

That worked! Now I cannot delete the PDF. In OmniPage OCR, I have "Keep PDF after using it to generate Laserfiche pages" checked off.

replied on May 30, 2017

Close now! In the Capture Engine, go to Post-Processing and choose Add Action > Delete.

Make sure you are saving the newly processed image under Document Class > Document Class Options > Document Storage.

replied on May 30, 2017

Delete is greyed out.

replied on May 30, 2017

Does the account Quick Fields uses to connect to the repository have delete permission in the repository? I've never encountered that problem, so that's all I can think of to troubleshoot.

replied on May 30, 2017

Do you have other post-processing actions set? "Delete" is not available when other actions, such as Move or Assign Tags, are set, because they're incompatible with it.

replied on May 31, 2017

@████████, yes, the account does have delete permissions. You've been EXTREMELY helpful! Thank you for your support. With your help, I got Quick Fields to do what we needed it to do. We are using Workflow for the rest of the tasks.

@████████, not sure, will get IS to look today when we meet. Thanks for the feedback.

replied on May 31, 2017

@████████, you were correct! We did have something post-processing. Removed it and the "delete" function was available to choose. Thanks again.

replied on May 24, 2017

It depends on whether the PDF files in question are ALREADY in the repository. If they're not, then Gloria is correct: Import Agent works well for that. If the PDFs are already in the repository, the way I would approach it is to write a workflow script that exports the PDFs to a folder on the Workflow server that is monitored by Import Agent, which imports them back in. Then Workflow puts the images back onto the original entries they came from and deletes the PDF. Unfortunately, that is not necessarily trivial to implement.
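
As a rough illustration of the order of operations in that round trip (this is not Laserfiche SDK or Workflow code), here is a minimal Python sketch. `Entry`, `replace_pdf_with_images`, and the `rasterize` callback are hypothetical names made up for this example; a real implementation would do the export and re-import through Workflow and Import Agent. The one point it tries to show is that the PDF is deleted only after image pages have actually been attached to the original entry.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Entry:
    """Hypothetical stand-in for a repository document (not an SDK type)."""
    entry_id: int
    electronic_file: Optional[bytes] = None            # the attached PDF, if any
    pages: List[bytes] = field(default_factory=list)   # image pages, e.g. TIFFs


def replace_pdf_with_images(entry: Entry,
                            rasterize: Callable[[bytes], List[bytes]]) -> bool:
    """Attach image pages generated from the entry's PDF, then drop the PDF.

    `rasterize` is an assumed callback that stands in for the export and
    re-import step. Returns True if the entry was converted, False if it
    was left untouched.
    """
    if entry.electronic_file is None:
        return False  # nothing to convert

    new_pages = rasterize(entry.electronic_file)
    if not new_pages:
        return False  # conversion produced no pages, so keep the PDF

    # Put the images back on the original entry first ...
    entry.pages = list(new_pages)
    # ... and only then delete the electronic file.
    entry.electronic_file = None
    return True
```

Keeping the delete as the last step means a failed conversion leaves the original PDF in place rather than producing an empty document.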

replied on May 24, 2017

Thanks! It is already in the repository. Will try this, though.

replied on May 24, 2017

I believe the document export script this would need requires the SDK. Perhaps development can chime in here to confirm.

replied on March 26, 2020

This is still a big issue in our office, especially now that we are working off-site and using Web Access! We can generate pages on PDFs but then cannot find a way to delete the electronic PDF pages. I wish we could have implemented what Jeremy is suggesting (or did suggest in May of 2017), but I suspect it would be expensive to implement since we do not use the SDK yet.

replied on March 27, 2020

Hi Connie,

 

I have a solution for you that we developed in Workflow to automate all of this. Pop me an email (sheldon@noscotek.co.za) and let's discuss how we can help you.
