Question

Indexing pages or OCRing pages or "Generate Pages" programatically

SDK

Updated November 21, 2016

asked on November 21, 2016

So currently when I save a PDF using the JRA/SDK it goes in fine, it has all the attributes I want, the extension I want it's all beautiful with one problem. It doesn't get OCR'd, so when a user clicks on the link to the doc from the web interface they are prompted with the following:

Normally when a file is uploaded via the web it is automatically OCRd and so when they click on the link to the page it just brings up the editing with the tags and all is well.

I can manually achieve this after JRA'ing a file into the repository then going through the browser interface then choosing to edit the doc then clicking the advanced menu option and clicking "Generate Pages". My question is -- can this be done via the JRA/SDK so that it's ready to edit the minute it gets clicked on and so that the pages are indexed and all that jazz?

0 0

Replies

replied on November 21, 2016

You can use the SDK to OCR documents, you can refer to some sample code here.

1 0

replied on November 21, 2016

Generate Pages is not available directly through the SDK due to 3rd party license restrictions. The one possible method is to use the CAT library to generate pages through the desktop client. Check out the sample code in this presentation for more information. Specifically the "GeneratePages" method on the MainWindow class.

0 0

You are not allowed to follow up in this post.

Question

Question

Indexing pages or OCRing pages or "Generate Pages" programatically

Replies

Sign in to reply to this post.