Question

Extract text from imported document using DocumentImporter.ExtractTextFromEdoc

asked on September 17, 2019

Hi LF community,

 

I'm using the SDK to import a PDF document from a local path into Laserfiche, and it seems that setting

DI.ExtractTextFromEdoc = true;

does not extract the text from my PDF. I tried this command:

"C:\Program Files\Common Files\Laserfiche\Text Provider\TextProvider64.exe" -cmd -ExtractTextFromFile c:\your-pdf-file.pdf c:\your-text-file-output-64.txt

and it works fine for me: the text is extracted from the PDF.

The classical method of dragging and dropping the PDF into Laserfiche and selecting the option to generate searchable text also works fine.

So I don't know what's wrong with my code.

 

I would appreciate any kind of help!

 

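// Connect to the repository and log in.
RepositoryRegistration myRepoReg = new RepositoryRegistration("192.168.0.40/laserfiche", "Test");
Session mySess = new Session();
mySess.LogIn("admin", "admin", myRepoReg);

// Create a new document in \Import\COM on the DEFAULT volume,
// auto-renaming it if the name is already taken.
DocumentInfo document = new DocumentInfo(mySess);
document.Create("\\Import\\COM", "DEFAULT", EntryNameOption.AutoRename);

// Configure the importer; this is the setting that does not seem to take effect.
DocumentImporter DI = new DocumentImporter();
DI.Document = document;
DI.ExtractTextFromEdoc = true;

//DI.OverwritePages = true;

// Import the PDF as the electronic document, then release the lock.
DI.ImportEdoc("application/pdf", "C:\\ImortTest\\com.pdf");
document.Unlock();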

Thank you,

Replies

replied on September 17, 2019

The Adobe PDF IFilter 11 has known problems: it examines the name of the calling process and aborts the operation if the process name is not whitelisted. Try using a different IFilter, such as version 9.5.5.

replied on September 30, 2019

Hi Robert,

Sorry for the delay. I tried different IFilters and different IFilter versions, and the problem is still the same. However, the classical method works fine.

Any clue?

Thank you. 
