You are viewing limited content. For full access, please sign in.

Question

Question

SDK DocumentServices - TextExtractor

asked on September 17, 2015
In the next example, the OCR process works well, but not the TextExtractor process , any suggestion?
 
      docInfo.Lock(LockType.Exclusive)
            If docInfo.EntryType = EntryType.Document Then
                If docInfo.Extension.Trim = "" Then

                    Using ocrEngine As OcrEngine = ocrEngine.LoadEngine()
                        ocrEngine.AutoOrient = True
                        ocrEngine.Decolumnize = True
                        ocrEngine.OptimizationMode = OcrOptimizationMode.Accuracy
                        ocrEngine.Run(docInfo)
                    End Using
                Else

                    Using objExtraerTexto As TextExtractor = TextExtractor.LoadExtractor
                        objExtraerTexto.ExtractFrom(docInfo)
                    End Using
                End If
            End If
            docInfo.Unlock()

 

0 0

Answer

SELECTED ANSWER
replied on August 19, 2017

Hi Marco,

I suspect you've moved on but your code is fine. The TextExtractor requires an iFilter to match the document's extentsion. Mime-type doesn't seem to matter.

In my test, I had a .CSV and a .LOG file. Both were just ASCII and both were application/octet-stream.

Configuring Windows to treat .LOG (in my example) the same as .CSV, is another issue.

0 0

Replies

You are not allowed to reply in this post.
You are not allowed to follow up in this post.

Sign in to reply to this post.