Question

Is OCR for PDFs possible with the Laserfiche SDK, or does OcrEngine only work on images?

asked on January 15, 2014

Hi,

I want to apply OCR to PDF documents. I have been looking at the OcrEngine class in the Laserfiche SDK, but it seems to work only with images. Does OcrEngine work with PDFs? If not, how can I get the text of a PDF?

This is the code I'm using:

 

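// Load the OCR engine and configure it.
OcrEngine motorOcr = OcrEngine.LoadEngine();
motorOcr.AutoOrient = true;
motorOcr.Decolumnize = true;
motorOcr.OptimizationMode = OcrOptimizationMode.Accuracy;

// Run OCR on the document identified by id, using the session varSession.
motorOcr.Run(Document.GetDocumentInfo(Convert.ToInt32(id), varSession));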

 

 

Thanks.

 

 

 


Answer

SELECTED ANSWER
replied on January 16, 2014

The OCR engine processes only the Laserfiche pages of the specified document. This means the code you posted cannot extract text from the PDF, which is stored as an electronic file rather than as pages. I would suggest using TextExtractor in DocumentServices to get the PDF's text.


Replies

replied on January 16, 2014

Ok

thanks

Huazhen Liu