Inaccurate OCR Results when Importing predefined files using Import Agent

asked on November 19, 2014 • Show version history

Because Import Agent does not currently OCR Arabic documents, an external OCR program is used to create an indexed image file with converted text. Import Agent seems to misinterpret the text file that is being input and outputs nonsensical characters in Laserfiche.

I'm not sure what is going on here, where in the process is the Text being converted from "مبادئ توجيهية لتعديل دليل البراءات" to "Ø§ÙØ¨Ø±Ø§Ø¡Ø§Øª"? Is that an Import Agent issue or a setting that is not correct?

A screenshot is below. As you can see I provide the .lst file, the .txt file output by our OCR engine, and the results of those files being run through Import Agent.

0 0

Question

Question

Inaccurate OCR Results when Importing predefined files using Import Agent

Answer

Replies

Sign in to reply to this post.