Arabographic Optical Character Recognition (OCR)

Universität Leipzig

The OpenITI team, building on the foundational open-source OCR work of Leipzig University’s Alexander von Humboldt Chair for Digital Humanities, has reported Optical Character Recognition (OCR) percentage accuracy rates for classical Arabic-script texts in the high nineties. These numbers are based on tests of seven different Arabic-script texts of varying quality and typefaces, totaling over 7,000 lines (approx. 400 pages,

» Read more