TU Dortmund -> Department of Computer Science -> LS XII -> Pattern Recognition Group -> Publications -> Publication Details

Novel Sub-character HMM Models for Arabic Text Recognition


Irfan Ahmad, Leonard Rothacker, Gernot A. Fink, Sabri Mahmoud In Proc. Int. Conf. on Document Analysis and Recognition, Washington DC, USA, 2013.

Hidden Markov Model (HMM) is one of the most widely used classifier for text recognition. In this paper we are presenting novel sub-character HMM models for Arabic text recognition. Modeling at sub-character level allows sharing of common patterns between different contextual forms of Arabic characters as well as between different characters. The number of HMMs gets reduced considerably while still capturing the variations in shape patterns. This results in a compact and efficient recognizer with reduced model set and is expected to be more robust to the imbalance in data distribution. Experimental results using the sub-character model based recognition of handwritten Arabic text as well printed Arabic text are reported.

 [bib] [pdf]