Options
Modular approach to recognition of strokes in Telugu script
Date Issued
01-12-2007
Author(s)
Jayaraman, Anitha
Indian Institute of Technology, Madras
Indian Institute of Technology, Madras
Abstract
In this paper, we address some issues in developing an online handwritten character recognition(HCR) system for an Indian language script, Telugu. The number of characters in this script is estimated to be around 5000. A character in this script is written as a sequence of strokes. The set of strokes in Telugu consists of 253 unique strokes. As the similarity among several strokes is high, we propose a modular approach for recognition of strokes. Based on the relative position of a stroke in a character, the stroke set has been divided into three subsets, namely, baseline strokes, bottom strokes and top strokes. Classifiers for the different subsets of strokes are built using support vector machines(SVMs). We study the performance of the classifiers for subsets of strokes and propose methods to improve their performance. A comparative study using hidden Markov models(HMMs) shows that the SVM based approach gives a significantly better performance. © 2007 IEEE.
Volume
1