A New Approach in Persian Handwritten Letters Recognition Using Error Correcting Output Coding

Authors

1 Master Student at Department of Computer Science, University College of Rouzbehan, Sari, Iran

2 Research Assistant of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China

3 Department of Electrical Engineering, Sari Branch, Islamic Azad University, Sari, Iran

Abstract

Classification Ensemble, which uses the weighed polling of outputs, is the art of combining a set of basic classifiers for generating high-performance, robust and more stable results. This study aims to improve the results of identifying the Persian handwritten letters using Error Correcting Output Coding (ECOC) ensemble method. Furthermore, the feature selection is used to reduce the costs of errors in our proposed method. ECOC is a method for decomposing a multi-way classification problem into many binary classification tasks; and then combining the results of the subtasks into a hypothesized solution to the original problem. Firstly, the image features are extracted by Principal Components Analysis (PCA). After that, ECOC is used for identification the Persian handwritten letters which it uses Support Vector Machine (SVM) as the base classifier. The empirical results of applying this ensemble method using 10 real-world data sets of Persian handwritten letters indicate that this method has better results in identifying the Persian handwritten letters than other ensemble methods and also single classifications. Moreover, by testing a number of different features, this paper found that we can reduce the additional cost in feature selection stage by using this method.

Keywords