URDU NASKH TYPOGRAPHY PATTERN RECOGNITION OF TYPE FOUNDRIES

Naila Fareen, Mohammad Abid Khan & Attash Durrani

Naila Fareen
Dept. of Computer Science, Allama Iqbal Open University
Islamabad, Pakistan
[email protected]

Mohammad Abid Khan
Dept. of Computer Science, University of Peshawar
Peshawar, Pakistan
[email protected]

Attash Durrani
Dept. of Computer Science, Allama Iqbal Open University
Islamabad, Pakistan
[email protected]

Abstract

Optical Character Recognition (OCR) is the process of converting printed, handwritten, and typed text into an equivalent machine-readable form. The study of OCR is growing in popularity around the world, including in Pakistan. A large body of work exists on Urdu literature and the history of Muslims, and these old documents need to be transferred into electronic form. This research addresses pattern recognition of typed Urdu Naskh text produced by type foundries, focusing primarily on recognizing the typed Urdu Naskh font. The aim is to develop a more reliable, accurate, and fast system for typed Urdu Naskh pattern recognition, so that the cultural heritage left by our ancestors over the centuries can be put to use. The research introduces and evaluates different font sizes of type-foundry Naskh fonts from the perspective of spatial resolution, normalization, training, segmentation, and downsampling. A new method is proposed for the offline printed character recognition problem, which applies segmentation, training, and recognition algorithms in sequence. The proposed method is tested on digits and characters from various type foundries. The system is implemented using the self-organizing map (SOM) technique, and a recognition rate of 98.18% is achieved.
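To make the downsampling and SOM steps named in the abstract concrete, the following is a minimal, generic sketch and not the implementation evaluated in this paper. It assumes each segmented glyph is a binary bitmap at least as large as the target grid. All function names, the map size, the learning rate, and the linear decay schedules are hypothetical choices made for illustration only.

```python
import math
import random


def downsample(bitmap, out_rows, out_cols):
    """Block-average a 2D list of 0/1 pixels to out_rows x out_cols, flattened.

    Assumes the bitmap has at least out_rows rows and out_cols columns.
    """
    rows, cols = len(bitmap), len(bitmap[0])
    features = []
    for r in range(out_rows):
        r0, r1 = r * rows // out_rows, (r + 1) * rows // out_rows
        for c in range(out_cols):
            c0, c1 = c * cols // out_cols, (c + 1) * cols // out_cols
            block = [bitmap[i][j] for i in range(r0, r1) for j in range(c0, c1)]
            features.append(sum(block) / len(block))
    return features


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def best_matching_unit(weights, x):
    """Index of the map node whose weight vector is closest to x."""
    return min(range(len(weights)), key=lambda k: sq_dist(weights[k], x))


def train_som(samples, grid_rows, grid_cols, epochs=50, lr0=0.5, seed=0):
    """Train a rectangular SOM; returns one weight vector per node (row-major)."""
    rng = random.Random(seed)
    dim = len(samples[0])
    coords = [(r, c) for r in range(grid_rows) for c in range(grid_cols)]
    weights = [[rng.random() for _ in range(dim)] for _ in coords]
    sigma0 = max(grid_rows, grid_cols) / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # assumed linear decay
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # assumed shrinking radius
        for x in rng.sample(samples, len(samples)):
            br, bc = coords[best_matching_unit(weights, x)]
            for k, (r, c) in enumerate(coords):
                # Gaussian neighborhood centered on the best-matching unit
                h = math.exp(-((r - br) ** 2 + (c - bc) ** 2) / (2 * sigma ** 2))
                w = weights[k]
                for d in range(dim):
                    w[d] += lr * h * (x[d] - w[d])
    return weights


def label_nodes(weights, samples, labels):
    """Give each winning node the majority label of the samples mapped to it."""
    votes = {}
    for x, y in zip(samples, labels):
        k = best_matching_unit(weights, x)
        votes.setdefault(k, {}).setdefault(y, 0)
        votes[k][y] += 1
    return {k: max(v, key=v.get) for k, v in votes.items()}


def classify(weights, node_labels, x):
    """Return the label of the closest labeled node."""
    k = min(node_labels, key=lambda n: sq_dist(weights[n], x))
    return node_labels[k]
```

In use, each training glyph would be passed through `downsample`, the resulting feature vectors through `train_som` and `label_nodes`, and an unseen glyph through `downsample` and `classify`. Block averaging is only one possible downsampling scheme, and labeling nodes by majority vote is one common way to turn an unsupervised map into a classifier. Neither is claimed to match the method reported here.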