Fadi Sindran, Firas Mualla, Tino Haderlein, Khaled Daqrouq & Elmar Nöth Rule-Based Standard Arabic Phonetization at Phoneme, Allophone, and Syllable Level Fadi Sindran
[email protected] Faculty of Engineering Department of Computer Science /Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Firas Mualla
[email protected] Faculty of Engineering Department of Computer Science /Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Tino Haderlein
[email protected] Faculty of Engineering Department of Computer Science/Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Khaled Daqrouq
[email protected] Department of Electrical and Computer Engineering King Abdulaziz University, Jeddah, 22254, Saudi Arabia Elmar Nöth
[email protected] Faculty of Engineering Department of Computer Science /Pattern Recognition Lab Friedrich-Alexander-Universität Erlangen-Nürnberg Erlangen, 91058, Germany Abstract Phonetization is the transcription from written text into sounds. It is used in many natural language processing tasks, such as speech processing, speech synthesis, and computer-aided pronunciation assessment. A common phonetization approach is the use of letter-to-sound rules developed by linguists for the transcription from grapheme to sound. In this paper, we address the problem of rule-based phonetization of standard Arabic. 1The paper contributions can be summarized as follows: 1) Discussion of the transcription rules of standard Arabic which were used in literature on the phonemic and phonetic level. 2) Improvements of existing rules are suggested and new rules are introduced. Moreover, a comprehensive algorithm covering the phenomenon of pharyngealization in standard Arabic is proposed.