<<
Home , Gaf

Appendix A Buckwalter Transliteration

Table A.1 Letters with windows 1256, ISO 8859-6, and character encoding and corresponding Buckwater transliteration Letter Description Win. CP-1256 ISO 8859-6 Unicode Buckwalter

 Letter C1 C1 U+0621 ’   Letter Alef, Madda above C2 C2 U+0622 |   Letter Alef, Hamza above C3 C3 U+0623 >  Letter , Hamza above C4 C4 U+0624 &  Letter Alef, Hamza Below C5 C5 U+0625 <   Letter Yeh, Hamza above C6 C6 U+0626 }  Letter Alef C7 C7 U+0627 A  Letter Beh C8 C8 U+0628 b Letter Teh Marbuta C9 C9 U+0629 p

 Letter Teh CA CA U+062A t

 Letter Theh CB CB U+062B v Letter Jeem CC CC U+062C j Letter Hah CD CD U+062D H  Letter Khah CE CE U+062E x   Letter Dal CF CF U+062F d  Letter Thal D0 D0 U+0630 *  Letter Reh D1 D1 U+0631 r   Letter Zain D2 D2 U+0632 z  Letter Seen D3 D3 U+0633 s

 Letter Sheen D4 D4 U+0634 $   Letter Sad D5 D5 U+0635 S  Letter Dad D6 D6 U+0636 D  Letter Tah D8 D7 U+0637 T

M. Elmahdy et al., Novel Techniques for Dialectal Arabic Speech Recognition, 87 DOI 10.1007/978-1-4614-1906-8, © Springer Science+Business Media New York 2012 88 A Buckwalter Transliteration

Table A.1 (continued) Letter Description Win. CP-1256 ISO 8859-6 Unicode Buckwalter   Letter Zah D9 D8 U+0638 Z   Letter Ain DA D9 U+0639 E  Letter Ghain DB DA U+063A g _ Tatweel DC E0 U+0640 _  Letter Feh DD E1 U+0641 f  Letter Qaf DE E2 U+0642 q  Letter Kaf DF E3 U+0643 k  Letter Lam E1 E4 U+0644 l   Letter Meem E3 E5 U+0645 m  Letter Noon E4 E6 U+0646 n Letter Heh E5 E7 U+0647 h  Letter Waw E6 E8 U+0648 w  Letter Alef Maksura EC E9 U+0649 Y  Letter Yeh ED EA U+064A y   Fathatan F0 EB U+064B F  Dammatan F1 EC U+064C N  Kasratan F2 ED U+064D K

 Fatha F3 EE U+064E a !  Damma F5 EF U+064F u  Kasra F6 F0 U+0650 i "  Shadda F8 F1 U+0651 ∼ #  Sukun FA F2 U+0652 o $ Letter Superscript Alef – – U+0670 ‘  Letter Alef Wasla – – U+0671 { % Letter Peh 81 – U+067E P % Letter Tcheh 8D – U+0686 J

 Letter Veh – – U+06A4 V & Letter 90 – U+06AF G Appendix B Egyptian Colloquial Arabic Lexicon

Table B.1 ECA corpus lexicon with Arabic othography, Buckwalter transliteration, and SAMPA phonetic transcription Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA )*  |h ?A:  '(  >bjdy ?abgadi '+  >bdA ?abadan ,+-+  >bryl ?abri:l  .+  >bw ?abu /0+  >byD ?AbyAd’   -12+ vAr ?AsA:r 28  >jAzp ?aga:za -98 201  >Hmr ?AX\mAr   xbArk ?AxbA:rAk -4:1 >xDr ?Axd’Ar

 >d ?add  ?@ >rADy ?ArA:d’i

;A+  >rbEp ?ArbA?\a -4520A+  >rbEtA$r ?ArbA?\tA:SAr  67A+  >rbEyn ?arbi?\i:n B+  >rbE ?ArbA?\    >rD ?Ard’ C + >rnb ?arnab     zrq ?azra?   =+ sAnsyr ?AsAnse:r  'F >sd ?asad ;+'0GF swAn ?AswA:n .F

.0F >sywT ?asyu:t 2HF >$kAl ?aSka:l -GF IJK-GF  >$krk aSkurak >$krk(2) ?ASkurAk -3L >Sfr ?As’fAr ,L >Sl ?As’l   M< Eml ?a?\mil 20< >EyAd ?a?\ya:d

M. Elmahdy et al., Novel Techniques for Dialectal Arabic Speech Recognition, 89 DOI 10.1007/978-1-4614-1906-8, © Springer Science+Business Media New York 2012 90 B Egyptian Colloquial Arabic Lexicon

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA   NOD< >gsTs ?aGust’us 203+-P >fryqyA ?afriqya     20DE2AP >fgAnstAn ?afGanista:n -+ .0Q >ktwbr ?ukto:bAr

R ?@R Al

'U0 T Albld ilbalad M0T AltlAt ittala:t  -+-( V* AljzAyr ?iggaza:yir ;AW( V* AljmEp iggum?\a   '(V* AlHd ilX\add ;030(V* AlHnfyp ilX\anafiyya   NXW(V* Alxmys ilxami:s -7(V* Alxyr ilxe:r   -+'T AldA}ry idda:?iri Y?5'T Aldsm iddasam 'T 'T Aldm iddamm Aldwr ?iddo:r 6+ 'T Aldyn iddi:n ;<2DT AlsAEp ?issa:?\a

CX DT Alsbt issabt ;ZMDT AlslAmp ssala:ma

MDT AlslAm ?issala:mu NWDT Al$ms iSams ;O 0DT ;S[T Al$nTp iSSAnt’a  AlSHp is’s’iX\X\A \+-OT AlTryq ?it’t’Ari:? -]OT AlZhr id’d’uhr

;0+ -AT AlErbyp il?\ArAbiyya .WAT AlEmwm l?\umu:m   ;P-AT Algrdqp ?ilGarda?a ^T >lf ?alf IJK^T ;SD 3T   >lf(2) ?alif AlfsHp ilfusX\a 673T >lfyn ?alfe:n -_23T AlqAhrp ?ilqA:hirA

-O3T AlqTr ?il?At’r ;`T Allh ?AllA: IJK;`T T Allh(2) ?AllA:h >a Ally lli IJK T .F2b c >a Ally(2) ?illi AlmAswrp ?ilmAsu:rA  20+2bc >lmAnyA ?almanya -bc Almrp ?ilmArrA  ;UHDbc Alm$klp ilmuSkila  -Abc Almgrb ?ilmaGrib  .Ubc Almlwk lmulu:k 67<.bc AlmwAEyn ?ilmawa?\i:n   .L.bc AlmwDwE ilmAwd’u:?\ 2deT AlnhAr innAhA:r   2deT AlnhArdh innAhArdA .0T Alnwr innu:r -]T Alhrm ?ilhArAm .]T Alhwl lho:l C P.T ; 3P.T Alwqt ilwa?t  Alwqfp ilwa?fa 6Q2Z >mAkn ?ama:kin 2Z >mAn ?ama:n 20 Z mrykA ?amri:ka 2+ >nA ?ana  ;+ .0 + >nbwbp ?anbu:bit ,_ >hl ?ahl M_ >hlA ?ahlan ;U_ >hlh ?ahlu B Egyptian Colloquial Arabic Lexicon 91

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA  NX+ .+ >wtwbys ?utubi:s ;L >wDp ?o:d’it  >wl ?awwil >f >wlY ?u:la   >y ?ayy 20T2OE ywh ?aywa  2+ bAb ba:b  =T2+ bAlk ba:lak 67 UT2+ bAllbn billaban   -7+ btrwl bitro:l 2S 0X+ btnjAn bitinga:n   >?A1 .0+ btwjEny btiwga?\ni 23+-+ brtqAn burtu?a:n  Y?F-+ brsym barsi:m .P-+ brqwq bar?u:?  -+ bzr bizr ;UDE bslp bisilla ,[E bSl bAs’Al Ng2OE bTATs bAt’A:t’is   67g2OE bTATyn bAt’At’i:n ;0+2OE bTAnyp bAt’t’Aniyya   6OE bTn bAt’n >?OE bTny bAt’ni h0OE bTyx bAt’t’i:x 'A+ bEd ba?\d  6+ 'A+ bEdyn ba?\de:n /A+ bED bA?\d

=T23+ bqAlk ba?a:lak >f23+ bqAly ba?a:li   NE'3+ bqdwns ba?du:nis -3+ bqrp ba?ArA

-G+ bkrh bukrA -G+ bkrp bukrA iU+ blH balaX\  'U+ bldy baladi   20+ bnAt bana:t CX+ bnt bint      ;0+-7+ bnzynp banzi:na 6+ -7+ bnzyn banzi:n     .UO0+ bnTlwn bAnt’Alo:n 20A0+ bnEnAE bini?\na:?\

;+ bh bih 2dj bhArAt buhArA:t  ;+ .+ bwAbp bawwa:ba 2( )*.+ bwtjAz butaga:z

.+ bwdrp budrA .+ bwry bu:ri  ;OF.+ bwsTp bust’a ;0Q.+ bwkyp buke:h

>?k0 X+ bybsy pepsi CX+ byt be:t   -70+ bytzA pitza 2[0+ byDA be:d’a   /0+ byD be:d’ ;[0+ byDp be:d’A   20X+ bynA bi:na '82+ tAxd ta:xud   R2+ tAl|f tala:f >l2+ tAny ta:ni

C()* tHt taX\t ,0U()* tHlyl taX\li:l   -Q'+ t*krp tAzkArA -7+ -+ trAbyzp t’ArAbe:zA  IJK -7+ -+ trAbyzp(2) t’ArAbe:zit C X+-+ trtyb tarti:b

;<-+ trEp tir?\a BDE tsE tisa?\

;ADE tsEp tis?\a -4520ADE tsEtA$r tisa?\tA:SAr 92 B Egyptian Colloquial Arabic Lexicon

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA

;0WADE tsEmyp tus?\umiyya C0WADE tsEmyt tus?\umi:t  67ADE tsEyn tis?\i:n ;U0GDE t$kylp taSki:la i0 [E tSbH tis’bAX\ =0 A+ tEbk ta?\abak  NXA+ tEy$ ti?\i:S 23+ tfAH tuffa:X\  ;123+ tfAHp tuffa:X\a 20 +-3+ tqrybA ta?ri:ban M + -4 52+M + tlAt talat  tlAtA$r tAlAttA:SAr ;+M+ tlAtp tala:ta 67+M+ tlAtyn talati:n

CU+ tlt tilt ;0W0U+ tltmyp tultumiyya  C0W0U+ tltmyt tultumi:t ;0+2bm tmAnyp tamanya 6 7+2b m ,00b m  tmAnyn tamani:n  tmvyl tamsi:l 2Dbm tmsAH timsa:X\ 6b m tmn taman   -45200bm tmntA$r tAmAntA:SAr ;0W0bm tmnmyp tumnumiyya   C0W0bm tmnmyt tumnumi:t 2DX+ tnsA$ tinsa:S   NX+ tns tinis >?UL.+ twSlny tiwAs’s’Alni

,0+ tyl ti:l ;0+ tyh tih  ;+ vh sih ;0+ 28 jA*byp ga:zibiyya  67+28 jAyyn gayyi:n -7 8 jbr gAbr  ;00 1 jbnp gibna '+'8 jdyd gidi:d

'+'8 jdydp gidi:da -1 jrAj gArA:Z =T-1 jrAlk gArA:lak -78 -1 jrjyr gargi:r  -1 jrs gArAs 2+-1 jrnAl gurnA:l   -1 jzAr gAzza:r -1 jzr gAzAr   ;Z-1 jzmp gazma -+-1 jzyrp gizi:ra  2A1 jEAn ga?\a:n ;OU8 jlTp gAlt’A - 8 8  7 9 jmbry gambari ,9 jml gamal 8 8 .]9 jmhwr gumhu:r ;+.]9 jmhwryp gumhuriyyit   ;001 jnyp gine: ;P.1 jwAfp gawa:fa >1 -_.1 jwAhrjy gawahirgi .1 jwrj ZorZ

NX1 jy$ ge:S Y?1 jym gi:m  NX01 jyns Zins .01 jyw$ guyu:S

;1 28 HAjp X\a:ga 28 HAl X\a:l  ;01 Htp X\itta -( 1 Hjz X\agz

'8 Hd X\add -45'8 HdA$r X\idA:SAr -1 -1 Hrwf X\uru:f HzAm X\iza:m  2D1 HsAb X\isa:b ;0F2D1 HsAsyp X\asasiyya  2[1 HSAn X\us’A:n ,[1 HSl X\As’al ;U 31 \1 Hflp X\aflit Hq X\a?? M8 HlAq X\alla:? 298 HmAr X\umA:r B Egyptian Colloquial Arabic Lexicon 93

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA 298 ;`T'98 HmAm X\amma:m  Hmdllh X\amdilla ;1 Hh hAh 67T.1 HwAlyn X\awa:le:n  >f.1 HwAly X\awa:li .1 HwD X\o:d   2+.01 HywAnAt X\ayawa:na:t Yn*28 xAtm xa:tim    >l'8 xdny xudni .g-1 xrTwm xart’u:m   C D1 x$b xaSab ;0L.[1 xSwSyp xus’us’iyya -4: 1 M8 xDrA xAd’rA  xlAT xAllA:t 8 N98 >a xly xalli xms xamas   ;D98 xmsp xamsa -4520D98 xmstA$r xAmAstA:SAr   ;0WD98 xmsmyp xumsumiyya C0WD98 xmsmyt xumsumi:t    67D98 xmsyn xamsi:n ;1 xh xAh   .1 xwx xo:x 201 xyAr xiyA:r  -78 xyr xe:r  dA da  ;b m dAl da:l dAymp dayma ;1  drjp dArAgA  drws duru:s

.< dEwp da?\wa \+2P dqAyq da?a:yi?

;30P dqyqp di?i:?a .0Q dktwr dukto:r - >?P.T dlwqty dilwa?ti 7 WDE dysmbr disimbir     *Al za:l *rp dura   *y zayy ,8  rAjl ra:gil

;0WA+  rbEmyp rub?\umiyya C0WA+  rbEmyt rub?\umi:t  B+  rbE rub?\ 20+  rbnA rAbbina  ;U8 rHlp riX\la ;[1 rxSp ruxs’it  2L   rz ruzz rSAS rus’A:s’ ^0< rgyf riGi:f  2Z rmAdy rumA:di  rh rih >o rwmy ru:mi   ;L2+ ryADp riyA:d’A '+  zbdp zibda   ;< zrAEp zirA:?\A >< zrAEy zirA:?\i    M< zElAn za?\la:n 2+ zyAdp ziya:da   2+ zyArp ziyA:rA =+2+ zyArtk ziyArtak    C+ zyt ze:t .0+ zytwn zatu:n   -+ zyrw zi:ru 6+  zyn ze:n 2F ;A P2F sAdp sa:da  sAqEp sa??\a ;120 F sbAHp siba:X\a p)*20 F sbAnx saba:nix

CX F sbb sabab -7 W0X F sbtmbr sibtimbir

B0 F sbE saba?\ ;A0 F sbEp sab?\a 94 B Egyptian Colloquial Arabic Lexicon

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA

-4520A0 F sbEtA$r sAbA?\tA:SAr ;0WA0 F sbEmyp sub?\umiyya  C0WA0 F sbEmyt sub?\umi:t 67A0 F sbEyn sab?\i:n -4 520F CF stA$r sittA:SAr st sit ;0F stp sitta ;0W0F stmyp suttumiyya  C0W0F stmyt suttumi:t 670F styn sitti:n 5  5 6( q sjn sign ;0(q sxnp suxna -45 -AF srp surrA  sEr si?\r '0AF sEydp sa?\i:da -3F sfrp sufra  -73F sfyr safi:r -GF skr sukkar

=0ZMF slAmtk salamtak CSbr5 smHt samaX\t Y?kbr5 smsm simsim =br5 smk samak 5   ;Gbr smkp samaka 670F snyn sini:n .F ; P.F  swAq sawwa:? swAqp siwa:?a >l.F swdAny suda:ni 2+.F swryA surya  67F syn si:n 2F $ArE Sa:ri?\

>o2F $Amy Sa:mi  2F $Ay Sa:y 5  -( q $jrp SAgarA ;03F $fth Suftu

;3F $qp Sa??a -GF $kr Sukr

-GF $krA SukrAn ,HF $kl Sakl 5 5 Nbr $ms Sams ;0Dbr $msyp Samsiyya  ;Abr5 $mEp Sam?\a ;O0F $nTp SAnt’A

2de5 $hAdp Siha:dit ;+.F $wAyp Sawwa:ya

;+ .F $wrbp Surba ;Q.F $wkp So:ka  ;+.F $wyp Swayya 67F $yn Si:n  .+ 2L SAbwn s’Abu:n 2L SAd s’A:d  .T2L SAlwn s’alo:n 20 L SbAH s’AbA:X\   -3L Sfr s’ifr '0L Sndwq sandu:? .L 2L   Swt s’o:t DAd d’A:d sAL DgT d’AGt’ \0 g Tbq t’AbA?

>?g Tby t’ibbi ;0WAg TEmyp t’A?\miyya   ;+23g TfAyp t’AffA:ya tg2Wg TmATm t’AmA:t’im .Wg TmwH t’umu:X\ ;g Th t’Ah

;+ .g Twbp t’u:bA .g Twl t’u:l  20g TyArp t’AyyA:rA -7g TyrAn t’AyArA:n ;g 2< Zh D’Ah EArf ?\a:rif ;()*-T2< EAlryHp ?\arri:X\a ;ZMDT2< EAlslAmp ?\assala:ma C 0S < Ejyb ?\agi:b '< Eds ?\ads B Egyptian Colloquial Arabic Lexicon 95

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA

>l-< Erby ?\ArAbi 20+ -< ErbyAt ?\ArAbiyya:t  ;0+ -< Erbyp ?\ArAbiyya IJK;0+ -< Erbyp(2) ?\ArAbiyyit

,D< Esl ?\asal -4k< E$r ?\ASAr

-4k< E$rp ?\ASA:rA 6+ -4k< E$ryn ?\iSri:n   -7P2[< ESAfyr ?\As’Afi:r .3[< ESfwrp ?\As’fu:rA  -7[< ESyr ?\As’i:r 2O< EZAm ?\iD’A:m 3< ;0 U< >a Eqly ?\a?li Elbp ?\ilbit >a< ElY ?\ala .G0U< Elykw ?\ale:ku <   -9 Emry ?\umri 6< En ?\an   C 0< Enb ?\inab '0< Endk ?\andak    .0 G0< Enkbwt ?\ankabu:t .0< EnwAn ?\inwA:n

'0< Eyd ?\i:d NX< Ey$ ?\e:S   67< Eyn ?\e:n -< grwb Guru:b   -7< gyr Ge:r 2_-7< gyrhA Gerha    67< gyn Ge:n C+2P fAtt fa:tit    20T.L2P fASwlyA fAs’ulya ,L2P fADl fA:d’il    ;0L2P fADyp fAd’ya CU3T2P fAlqlb fil?alb   -+-7 P fbrAyr fibrA:yir -P frAx fira:x ,Z-P ; 1- P  frAml fArA:mil   frxp farxa ;L-P frSp furs’A 2DE-P frnsA fArAnsA ; [ P =U[ P   fDp fAd’d’A   fDlk fAd’lAk B0OP fZyE faD’i:?\ ,PMP flAfl fala:fil ,3U P .UP  flfl filfil    flws filu:s ;P fh fih .P.P fwdAfwn vodafon . P .P  fwq fo:?   fwl fu:l >P fy fi -7P fyzA vi:za    M0P fylA villa Ya0P fylm film    .00P fynw fi:nu 2P qAf qA:f   .+2P qAnwn qAnu:n ,0 P qbl ?abl

Yn*'P qdym ?adi:m -P qrAr qArA:r

-P qrd ?ird ;+-P qryp qAryA

NE-P qry$ ?ari:S ;OP qTp ?ut’t’A  6O P qTn ?ut’n ;0OP qTnp ?ut’nA CUP YaP qlb ?alb qlm ?alam i9P qmH ?amX\ -9P qmr ?AmAr .] P ;W0 P  qhwp ?ahwa  qymp ?i:ma 2u kAf ka:f 2u kAm ka:m 96 B Egyptian Colloquial Arabic Lexicon

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA  ,Z2u kAml ka:mil 2u kAn ka:n

'0 Q kbdp kibda C+-7 Q kbryt kabri:t

 20Q ktAb kita:b .G0Q ktkwt katku:t  'Q kdAb kadda:b 'Q kdh kida 'Q 2WQ  kdp kida  kmAn kama:n ;0 0Q knbp kanaba '0Q kndA kanada   '0Q kndwz kandu:z 2+ -]Q khrbA’ kAhrAbA

-+ .Q kwbry kubri .Q kwrp ko:rA  ;F.Q kwsp ko:sa NXQ kys ki:s  =30Q kyfk ke:fak R l< la? R R l

20+ .T lwbyA lubya ;1.T lwHp lo:X\a

20T lyA liyya ;U0T lylp le:la  .W0T lymwn lamu:n 2Z mA ma 2Z mArs ma:ris =T2Z mAlk ma:lak

.T2Z mAlw$ malu:S NXT2Z mAly$ mali:S  ;( )*2Z mAnjp manga .+2Z mAyw ma:yu -7Z sF.0Z   mtrw mitru mtwsT mutAwAssit .0( o mjnwn magnu:n .]( o mjhwl maghu:l o o C F2( mHAsb muX\a:sib >o2( mHAmy muX\a:mi

'(o mHd$ maX\addS ;O(o mHTp mAX\At’t’it -W( o ' (o mHmrp miX\AmmAra mxdrAt muxAddArA:t NZ'Z mdms midammis ;+-Z mrAyp mira:ya

;g.+ -Z mrbwTp mArbu:t’A -Z mrp mArrA

C +-Z mrtb murattab 2DZ msA’ masa:?  '<2DZ msAEdp musa?\da v3DXDZ mst$fY mustaSfa -A 0DZ 'DZ mstgrb mistAGrAb msdwdp masdu:da .UDZ mslwq maslu:? .WDZ msmwH masmu:X\

NZ m$ miS NWDZ m$m$ miSmiS

 .DZ m$wy maSwi ;+.DZ m$wyp maSwiyya  -4:Z mSr mAs’r  -4:Z mDrb mAd’rAb 2OZ mTAr mAt’A:r h0 OZ mTbx mAt’bAx tAOZ mTEm mAt’?\Am BZ mE ma?\a 2AZ mEAk ma?\a:k 2+2AZ mEAyA ma?\a:ya  ;0+'AZ mEdnyp ma?\daniyya .3AZ mEqwl ma?\?u:l B Egyptian Colloquial Arabic Lexicon 97

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA

;3UAZ mElqp ma?\la?a IJK;3UAZ mElqp(2) ma?\la?it   -Z2AZ mgAmrAt muGAmrA:t 203Z mftAH mufta:X\    ,3U3Z mflfl mefalfel NX3Z mfy$ mafi:S 3Z 2HZ >a mqly ma?li mkAn maka:n ; +-GZ .DGZ mkrwnp mAkAro:na mkswf maksu:f .GZ mkwp makwa >1 .GZ mkwjy makwagi iUZ mlH malX\ ;U0wx mmvlp mumassila 6Gw x 6Z  mmkn mumkin  mn min ,+'0Z mndyl mandi:l ;3O0Z mnTqp mAnt’i?a   2deZ mnhA minha '0]Z mhnds muhandis '0<.Z mwAEyd mawa?\i:d .1 .Z mwjwd mawgu:d  .Z mwz mo:z v30F.Z mwsyqY musi:qA

;0Z myp miyya IJK;0Z myp(2) mAyyA  C0Z myt mi:t 6700Z mytyn mite:n M0Z mylAd mila:d Y?Z mym mi:m    '82+ nAxd na:xud  2+ nAdy na:di   2+ nAs na:s Yn*2+ nAym na:yim ,()* 2D E  nHl naX\l   n$AT nASA:t ;S0[E nSyHp nAs’i:X\a 2OE nZArp nAd’d’A:rA    tA+ nEm na?\am 20A+ nEnAE ni?\na:?    N3+ nfs nafs ;O3+ nqTp nu?t’it ,bm ;Ub m   nml naml  nmlp namla -7 9P.+ nwfmbr nuvimbir .+ nwm no:m     .+ nwn nu:n .0+ nywn niyo:n -W_ 20_  hmzp hamza hnA hina ;F'0_ hndsp handasa ;_ hh hih

;A+  w

-452+M+ wtlAtA$r wtAlAttA:SAr ;+M+ wtlAtp wtala:ta  67+M+ wtlAtyn wtalati:n CU+ wtlt wtilt  ;0W0U+ wtltmyp wtultumiyya ;0+2bm wtmAnyp wtamanya 98 B Egyptian Colloquial Arabic Lexicon

Table B.1 (continued) Arabic Buckwalter SAMPA Arabic Buckwalter SAMPA    67+2bm wtmAnyn wtamani:n -45200bm wtmntA$r wtAmAntA:SAr ;0 W0b m  -4 5'8   wtmnmyp wtumnumiyya  wHdA$r wX\idA:SAr ;D9 8 -4 520D9 8  wxmsp wxamsa   wxmstA$r wxAmAstA:SAr ;0WD98 wxmsmyp wxumsumiyya 67D98 wxmsyn wxamsi:n y)* wrAyH wrA:yiX\ B+  wrbE wrub?\

;0WA+  wrbEmyp wrub?\umiyya  wrd ward

;P wrqp wara?a ;A0 F wsbEp wsab?\a

-4520A0 F wsbEtA$r wsAbA?\tA:SAr ;0WA0 F wsbEmyp wsub?\umiyya  67A0 F wsbEyn wsab?\i:n -4520F wstA$r wsittA:SAr

;0F wstp wsitta ;0W0F wstmyp wsuttumiyya  670F wstyn wsitti:n Mde5 wshlA wasahlan

;F w$h wiSSu .L wSwl wus’u:l

-4k< wE$rp w?\ASA:rA 6+ -4k< wE$ryn w?\iSri:n

'T wld walad ;0Z wmyp wmiyya   6700Z wmytyn wmite:n /E wnS wnus’s’ '8 wwAHd wwa:X\id 2Z2+ yAmA yama =+'+ ydyk yiddi:k =WUDE yslmk ysallimak   >?A+ yEny ya?\ni ,DA+ ygsl yiGsil  ;U+ ylh yAllA -+20+ ynAyr yana:yir   .DX+ ynswn yansu:n ;+ yh yih  =3P.+ ywfqk ywaffa?ak ;0T.+ ywlyh yulya .0T.+ ywlyw yulyu .+ ywm yo:m   ;0+.+ ywnyh yunya .0+.+ ywnyw yunyu Appendix C Seinnheiser ME-3 Specifications

Seinnheiser ME-3 is a headset microphone of exceptional sound quality, the ME 3 is intended for music and speech applications. The super-cardioid condenser design offers excellent feedback rejection.

Table C.1 Specifications of AF sensitivity 1.6 mV/Pa the Seinnheiser ME-3 Max. sound pressure level (active) 150 dB microphone Pick-up pattern Super-cardioid Transducer principle Electret condenser

Fig. C.1 Polar diagram for Seinnheiser ME-3

M. Elmahdy et al., Novel Techniques for Dialectal Arabic Speech Recognition, 99 DOI 10.1007/978-1-4614-1906-8, © Springer Science+Business Media New York 2012 100 C Seinnheiser ME-3 Specifications

Fig. C.2 Frequency response curve for Seinnheiser ME-3 Appendix D Buddy 6G USB Specifications

The Buddy 6G USB adapter manufacured by InSync speech Technologies, Inc. is based on the Micronas UAC3556b microchip. It has a built-in high-quality sound card, which replaces a desktop or laptop computer’s sound card for high perfor- mance speech sound input and output. It offers full duplex operation for connection with microphone and speakers that is especially well suited to speech recognition applications.

Table D.1 Specifications of the Buddy 6G USB adapter based on the Micronas UAC3556b mi- croship Microphone Supports 8/16 bit mono recording at 6.4 kHz to 48 kHz, sensitivity −54 ± 4 dB impedance < 650 Ohms. Speaker Output Supports 16/24 bit mono/stereo at 6.4 kHz to 48 kHz. Includes a low power stereo amplifier. Signal to Noise Ratio SNR is typically −92 dB for A/D (recording) and −96 dB for D/A (playback). Total Harmonic Distortion THD is better than −90 dB for both A/D (recording) and D/A (playback). Power Self powered from USB bus with less than 100 mA current at 5V DC. Operating Temperature Minimum −10°C (14°F), maximum 70°C (158°F). Storage Temperature Minimum −40°C (−40°F), maximum 75°C (167°F).

M. Elmahdy et al., Novel Techniques for Dialectal Arabic Speech Recognition, 101 DOI 10.1007/978-1-4614-1906-8, © Springer Science+Business Media New York 2012 References

Abdou, S., Hamid, S. E., Rashwan, M., Samir, M., Abd-Elhamid, O., Shahin, M., and Nazih, W. (2006) Computer Aided Pronunciation Learning System Using Speech Recognition Tech- niques. In Proceedings of International Conference on Speech and Language Processing IN- TERSPEECH Afify, M., Nguyen, L., Xiang, B., Abdou, S., and Makhoul, J. (2005) Recent Progress in Arabic Broadcast News Transcription at BBN. In Proceedings of International Conference on Speech and Language Processing INTERSPEECH, Lisbon, Portugal, pp. 1637–1640 Afify, M., Sarikaya, R., Kuo, H. J., Besacier, L., and Gao, Y. (2006) On the use of morphological analysis for dialectal Arabic speech recognition. In Proceedings of International Conference on Speech and Language Processing INTERSPEECH, Pittsburgh, Pennsylvania, pp. 277–280 Alshalabi, R. (2005) Pattern-based Stemmer for Finding Arabic Roots. Information Technology Journal 4(1), pp. 38–43 Appen Pty Ltd, Sydney, Australia (2006a) Iraqi Arabic Conversational Telephone Speech. Linguis- tic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC2006S45 Appen Pty Ltd, Sydney, Australia (2006b) Gulf Arabic Conversational Telephone Speech. Linguis- tic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC2006S43 Appen Pty Ltd, Sydney, Australia (2007) Levantine Arabic Conversational Telephone Speech. Lin- guistic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC2007S01 Atiyya, M., Choukri, K., and Yaseen, K. (2005) Specifications of the Arabic Written Corpus. Nem- lar project Barras, C., Geoffroisb, E., Wuc, Z., and Libermanc, M. (2000) Transcriber: Development and use of a tool for assisting speech corpora production. Speech Communication 33(1–2), pp. 5–22 Billa, J., Noamany, M., Srivastava, A., Liu, D., Stone, R., Xu, J., Makhoul, J., and Kubala, F. (2002) Audio Indexing of Arabic Broadcast News. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 5–8 Buckwalter, T. (2002a) Arabic Transliteration. URL: http://www.qamus.org/transliteration.htm Buckwalter, T. (2002b) Buckwalter Arabic Morphological Analyzer Version 1.0. Linguistic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC2002L49 Cambridge (2010) HTK—Hidden Markov Model Toolkit—Speech Recognition toolkit. URL: http://htk.eng.cam.ac.uk/ Canavan, A., Zipperlen, G., and Graff, D. (1997) CALLHOME Speech. Linguistic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC97S45 Clarkson, P., and Rosenfeld, R. (1997) Statistical Language Modeling Using the CMU-Cambridge Toolkit. In Proceedings of ISCA Eurospeech Carnegie Mellon University (2010a) Sphinx—Speech Recognition Toolkit. URL: http:// cmusphinx.sourceforge.net/ Carnegie Mellon University-Cambridge (2010b) CMU-Cambridge Statistical Language Modeling toolkit. URL: http://www.speech.cs.cmu.edu/SLM/toolkit.html

M. Elmahdy et al., Novel Techniques for Dialectal Arabic Speech Recognition, 103 DOI 10.1007/978-1-4614-1906-8, © Springer Science+Business Media New York 2012 104 References

Darwish, K. (2002) Building a shallow Arabic morphological analyzer in one day. In Proceedings of ACL workshop on computational approaches to semitic languages Djoudi, M., Fohr, D., and Haton, J. P. (1989) Phonetic study for automatic recognition of Ara- bic. In Proceedings of first European conference on speech communication and technology (Eurospeech), Paris, France, pp. 2268–2271 Djoudi, M., Aouizerat, H., and Haton, J. P. (1990) Phonetic study and recognition of standard Ara- bic emphatic consonants. In Proceedings of First International conference on spoken language processing (ICSLP), Kobe, Japan, pp. 957–960 El-Halees, Y. (1989) A study of subglottal pressure for emphatic and non-emphatic sounds in Arabic. In Proceedings of first European conference on speech communication and technology (Eurospeech), Paris, France Elmahdy, M., Gruhn, R., Minker, W., and Abdennadher, S. (2009a) Survey on Common Arabic Language Forms from a Speech Recognition Point of View. In Proceedings of the International Conference on Acoustics (NAG-DAGA), Rotterdam, Netherlands, pp. 63–66 Elmahdy, M., Gruhn, R., Minker, W., and Abdennadher, S. (2009b) Effect of Gaussian Densi- ties and Amount of Training Data on Grapheme-Based Acoustic Modeling for Arabic. In Pro- ceedings of the IEEE international conference on natural language processing and knowledge engineering (IEEE NLP-KE), Dalian, China Elmahdy, M., Gruhn, R., Minker, W., and Abdennadher, S. (2009c) Modern Standard Arabic Based Multilingual Approach for Dialectal Arabic Speech Recognition. In International Symposium on Natural Language Processing (SNLP), Bangkok, Thailand, pp. 169–174 Elmahdy, M., Gruhn, R., Minker, W., and Abdennadher, S. (2010) Cross-Lingual Acoustic Mod- eling for Dialectal Arabic Speech Recognition. In Proceedings of International Conference on Speech and Language Processing INTERSPEECH, Makuhari, Japan, pp. 873–876 Elmahdy, M., Gruhn, R., Abdennadher, S., and Minker, W. (2011) Rapid Phonetic Transcription using Everyday Life Natural Chat Alphabet Orthography for Dialectal Arabic Speech Recog- nition. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic ELRA: European Language Resources Association (2010) URL: http://www.elra.info/ Fegen, C., Steker, S., Soltau, H., Metze, F., and Schultz, T. (2003) Efficient Handling of Multilin- gual Language Models. In Proceedings of Automatic Speech Recognition and Understanding Workshop (ASRU), St. Thomas, Virgin Islands, pp. 441–446 Ferguson, C. (1959) Diglossia. Word 15, pp. 325–340 Fung, P., and Schultz, T. (2008) Multilingual Spoken Language Processing. IEEE Speech Process- ing Magazine 25(3), pp. 89–97 Gal, Y. (2002) An HMM approach to vowel restoration in Arabic and Hebrew. In Proceedings of the ACL-02 workshop on Computational approaches to semitic languages, USA, Association for Computational Linguistics Gales, M. J. F., Diehl, F., Raut, C. K., Tomalin, M., Woodland, P. C., and Yu, K. (2007) Develop- ment of a Phonetic System for Large Vocabulary Arabic Speech Recognition. In Proceedings of Automatic Speech Recognition and Understanding Workshop (ASRU), Kyoto, Japan, pp. 24– 29 Gibbon, D., Moore, R., and Winski, R. (1997) SAMPA computer readable phonetic alphabet. In Handbook of Standards and Resources for Spoken Language Systems. Mouton de Gruyter, Berlin. Part IV, section B Google Labs (2009) Google Transliteration. URL: http://www.google.com/ta3reeb/ Google Labs (2010) Google Tashkeel. URL: http://tashkeel.google.com Gruhn, R., and Nakamura, S. (2001) Multilingual, Speech Recognition with the CALLHOME Corpus (ASJ2001), vol. 1. Acoustical Society of Japan, Japan, pp. 153–154 Gu, L., Zhang, W., Tahir, L., and Gao, Y. (2007) Statistical Vowelization of Arabic Text for Speech Synthesis in Speech-to-Speech Translation Systems. In International Conference on Speech and Language Processing INTERSPEECH, Antwerp, Belgium, pp. 1901–1904 Habash, N., and Rambow, O. (2007) Arabic Diacritization through Full Morphological Tagging. In Proceedings of NAACL HLT, pp. 53–56 References 105

Hassan, Z. M., and Esling, J. H. (2007) Laryngoscopic (Articulatory) and Acoustic Evidence of a Prevailing Emphatic Feature Over the Word in Arabic. In Proceedings of the 16th International Congress of Phonetic Sciences Hinds, M., and Badawi, E. (2009) A Dictionary of Egyptian Arabic. Librairie du Liban, Reprinted Holes, C. (2004) Modern Arabic: Structures, Functions, and Varieties. Georgetown University Press, Washington Huang, X., Acero, A., and Hon, H. (2001) Spoken language processing: a guide to theory, algo- rithm, and system development. Prentice Hall, New York ISO 8859-6 (1987) Information processing—8-bit single-byte coded graphic character sets—Part 6: Latin/. International Organization for Standardization Jurafsky, D., and Martin, J. H. (2009) Speech and language processing: An introduction to nat- ural language processing, computational linguistics, and speech recognition, second edition. Prentice Hall, New York Kaye, A. S. (1970) Modern Standard Arabic and the Colloquials. Lingua 24, pp. 374–391 Kilany, H., Gadalla, H., Arram, H., Yacoub, A., El-Habashi, A., and McLemore, C. (2002) Egyp- tian Colloquial Arabic Lexicon. Linguistic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC99L22 Kirchhoff, K., and Vergyri, D. (2005) Cross-Dialectal Data Sharing For Acoustic Modeling in Arabic Speech Recognition. Speech Communication 46(1), pp. 37–51 Kirchhoff, K., Bilmes, J., Das, S., Duta, N., Egan, M., Ji, G., , F., Henderson, J., Liu, D., Noa- many, M., Schone, P., Schwarta, R., and Vergyri, D. (2002) Novel approaches to Arabic speech recognition: report from the 2002 Johns-Hopkins summer workshop. Technical report, Johns Hopkins University Lagally, K. (1992) ArabTEX Typesetting Arabic with vowels and ligatures. In Proceedings of the EuroTEX92 conference, Prague Lamel, L., Messaoudi, A., and Gauvain, J. (2007) Improved Acoustic Modeling for Transcrib- ing Arabic Broadcast Data. In International Conference on Speech and Language Processing INTERSPEECH, pp. 2077–2080 Lamere, P., Kwok, P., Gouvea, E. B., Raj, B., Singh, R., Walker, W., and Wolf, P. (2003) The CMU SPHINX-4 speech recognition system. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 46(1), pp. 37–51 Linguistic Data Consortium (LDC) (2010) University of Pennsylvania. URL: http://www.ldc. upenn.edu/ Lee, C., and Gauvain, J. (1993) Speaker Adaptation Based on MAP Estimation of HMM Param- eters. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. II–558 Leggetter, C. J., and Woodland, P. C. (1995) Maximum likelihood linear regression for speaker adaptation of the parameters of continuous density hidden Markov models. Computer Speech and Language 9, pp. 171–185 Maamouri, M., Graff, D., Jin, H., Cieri, C., and Buckwalter, T. (2004) Dialectal Arabic Orthography-based Transcription and CTS Levantine Arabic Collection. Paper presented at the Parallel STT-NA Tracks Session of the EARS RT-04 Workshop, Palisades IBM Executive Center, New York Maamouri, M., Graff, D., and Cieri, C. (2006) Arabic Broadcast News Transcripts. Linguistic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC99L22 Maamouri, M., Buckwalter, T., Graff, D., and Jin, H. (2007) Fisher Levantine Arabic Conversa- tional Telephone Speech. Linguistic Data Consortium, University of Pennsylvania, LDC Cata- log No.: LDC2007S02 Maegaard, B., Damsgaard, J. L., Krauwer, S., and Choukri, K. (2004) NEMLAR: Arabic Language Resources and Tools. In Proceedings of Arabic Language Resources and Tools Conference, Cairo, , pp. 42–54 Makhoul, J., Zawaydeh, B., Choi, F., and Stallard, D. (2005) BBN/AUB DARPA Babylon Levan- tine Arabic Speech and Transcripts. Linguistic Data Consortium, University of Pennsylvania, LDC Catalog No.: LDC2005S08 106 References

Messaoudi, A., Lamel, L., and Gauvain, J. (2004) Transcription of Arabic Broadcast News. In In- ternational Conference on Spoken Language Processing (INTERSPEECH), Jeju Island, Korea, pp. 1701–1704 Messaoudi, A., Gauvain, J., and Lamel, L. (2006) Arabic Broadcast News Transcription using a One Million Word Vocalized Vocabulary. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 1093–1096 Microsoft Innovation Lab, Cairo (2009) Microsoft Maren. URL: http://www.microsoft.com/ middleeast/egypt/cmic/maren/ Nelken, R., and Shieber, S. M. (2005) Arabic Diacritization Using Weighted Finite-State Trans- ducers. Workshop On Computational Approaches To Semitic Languages 5(2), pp. 79–86 Newman, D. (2002) The Phonetic Status of Arabic within the World’s Languages: The Uniqueness of the Lughat Al-Aaad. Antwerp papers in linguistics 100, pp. 65–75 Ney, H., Essen, U., and Kneser, R. (1994) On structuring probabilistic dependencies in stochastic language modeling. Computer Speech and Language 8(1), pp. 1–28 , T., Nguyen, K., Zbib, R., and Nguyen, L. (2009) Improved Morphological Decomposition for Arabic Broadcast News Transcription. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4309–4311 Paulsson, K., Choukri, K., Mostefa, D., DiPersio, D., Glenn, M., and Strassel, S. (2009) A Large Arabic Broadcast News Speech Data Collection. In Proceedings of the Second International Conference on Arabic Language Resources and Tools, Egypt, pp. 280–284 Rabiner, L., and Juang, B. (1993) Fundamentals of Speech Recognition. Prentice Hall, New York Rabiner, L. R. (1989) A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2), pp. 257–286 Razak, Z., Ibrahim, N. J., Idris, M. Y. I., Tamil, E. M., Yakub, M., Yusoff, Z. M., and Rahman, N. N. A. (2008) Quranic Verse Recitation Recognition Module for Support in j-QAF Learning: A Review. IJCSNS International Journal of Computer Science and Network Security 8(8), pp. 207–216 RDI (2007) Fassieh. URL: http://www.rdi-eg.com/ Rybach, D., Hahn, S., Gollan, C., Schluter, R., and Ney, H. (2007) Advances in Arabic Broadcasr News Transcription At RWTH. In Proceedings of Automatic Speech Recognition and Under- standing Workshop (ASRU), Kyoto, Japan, pp. 449–454 The Nemlar project (2005) URL: http://www.nemlar.org/ Sarikaya, R., Emam, O., Zitouni, I., and Gao, Y. (2006) Maximum Entropy Modeling for Diacriti- zation of Arabic Text. In Proceedings of International Conference on Speech and Language Processing INTERSPEECH, pp. 145–148 Schultz, T., and Waibel, A. (2001) Language Independent and Language Adaptive Acoustic Mod- eling for Speech Recognition. Speech Communication 35, pp. 31–51 Stevens, V., and Salib, M. (2005) A Pocket Dictionary of the Spoken Arabic of Cairo. The Ameri- can University in Cairo Press, Cairo Vergyri, D., and Kirchhoff, K. (2004) Automatic diacritization of Arabic for acoustic modeling in speech recognition. In Proceedings of COLING Computational Approaches to - based Languages, Geneva, Switzerland, pp. 66–73 Vergyri, D., Kirchhoff, K., Gadde, R., Stolcke, A., and Zheng, J. (2005) Development of a con- versational telephone speech recognizer for Levantine Arabic. In Proceedings of International Conference on Speech and Language Processing INTERSPEECH, Lisboa, pp. 1613–1616 Waibel, A., Geutner, P., Mayfield-Tomokiyo, L., Schultz, T., and Woszczyna, M. (2000) Multilin- guality in Speech and Spoken Language Systems. Proceedings of the IEEE, Special Issue on Spoken Language Processing 88(8), pp. 1297–1313 Xiang, B., Nguyen, K., Nguyen, L., Schwartz, R., and Makhoul, J. (2006) Morphological de- composition for Arabic broadcast news transcription. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. I, pp. 1089–1092 Yaghan, M. A. (2008) Arabizi: a contemporary style of Arabic slang. Design Issues 24(2), pp. 39– 52 References 107

Yaseen, M., Attia, M., Maegaard, B., Choukri, K., Paulsson, N., Haamid, S., Krauwer, S., Ben- dahman, C., Fersoe, H., Rashwan, M., Haddad, B., Mukbel, C., Mouradi, A., Al-Kufaishi, A., Shahin, M., Chenfour, N., and Ragheb, A. (2006) Building Annotated Written and Spoken Arabic LRs in NEMLAR Project. In Proceedings of International Conference on Language Resources and Evaluation (LREC) Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., and Woodland, P. (1996) The HTK Book. Cambridge University Press, Cambridge Zitouni, I., Olive, J., Iskra, D., Choukri, K., Emam, O., Gedge, O., Maragoudakis, E., Tropf, H., Moreno, A., Rodriguez, A. N., Heuft, B., and Siemund, R. (2002) ORIENTEL: Speech-Based Interactive Communication applications for the Mediterranean and the Middle East. In Pro- ceedings of International Conference on Speech and Language Processing INTERSPEECH, pp. 325–328 Index

A I , 5, 71 IPA, 17, 30, 73, 79, 80

B Bayes rule, 8 L Buckwalter, 25, 72, 87 Levantine colloquial Arabic, 25, 30, 67 Lexicon, 10, 29, 55 C Classical Arabic, 13, 17 Context dependent acoustic model, 9 M Context independent acoustic model, 9 Maximum A-Posteriori, 42, 43 Maximum Likelihood Linear Regression, 42, D 43 Data pooling, 39, 61 MFCC, 7, 34, 57 Diacritization, 3, 19, 20, 25, 53, 55, 71 Modern standard Arabic, 1, 2, 4, 13, 15, 25 Dialectal Arabic, 4, 13, 18, 27, 29, 33, 54, 63 Morphological analyzer, 25 Diglossia, 1 Morphological complexity, 19 Distinct tri-phones coverage, 27 Multilingual acoustic model, 35, 60 E Egyptian Colloquial Arabic, 18, 25, 27, 33–35, N 60, 72 N-gram, 8, 46, 49 Emphatic phonemes, 15, 35

F O Fussha, 15 Out-of-vocabulary, 2, 19, 71

G Gaussian, 57 P Gaussian mixture model, 10, 33, 42, 73 Phoneme set, 30 Grapheme-to-phoneme, 19, 23, 32 Phoneme sets normalization, 36 Graphemic Acoustic Modeling, 53 Phonemic acoustic modeling, 33 Graphemic lexicon, 55 Phonetic transcription, 15, 19, 71

H Hidden Markov Model, 9, 33–35, 42, 55, 56, Q 73 Quran, 17

M. Elmahdy et al., Novel Techniques for Dialectal Arabic Speech Recognition, 109 DOI 10.1007/978-1-4614-1906-8, © Springer Science+Business Media New York 2012 110 Index

R T Real Time Factor, 47 Text-To-Speech, 20 Tied-states, 9, 34, 39, 57, 74 S SAMPA, 15, 17, 20, 26, 73, 79, 80, 87 Spelling variants, 63 U State tying, 9 Unicode, 87 Supervised adaptation, 41, 61 Unsupervised adaptation, 44, 63