TUGboat, Volume 20 (1999), No. 2 119
Language Support
Typesetting Bengali in TEX Anshuman Pandey 1 Introduction
The Bengali (or B¯a˙ngl¯a bAÅÐA) script is one of the thirteen primary scripts used throughout India. Like other Indic scripts, the Bengali is derived from the ancient Brahmi script. The script is intimately tied with the Bengali language, which according to the latest data from Ethnologue, is currently the fourth most spoken language in the world with roughly 189 million speakers [2]. The language is spoken mainly in the Indian province of West Bengal and in Bangladesh. Bengali has been the medium for many notable artists, of whom the famous litera- teur Rabindranath Tagore and the great film-maker Satyajit Ray are the best known. 2 The Script Of all the scripts derived from Brahmi, Bengali is most closely related historically to Devanagari. The two scripts share a comparable inventory of conso- nant, vowel, and conjunct characters, however, aside from superficial form and design, the primary differ- ence is the phonetic value assigned to certain char- acters of the Bengali script.1 Like all Brahmi-based scripts, Bengali is techni- cally an alpha-syllabic script. This system is based on the unit of the “graphic” syllable, or aks.ara, which by definition always ends in a vowel. Each basic consonant character in Bengali is understood to represent the consonant modified by the inherent
vowel a, eg. k = ka. When a consonant is modified by any other vowel, the syllable is written using a diacritic form
of the vowel. For example, the syllable k¯a kA is
composed of the consonant ka k and the vowel ¯a A aA (diacritic form: ). Syllables consisting of only a vowel, or with a vowel in word-initial position, are written with the full form of the vowel. A “graphic” syllable consisting of a sequence of consonants is written using a specific conjunct
form, or yukt¯˙ aks.ar ÝÙAÖ . For instance, the con-
· k
sonant cluster kka (k ) is written as a single kk grapheme, , not laterally as . The latter form
1 For further details on Bengali phonology please con- sult Suniti Kumar Chatterji’s Origin and Development of the Bengali Language, Allen & Unwin: London, 1970–72, reprint of 1926 ed. 120 TUGboat, Volume 20 (1999), No. 2
represents kaka,notkka. Such conjuncts are rela- TEX was first introduced to Bengali in 1992 tively easy to decipher because the conjunct retains when Avinash Chopde added a support module for some semblance to the individual elements it com- the HP Softfont ‘SonarGaon’ (sgaon) to his itrans prises. However, with others decipherment is a little package. ‘SonarGaon’ was designed by Anisur Rah-
more daunting because the form gives no indication man in 1990, and was available in three weights:
of its constituents: ks.a (k ) , nga˙ ( ) normal, slanted, and compressed. Although it was
Ø · Ö ² ä,andtra ( ) . a decent Bengali font for the time, it was less than complete in that it lacked several standard ligatures. The subscript character called hasanta (◦ ¬ )is used to indicate an elision of the inherent a from a To amend this deficiency, Muhammad Ali Mas- roor developed a package which supplemented the consonant. For example, k¬ is k,notka. In some in- stances, generally when dealing with poor or limited support for ‘SonarGoan’ in itrans. Called arosgn,or types, the hasanta is used in modifying consonants “Aro SonarGaon” (“more SonarGaon”), this pack- to produce conjuncts. The modified consonant is age extended the ‘SonarGaon’ font by providing in then written laterally with the following consonant a METAFONT source the glyphs missing from the parent font and a set of macros with which to access to produce a simplified ligature. Therefore, may these new extensions. theoretically be written as k¬k, but such practice is rare as traditional Bengali orthography places great The support for ‘SonarGaon’ and arosgn was importance on the proper formation of letters. later dropped from itrans partially because the font was retracted from the public domain. The other 3 Short History of Bengali Typesetting factor was the development by Shrikrishna Patil of a The development of modern modes of printing and user-defined Type-1 PostScript font called “ItxBen- typesetting Bengali coincided with the assumption gali”. This font was adapted for use with itrans and 2 of government of Bengal by the British East India replaced ‘SonarGoan’. Company in 1772. Seeing the need to educate its Last, but not least, is a program called ‘Bengali officials in the vernacular, the British sought to pro- Writer’ developed by Abhijit Das (Barda).3 ‘Ben- vide a means by which to expedite such instruction. gali Writer’ is a text editor for the X11 windows The task was taken up by Nathaniel Brassey Hal- system which allows the user to type Bengali doc- hed who wrote a book titled A Grammar of the uments and save them in TEX format. Documents Bengal Language. Printed in 1778 at Hoogly (near savedinTEX format are then to be used with TEXin Calcutta), this was the first book containing Ben- conjunction with the ‘Bengali Writer TEX Interface’ gali characters printed with movable types. These (bwti) package. movable types were cast by Sir Charles Wilkins. The bwti package consists of a beautiful Bengali Wilkins, considered the pioneer of Indic type- METAFONT and a set of macros which facilitate the setting, not only instituted the mechanical print- inputting of Bengali text. The input system is not ing of Bengali, but even trained native technicians extremely fluid and has low readability. It appears and motivated them to apply their skills to other to be a verbatim reflection of the font encoding, and Indic scripts as well. One of Wilkins’ students, a conjuncts are defined as macros. For example, with man by the name of Panchanana Karmakar, eventu- bwti a word like digbija˙ya idijÝ must be input ally brought forth a large inventory of Bengali types as idi\gb jy instead of digbija.ya as with the which led to further advancements in the printing of bengali package. Bengali [5]. 5Thebengali Package 4 Bengali Fonts and Packages for TEX The Bengali for TEX (bengali) package is housed The typesetting of Bengali gained another big boost on CTAN in the language/bengali/pandey/ direc- when it was introduced to electronic typesetting and tory. This package differs from those described in publishing. Initially, due to the complex nature of the above section in that it: Bengali conjuncts and the intricate design elements 1. provides a means aside from itrans and bwti of of the basic characters themselves, it was rather dif- typesetting Bengali ficult to find a computer font containing a complete 2. provides a simple, stand-alone preprocessor in- set of conjuncts for the script, and/or a typeface terface which was aesthetic in its display of the intricate 2 and complex glyphs. These and other Indic language packages are described in the article “An Overview of Indic Fonts for TEX” [4]. 3 For more information on ‘Bengali Writer’ and related utilities see http://www2.csa.iisc.ernet.in/~abhij/bwti/.
TUGboat, Volume 20 (1999), No. 2 121 ² 3. implements the ‘Velthuis’ transliteration scheme With a few exceptions — namely kra , tra ,
(see Section 5.2 for details) and bhra ×, and variants of these — Table 2 does not Y 4. provides a simple, macro-based method of de- show conjuncts formed with ra Ö or ya ,asmost limiting Bengali text, similar in form and style ligatures containing these two elements are produced to other Indic script packages like Devanagari in a relatively static vertical or lateral fashion. In
for TEX the case of ra,thera-phala ( ),¯ or the ligature form
5. complies with the New Font Selection Scheme of ra is placed beneath the consonant: Ô¯ pra and
(NFSS) ׯ sra.Inthecaseofya,theya-phala ( ) is placed × 6. incorporates the latest version of the ‘Bengali’ after the consonant: Ô pya and sya.
METAFONT developed by Abhijit Das (Barda) Additionally, when ra Ö is the initial element in a consonant conjunct, it is written as a superscript 5.1 The ‘Bengali’ Font diacritic called repha. For instance, the word karma
Das designed his ‘Bengali’ METAFONT after study- is written kÑÆ karma, with the r above the ma. ing the various types employed in the modern print- As there are no character primitives in the bn ing of Bengali books in India. Das emphasized that font, producing conjuncts containing non-traditional he did not model the font after the design and form phonemes is a problem. A few of these have been of any particular typeface, nor did he employ any in- accounted for, but there may be cases which have struments in its development, but relied heavily on been overlooked. One solution is to use the hasanta crude approximations made by the naked eye. Upon to explicitly produce such characters. If the prepro- close inspection, the characters of Das’s METAFONT cessor detects a conjunct it does not recognize it will adhere to the traditional Bengali orthographic style ‘create’ one by joining the full sizes with a hasanta. found in many printed books, and rival many mod- The result is not pretty, but it is a solution! ern Bengali typefaces of the highest quality. A few subtle kerning adjustments were needed The beautiful ‘Bengali’ (bn) font is currently to correctly align the placement of vowel diacritics available in two shapes: normal (bnr)andslanted below consonants. The preprocessor manages such (bnsl). It contains an almost complete inventory kernings and therefore manual adjustments are not of Bengali vowels, consonants, numerals, diacritics, needed. and punctuation marks. The font does lack three Since the font contains several complex liga- rare characters: ¯r (long vocalic ‘r’), l (vocalic ‘l’, tures which will produce rather unsatisfactory out- ˚ ¯˚ put at small sizes such as 8pt, font magnifications resembling the numeral 9: 9), and l (long vocalic ‘l’).4 ˚ below 10pt are not supported. Therefore attempts Nonetheless, the font contains the entire reper- to use bn in footnotes and with such NFSS com- toire of traditional Bengali conjunct and ligature mands as \small will result in a message from TEX forms, and also a few extra ligatures used in writing complaining about a missing metric file. Perhaps a loan words. In addition to the conjunct forms given solution is possible in the near future. in Table 2, Bengali has a few special consonant- 5.2 Transliterated Input
vowel ligatures formed from the vowel u ¤, and in Bengali text is entered in transliteration. Each char-
one case the vowel r ¦. These are illustrated in the chart below: ˚ acter in the Bengali syllabary has been assigned a
corresponding value based on the Roman alphabet,
· ¤ h · ¤ g gu hu
or a combination of signs from the Roman alphabet.
· ¤ Ø · Ö · ¤ ³
Ö ru tru The author has adapted the ‘Velthuis’ translitera-
· ¥ Ò · Ø · ¤ üÙ
Ö ruu ntu tion scheme for the bengali package.
Ë · ¤ Ð · g · ¤ ã
"su lgu The ‘Velthuis’ scheme was developed in 1990
· ¦ × · Ø · ¤ þ h h.r stu by Frans Velthuis as a means of providing translit- 4 The existence of these characters in the traditional Ben- erated input for his Devanagari for TEX package, or gali syllabary is debatable. Although The Unicode Standard, devnag. As the Bengali syllabary resembles the De- Version 2.0 does not place the long forms of these characters vanagari, the ‘Velthuis’ scheme is perfectly suitable in the traditional ordering, it does reserve the positions they would occupy were they to be included in the proper arrange- to transliterate Bengali (and, for that matter, any ment. Nor does it list them under the heading ‘Independent other Indic script). It was necessary to add a few Vowels’, but places them instead under ‘Generic Additions.’ extensions to the scheme, otherwise it has remained Unicode also gives the diacritic form of these characters [7]. largely unchanged from the original. The scheme as However, the ISO draft standard 15919 for the transliteration of Indic scripts does show the two long forms in the tradi- modified for Bengali is given in Table 1. tional arrangement [6]. 122 TUGboat, Volume 20 (1999), No. 2
stops sonorants vowels
cv cv cv
a k
ka ka Y ya ya a a
à aA ◦ A
kha kha Ý ya˙ .ya ¯a aa
¢ i◦ g
ga ga Ö ra ra i i
G £ ◦ Á
gha gha Ä ra ~ra ¯ı ii
¤ ◦ Ù Ð
Õ na˙ "na la la u u
¥ ◦ Í b
˚
ja ja Ë ´sa "sa e e
© E◦ Â
ª e◦ A É
na˜ ~na × sa sa o o
« e◦ Ç Ì .ta .ta h ha ha au au
Z .tha .tha numerals D d. a .da special characters cv
Ê .ra Ra cv
¼ X
d. ha .dha 0 0 Å m. .m anusv¯ara
½ Î
.rha Rha 1 1 Û m˙ ~m candrabindu
¾ Æ
¿ ¬
Ø ta ta 3 3 & hasanta
4 B
Þ tha tha 4 4 t T khan. d. ata
5 º
d da da 5 5 . | d¯a˙mr.i 6
Ü dha dha 6 6 7
Ò na na 7 7 8
b ba ba c Ú bha bha CSX+ 8-bit scheme v ‘Velthuis’ 7-bit scheme Ñ ma ma
Table 1: Inventory of Bengali Characters
TUGboat, Volume 20 (1999), No. 2 123
· k Ø · Þ ® Ñ · Ú · Ö Þ
k kk tth mbhr
k · Ì Ø · Ò Ñ · Ñ ¯ ß
k.t tn mm
° à · Ø Ø · b Ñ · Ð
k kt tb ml
k · Ñ Ø · Ñ Ð · k ± á
km tm lk
² â · Ö Ø · Ö Ð · g
k kr tr lg
ú ä · Ð Þ · b Ð · Ì
k kl thb l.t
k · b d · g Ð · D ð å
kb dg l.d
ñ æ · F d · G Ð · Ô
k k.s dgh lp
ò ç · F · Æ d · d Ð · b
k k.s.n dd lb
· F · Ñ d · Ü ó Ð · Ñ è
k k.sm ddh lm
ô é · × d · b Ð · Ð
k ks db ll
g · Ü d · Ú Ë · c õ ê
gdh dbh "sc
· Ò d · Ú · Ö ö Ë · C ë
g gn dbhr "sch
÷ ì · Ñ d · Ñ Ë · Ò
g gm dm "sn
g · Ð Ü · Ò Ë · Ñ ø í
lm DZ dhn "sm
· b Ü · b ù Ë · Ð î
g gb dhb "sl
· Ò à Ò · Ø ü Ë · b ï
G ghn nt "sb
á þ ð · k Ò · Ø · b F · k
Õ "nk ntb .sk
· k · F â Ò · Ø · Ö ÿ F · k · Ö ñ
Õ "nk.s ntr .skr
Õ · Ã Ò · Þ F · Ì Ä ò
"nkh ã nth .s.t
Õ · g Ò · d F · Z Å ó
"ng ä nd .s.th
å û ô · G Ò · d · b F · Æ
Õ "ngh ndb .s.n
Õ · Ñ Ò · Ü F · Ô Æ õ
"ngm æ ndh .sp
ç Ç ö · c Ò · Ò F · f
c cc nn .sph
c · C Ò · b F · Ñ È ÷
cch è nb .sm
é É ø · C · b Ò · Ñ × · k
c cchb nm sk
c · É Ò · × × · k · Ö Ê ù
c~n ê ns skr
ë Ë ú · j Ô · Ì × · k · Ð
j jj p.t skl
j · j · b Ô · Ø × · Ã Ì û
jjb ì pt skh
í Í ü · Â Ô · Ò × · Ì
j jjh pn s.t
· É î Ô · Ô Î × · Ø ý
j j~n pp st
j · b Ô · Ð × · Ø · Ö Ï ÿ
jb ï pl str
É · c Ô · × × · Þ Ð
~nc ps sth
· C ¡ f · Ð Ñ × · Ò
É ~nch phl sn
¢ Ò · j b · j × · Ô
É ~nj bj sp
É · Â b · d × · Ô · Ð Ó
~njh £ bd spl
¤ Ô · Ì b · Ü × · f
Ì .t.t bdh sph
Ì · b b · b × · b Õ
.tb ¥ bb sb
¦ Ö · D b · Ð × · Ñ
D .d.d bl sm
Æ · Ì Ú · Ö × · Ð ×
.n.t § bhr sl
· Z ¨ Ú · Ð Ø h · Æ
Æ .n.th bhl h.n
· D © Ñ · Ò Ù h · Ò
Æ .n.d mn hn
· Æ ª Ñ · Ô Ú h · Ñ
Æ .n.n mp hm
· Ñ « Ñ · f Û h · b >
Æ .nm mph hb
¬ Ü } · Ø Ñ · b h · Ð
Ø tt mb hl
Ø · Ø · b Ñ · Ú Ê · g Ý ttb mbh Rg
Table 2: Supported Bengali conjuncts
124 TUGboat, Volume 20 (1999), No. 2
Тeb eÑAÖ kAYƸ keh ×ÆA Öib
ek {\bn ke la{}ibe mor kaarya, kahe sandhyaa rabi
jgB Öeh iÒ¬Ö Cib º
iÒÝA "suni.yaa jagaT rahe niruttar chabi |
Ô¯dÁÔ iCи e× kihи AiÑ
ÑAiÌÖ maa.tir pradiip chila, se kahila, sbaami
eYÌÙkÙ ×AÜ kiÖb ØA aAiÑ º
aAÑAÖ aamaar ye.tuku saadhya kariba taa aami |
ÖibůÒAÞ ZAkÙÖ ß -- rabindranaath .thaakur}
Figure 1: A Poem by Rabindranath Tagore
Bengali does not distinguish between ba and va, these Bengali documents should have the extension and collapses both into ba. For this reason, as a gen- .bn in the file name. This lets the preprocessor know eral rule, all words containing va should be translit- that the file is associated with it. erated as ba. However, when parsing conjuncts the Additionally, any shape- or size-changing com- preprocessor will accept va in place of ba,sothe mands such as \large or \slshape should follow
word AiÑ may be transliterated as either svaami the \bn macro. Otherwise, the declaration of the or sbaami. If the preprocessor detects va outside of default font as bn will be over-written by the NFSS a conjunct, it will assume that the Assamese va is macro’s default declaration of the Computer Mod- intended, and will produce an undesired output. ern fonts. The result will be a jumble of characters, Another significant point is the use of {} to not the intended Bengali. break the lexical scan. This is important when two short vowels are encoded in succession. The brack- 5.4 The Preprocessor ets will prevent the preprocessor from interpreting Once the Bengali text has been transliterated, the
the two short vowels as a diphthong. For example, file is run through a program called a preprocessor. EÐeb compare Тeb la{}ibe with laibe. A preprocessor is the ideal method for enabling the The above two points are illustrated in Figure 1. typesetting of such complex scripts as Bengali be- As indicated in the figure, the transliterated input cause it presents the user with a simple interface for text is placed within the scope delimited by {\bn character input. The user has only to be concerned and }. This allows the preprocessor to locate and with the transliteration, because the preprocessor properly translate the input to the appropriate font will manage the conversion of the input text into character codes. character codes with which TEX is familiar. A preprocessor called beng has been developed 5.3 The Macros for use with the bengali package. It is a small pro- The initialization for the bengali package is managed gram written in C and based in function on the by the beng style file. The style defines: preprocessor for Charles Wikner’s sanskrit package. 1. the macro \bn as the delimiter for Bengali text. The syntax for its use is: The definition of \bn sets the current font to the bn family, initializes bn for use with NFSS, beng infile[.bn][outfile[.tex]] and provides an appropriate \baselineskip so there is adequate clearance between lines for super- and subscript characters. where infile is the name of the Bengali document. 2. a bengali counter which may be used to pro- The .bn extension is optional, as is the target out- duce Bengali numerals for page numbering and put file. By default the preprocessor will name the enumerated environments. output file the same as the input, but with the ex- An idiosyncracy of the \bn macro is that a curly tension .tex. brace must immediately precede it and that a space Running the preprocessor without any argu- must immediately follow it: {\bn .Otherwise,the ments invokes the interactive mode. The input and preprocessor will return an error stating it was un- output filenames must then be manually entered at able to locate a valid Bengali delimiter. the prompts. The version number and other infor- The document containing Bengali text is to be mation can be obtained from the preprocessor by considered as any other TEX file. As Bengali text is invoking it with -h. The author’s email address is delimited by {\bn text },anyotherTEX macros and also given in case any problems arise with the use of packages may be used in the document. However, the program. TUGboat, Volume 20 (1999), No. 2 125
5.5 A Note on Hyphenation 7 What’s Next? There is no tradition of hyphenating words in In- 7.1 Refining the font dian orthography. In manuscripts, the use of hy- In addition to creating a suitable boldface, Das and phens is non-existent, and in early printed mate- the author have discussed replacing the explicit con- rials, hyphenation was applied arbitrarily between junct glyphs with character primitives. One advan- any aks.ara-s when the end of a line was reached. tage of using primitives is the opening up of several Nowadays, printers are more keenly aware of word- positions for other characters. The disadvantage breaks at the end of lines, and attempt to maintain is replicating traditional ligature forms with prim- syntactic sense when applying hyphenation. itives: ligatures can be formed laterally or by juxta- Unfortunately, such “intelligent” hyphenation position, but this method serves an injustice to the cannot be produced with the bengali package. When aesthetic of the Bengali script. the preprocessor converts transliterated input into Depending on the number of character positions internal character codes, the output deviates sub- available after the font has been revised, new char- stantially from what TEX would consider a ‘word’. acters may be introduced to the repertoire. Possible However, if by chance a word is broken at the end additions may be the characters ¯r, l,and¯l, along of a line, TEX will produce a hyphen because the with their diacritic forms (described˚ ˚ in further˚ de- bn font possesses a hyphen character at the stan- tail in Section 5.1). Other possible inclusions may dard position. A hyphen may otherwise be encoded be the currency signs given in Unicode. within a given text simply by typing the hyphen Taco Hoekwater has generously offered to pro- character: -. duce Type-1 versions of the bn font. Once the font 6 Support for Assamese is stabilized, the conversions will be performed and made available as part of the bengali package. The script used in the far eastern Indian province of Assam is nearly identical to the Bengali script. 7.2 Uniting babel and bengali
The Assamese (or Asam¯ı˙ya a×ÑÁÝ) script (also the Jun Takashima has developed hyphenation patterns name of the associated language) differs from the for Sanskrit and Kannada to be used in conjunc- Bengali in the design of two consonant characters, tion with Johannes Braams’s babel package. As the although the correspondence between pronunciation lexica of both Kannada and Bengali are heavily in- and script is also different in a number of respects fluenced by Sanskrit, it is feasible that either the
between Assamese and Bengali. These two charac- Sanskrit or Kannada hyphenation pattern may be È ters are ra Ä and va . adapted for use with the Bengali language. These characters were not part of the original Such an adaptation would require an input en- bn font developed by Das. The author has created coding scheme compatible with the babel conven- them based on the design found in Halhed’s Bengali tion. One idea could be to modify the beng prepro- grammar [3] and the descriptions given by Banerji cessor to convert the ‘Velthuis’ transliterated input [1]. to this babel-compliant scheme. It is interesting that these two characters are used in Halhed’s grammar. Their appearance in the 7.3 Printing Dates in Bengali
book implies that they were commonly used in Ben- A new method for printing dates in Bengali needs Ö
gali orthography, and that the switch from Ä to to be developed. This new method would print the b and from È to must have occurred within the past date according to both the Western and traditional 200 years. Also, it is probably during this time that Bengali calendars. Both styles are commonly used these earlier forms were restricted to the Assamese throughout Bengali-speaking regions of the world. script. Of the two, the Western style is easier to im- A group led by Jugal Kalita is believed to be plement. As is expected, the macro does 5 \today working on an Assamese package. However, no in- not produce the correct result in the Bengali envi- dication of progress has been posted at the project’s ronment. To overcome this, Masroor, who also de- site, and as of yet, nothing has been released. veloped the arosgn package, wrote a LATEX package called bngtoday, intended for use with itrans.6 This package provides the macro \BanglaToday which gives the current Western date transliterated into
Bengali. For example, June is simply J¯un jÍÒ. 5 Details are available at http://www.acsu.buffalo.edu/ ~talukdar/assam/language/assamlang.html. 6 On CTAN at language/bengali/bngtoday.sty 126 TUGboat, Volume 20 (1999), No. 2
The traditional Bengali calendar is quite differ- References ent from the Western and therefore requires a bit [1] Banerji, R. D. The Origin of the Bengali Script. more work for correct implementation. Due to the Nababharat Publishers: Calcutta, 1973. Reprint arrangement of the Bengali calendar, Bengali and of 1973 1st ed. Western months overlap. Thus, a given Western [2] Grimes, Barbara F. [ed]. Ethnologue: Languages month may be known by two different names in Ben- of the World. Summer Institute of Linguistics, gali. For example, June may either be Jyais..tha Ejó ¯ Inc. Dallas, Texas, 1996. [Note: Information or As¯ar aA×AÊ, depending on whether the first or . on most widely spoken languages may be found second half of the month is being referred to. Fur- at http://www.sil.org/ethnologue/. The list thermore, Ejó may also refer to the last half of was last updated in February 1999.]
May, and aA×AÊ to the first half of July. Additionally, in the Bengali calendar the first [3] Halhead, Nathaniel Brassey. A Grammar of the four days of the month have special names, and or- Bengal Language. Printed at Hoogly, Bengal, dinal numbers used for days of the month also have 1778. distinct forms. The new package would automate [4] Pandey, Anshuman. “An Overview of Indic the calculations needed to produce the date accord- Fonts for TEX”, TUGboat, 19(2), 1998. pp. 115– ing to these conventions. 120. [5] Priolkar, Anant K. The Printing Press in India: 7.4 Implementing CSX+ in bengali Its Beginnings and Early Development [Being CSX+, or Classical Sanskrit eXtended+, is an 8- a quatercentenary commemoration study of the bit encoding scheme which parallels the convention advent of printing in India (in 1556)]. Marathi adopted for the ISO Committee Draft 15919 [6]. Samshodhana Mandala: Bombay, 1958. Contrary to what its name indicates, CSX+ uni- [6] Stone, Anthony [ed]. ISO Committee Draft formly supports all Indic scripts. The CSX+ scheme 15919: Transliteration of Devanagari and Re- is modelled after IBM Code Page 437 and occupies lated Scripts into Latin Characters. Avail- characters in the Upper ASCII block. Table 1 shows able at http://ourworld.compuserve.com/ the CSX+ scheme as it pertains to Bengali. homepages/stone_catend/trdcd1c.htm. Currently the beng preprocessor only recognizes [7] The Unicode Consortium. The Unicode Stan- 7-bit input in the form of the ‘Velthuis’ transliter- dard, Version 2.0. Addison-Wesley Developers ation scheme. The author intends to extend the Press: Reading, Massachusetts, 1997. preprocessor to recognize the 8-bit CSX+ encoding scheme as well. Anshuman Pandey 8 Notes University of Washington Department of Asian Languages The author was informed of a book by Fiona G. E. and Literature Ross titled The Printed Bengali Character (Curzon: 225 Gowen Hall, Box 353521 Richmond, 1999) which provides lucid and detailed Seattle, WA 98195 information on Bengali orthography and typogra- [email protected] phy. The author was strongly encouraged to review http://weber.u.washington.edu/ Ross’s work, but unfortunately, a copy of the book ~apandey/ could not be acquired in time. It is hoped that this does not diminish the force of the article.