<<

Bijlage Werkgroep 13: The right of ij to be a 37

Werkgroep 13: Nederlandstalige TEX TheRightofijtobeaLigature David van Leeuwen

[email protected]

Ever since caligraphy (beautiful writing) is used as a alphabet is pronounced the same way! Sometimes, the way to express ones thoughts writers (and later prin- ij is treated as a single letter. When one capitalizes a ters) were concerned about how certain letters clashed. word starting with an ij, both will be written in upper In fact, people thought of ways to camou¯age these case, as in `IJmuiden' (impossible to pronounce by non collisions, and they had at least two methods for doing Dutchmen, by the way). In some cases the ij leads

so: One is known as , the slightly shifting the to inconsistencies: some alphabetization programs use ::: letters with respect to one another, the other is the use the :::xyz-alphabet, others the xyijz-alphabet. of ligatures, the replacement of the characters by one One of the worst things of running an English Xver- single but bigger symbol. What combinations of letters sion with Dutch text, is that words with an ij are very were ugly enough to qualify for a ligature has proba- likely to break between the two characters, which is the

bly always been in¯uenced by fashion. Everyone who

i

worst choice. So having a pattern 4 in the \pattern has an eye for typesetting must have noticed how in sequence is the change that has the highest yield in im- the earlier centuries manuscript had this weird `ct' li- proving the English version for Dutch. Well, nowadays gature. Although our current Roman style of printing there are commercial and non-commercial hyphenation looks in some ways much like the older handwritings, a patters for Dutch available, so that problem has been ct-ligature seems completely super¯uous. On the other solved. Somewhat more delicate is the fact that we hand with computer typesetting systems like T Xnow E would prefer to position the i and the j just somewhat readily available, people seem to have a tendency to go closer together than cm-fonts would naturally do. So a overboard in `inventing' new ligatures. kerning of one k# between the i and j would considera- The computer roman family of fonts, designed by Do- bly improve the readability of Dutch texts. nald E. Knuth after the fonts used in the ®rst editions On this subject, whether the ij deserves a kerning or of his books on Computer Algorithms, is strongly pro- even a ligature and more importantly how this should vided with -ligatures. The fonts appear a little `old', be implemented on TEX systems, there has been quite because most books and newspapers are printed no- some discussion at the Dutch TEX bulletin board. Ge- wadays in the popular Times-Roman set of characters. nerally people agree that the ij has a special status, but These seem to need the f-ligatures much less, although how we should deal with it is not a question that is there might be a correspondence between the decline solved. Roughly there are two camps. The ®rst thinks of the number of skilled lead-typesetters and the use that everything should be done using the standard set of ligatures. Now the fonts of characters of the com- of cm-fonts and a huge amount of fairly dif®cult TEX- puter modern family are based on English/American source spaghetti, so that everybody can use its standard use, where these f-combinations appear strikingly of- TEX-setup. The second camp has the opinion that spe- ten. But in other languages different letter combinati- cial fonts should be used that respect the ij as a ligature, ons will occur frequently enough to apply for a ligature. so that typists need no special training for identifying ij'. The difference may be clear: the ®rst group makes My knowledge of languages is very poor, the only one it easier for system operators, the second group for the I can say meaningful things about is Dutch. Dutch is users. Moreover, the ®rst solution is much more `inter- language with lots of `open' vowels. So many, that we national' than the second one, while that will run faster come short of double combinations ee, aa, uu,andoo and have better readable source texts. and need to use ei, ij, eu, ui for typically Dutch sounds. (Of course, we do have the `normal' a, e, i, and u, The best thing to do would probably be using virtual and we use ie for a , but the vowel is not used fonts. My own knowledge about it is almost nothing in normal Dutch words.) An odd thing to a foreigner (I don't have any written paper on virtual fonts nor do might be that we use the consonant j in a vowel combi- I have any access to one), but I understand that they nation. To make things even stranger, the pronunciation come in two parts: a .tfm ®le that instructs TEX, and an of the ij is identical to that of ei, and the letter y in the other ®le that tells a dvi driver how to compose charac-

Dutch TEX Users Group (NTG), P.O. Box 394, 1740 AJ Schagen, The Reprint MAPS#7 (91.2); Nov 1991 38 Werkgroep 13: The right of ij to be a Ligature Bijlage K ters from which .pk ®les. In any case, this means that does not recognize the double- opening quote º every font that is speci® to Dutch needs font space in (which the font does contain), which the German and TEX. For plain-TEX users this is not a large problem, Dutch wouldliketo use. What is the use of an ij-symbol nor will singly Dutch LaTEX documents be hard to if it is not recognized as an ij-ligature? process. But multilingual LaT users should have to E Re-reading the discussion on the ij-ligature I see that enlarge T X's font space, since LaT X uses almost all of E E Gerben Wierda and Johannes Braams have reached a the standard capacity of T X's font memory, and if we E similar conclusion much earlier than me, and Johannes were to double the amount of fonts this would become (and also Yannis) noticed than non-dutch people would a serious thing. But suppose the system wizards that probably not like an ij-ligature. I am not so sure about ®nd room to store two (or more) sets of hyphenation that. In Dutch, we do not need the f¯-ligature as much patters will also know do deal with the font storage. as the English do, but do we bother to have one? I don't Then a language-change macro would have to do two think the ij combination occurs much in foreign langua- things: change the value of \language and change ges, or that it would matter much that they are kerned the current font. The latter can actually be done if the a little closer. On Johannes' remark that virtual fonts font naming conventions of the foreign country has the would be a solution Nico Poppelier argues that not all same structure as the cm-family. dvi-drivers supports virtual fonts. In some aspect this JÈorg Knappen noticed that a family of fonts exists that is just the same as with 256-character fonts. Some ol-

has this property. These are called the dc-family, pre- der dvi-drivers didn't support 8 bit fonts although even ;x sumably meaning `Deutsche Computer modern'. The TEX 2 did, but since the cm-fonts had no more than fonts were actually accepted by the international TEX 128 characters dvi-driver-makers did not bother to sup- community at the conference in Cork. In order to see if port 8 bit fonts. If dvi-drivers do not yet support virtual these would be suitable for the ij (and correlated) pro- fonts we must keep asking the programmers to update blem(s) I ftp-ed one of them, dcr10 (cf. cmr10), in .tfm their drivers. and .pk form. It consists of 255 characters, among them I think that we must seek for a solution where T X itself ones that I have never seen before, but are claimed to E should take care of ®nding ligatures like ij, double- be used somewhere in Europe (which I have no doubt comma, french quote etcetera. This can be done by about). Most normal Latin letters appear on the same either using virtual fonts and the standard cm family, place as in cmr10, but ligatures, accents en special sym- or by a complete new set of fonts like dcr but with lan- bols are located elsewhere. There are no Greek capitals guage speci®c ligatures. (One could think of making in it. A fairly large amount of the upper 128 characters the `i' an active character, checking for the next to be are letters that could be formed by plain T X with nor- E a `j', but let us call this an academic solution since it mal accents, in this sense the font can be used with 8 bit will not make life very much easier.) Typing "y for a ij input where characters are mapped on identical places, ligature could be acceptable in English texts where the so that the French could use this font directly. The ij ligature hardly ever will be needed, but is completely and IJ appear as one-symbol letters in positions 188 and unacceptable for Dutch texts. It would be comparable 156, but these are not declared in the .tfm ®le as being to asking the English to write "ff whenever the ff li- formed by a ligature ij. So in this sense the dcr10 font gature should be used. Of the two alternatives (virtual merely acts as a bitmap source for the drivers, unless fonts and a different family) the former seems more Dutch keyboards are supplied with an ij-key. In fact, a elegant, using less .pk fonts. On the other hand the dcr10 user could never do without the cmr10 font, be- latter alternative can be more widely be implemented cause some characters are missing. Also the ligatures since not all dvi-drivers accept virtual fonts yet. !` and ?` are not recognized, although the characters appear in the font. For either solution people should get together and de- cide which languages deserve what ligatures. The dc- It is not clear to me how the dcr10 font should be used. family may be a good scheme to start from. For a It does have the f-ligatures and ligatures for the french typical national problem like this international porta- double quotes that can be entered as << and >>,but bility is less important than ease to read and write the when one types a single < one doesn't get the single source texts. french quote (which the font contains) but a smaller-

than sign <, which makes no sense outside mathmode. I hope that this can be a positive contribution to the Also it supports the double quote ligatures ª and º, but question of language dependent ligatures.

Reprint MAPS#7 (91.2); Nov 1991 Dutch TEX Users Group (NTG), P.O. Box 394, 1740 AJ Schagen, The Netherlands