Arto Salomaa
Professor Arto Salomaa received his Ph.D. in 1960 and has b een Professor of Mathe-
matics at the UniversityofTurku since 1965. He has authored more than 400 scienti c
publications in ma jor journals, as well as ten b o oks, some of whichhave app eared also
in French, German, Japanese, Romanian, Russian, Vietnamese and Chinese transla-
tions. Prof. Salomaa holds the degree of do ctor honoris causa at six universities. He
has b een an invited sp eaker at numerous conferences in computer science and math-
ematics, and a program committee member or chairman for ma jor computer science
conferences, including STOC, ICALP,MFCS, FCT. Prof. Salomaa was EATCS Pres-
ident 1979-85 and is currently the editor-in-chief of the EATCS Monograph Series
published by Springer-Verlag, as well as an editor of eight international journals of
Computer Science. Festival conferences \Salo days in Theoretical Computer Science"
Bucharest 1992, \Salo days in Auckland" Auckland, NZ, 1994 and \Imp ortant Re-
sults and Trends in Theoretical Computer Science" Graz, 1994 have b een arranged
in his honor. Prof. Salomaa is a memb er of the Academy of Sciences of Finland, the
Swedish Academy of Sciences of Finland and Academia Europaea 92. Currently,heisa
Research Professor at the Academy of Finland and the head of the Academy's research
pro ject on mathematical structures in computer science.
Events and Languages
Early computer science
It is customary to b egin a pap er with an abstract or a summary. The purp ose of
this article is to tell ab out some people and ideas in the early history of computer
254 Salomaa
science, roughly up to 1970. The article has a very sub jective avor: I will tell
only ab out events and p ersons I have in some sense come into contact with.
Since my p oint of view is over formal languages and automata, the happ enings
and phenomena rep orted in the sequel will b e from these and related areas of
theoretical computer science, rather than from the theory of algorithms and
complexity. The relative imp ortance of the latter areas has increased very much
since the 1970s.
Although there have b een many attempts of uni cation and clari cation,
it is still not at all clear what should be called computer science and what is
theoretical computer science. Opinions and esp ecially p oints of emphasis vary
remarkably from place to place, country to country. Of course the setup was
even less clear and more obscure in the 60s when imp ortant ideas were emerging
or just ab out to emerge. However, by the end of the 60s well-established com-
puter science curricula existed already at numerous universities. An indication
of the ideological unclarity is the Finnish word for \computer science", \tieto-
jenkasittelyoppi", still prevalent in Finland. Literally translated the word means
\discipline of data pro cessing". I opp osed this term very muchin the 60s but
without success. I was in the 60s also in London, Canada, for a long p erio d.
Here are some excerpts from my writing at the UniversityofWestern Ontario,
in connection with a prop osed Ph.D. program in computer science:
Likeanyyoung and growing discipline, Computer Science is far from
b eing saturated and, therefore, one should b e prepared to add new ar-
eas to the program. When launching the program, one should consider
simultaneously with the areas of study also the availability of sta mem-
b ers in these areas. ::: Formal language and automata theory is in a
rather central p osition in every Ph.D. program in Computer Science I
know of and, thus, I don't see to o much harm in this resp ect in a con ict
with other universities in the province. ::: The main part of computer
combinatorics would consist of the theory of graphs and networks but
also other combinatorial problems with applications in computers could
b e included. ::: Appliedareas could include some of the following: infor-
mation retrieval, symb olic computation, arti cial intelligence, theorem
proving by computer, pattern recognition, learning theory CAI, simu-
lation, organization and design of systems for sequential, concurrent and
time-shared pro cessing b oth on line and o line, design of system comp o-
nents for assembly and compilation, computational linguistics. Any list
like this is op en-ended and in selecting the areas the availability of sta
should b e kept in mind. In all of the suggested areas, co op eration with
other departments is p ossible and sometimes even necessary. Some such
departments could be: mathematics recursive function theory, combi-
natorics, applied mathematics numerical analysis, linguistics natural
language asp ects of formal languages, electrical engineering, library sci-
ence.
That is my authentic writing from 1968, mayb e still preserved in some les
at the University of Western Ontario. I would still agree with some of it but
Events and Languages 255
obviously many things in it have b ecome obsolete. However, I do not wantto
dwell on this topic longer; no nal answers can b e given anyway.
Formal languages: Are they formal and are they languages?
Grzegorz Rozenb erg { I will talk more ab out him later { published a pap er
\Decision problems for quasi-uniform events" which had b een presented to the
Bulletin of the Polish Academy of Sciences by Andrzej Mostowski on August 7,
1967. In this pap er we read:
Let = f ; ;:::; g be a nite nonempty alphabet xed for
1 2 n
all our considerations. The elements of are called symbols. Finite se-
quences of symb ols are called tapes. The length of any tap e x is de ned
to be the number of letters in the tap e and is denoted by lgx. The
set of all tap es over including the empty sequence is denoted by
T . We shall identify tap es of length 1 with symb ols of .If x and
y are tap es, then xy denotes the concatenation of x and y . ::: If P; Q
are subsets of T , then P; Q are called events.For arbitrary events P
and Q, P [ Q is de ned as the set union, the complex pro duct PQ as
1 k 0
PQ = fxy : x 2 P; y 2 Qg, and P as [ P , where P = f0g and
k =0
i+1 i
P = P P i 0. It is obvious that T = . ::: An event P is
fundamental if there exists a subset B of such that P = B .
Everything is very well presented, mayb e some of the terms sound strange
nowadays. Indeed, formal languages were earlier mostly called events. I used
the latter term in all of my rst pap ers. The double meaning of the word \event"
is re ected also in the title of this writing. Apparently, the usage of the word
\event" in this sense stems from Kleene, [20]. I switched to \languages" in 1966{
67. \Events" do not o ccur any more in my rst book \Theory of Automata"
which was written in 1966{67 but, due to my complete ignorance ab out the
publishers, was delayed for more than twoyears by the publisher.
But do es it make more sense to call subsets of \formal languages" than
\events"? Probably not. Most of the subsets lackany form whatso ever { how
could they b e called \formal"? And most of them do not serveany purp ose in
any kind of communication { how could they b e called \languages"?
Let us go backto the quoted passage of Rozenb erg, [13]. It is very mathe-
matical in avor. Moreover, the in uence of the great Polish scho ol of logicians
b ecomes apparent in statements such as \We shall identify tap es of length 1
with symb ols of ". In general, most of the early investigators of computer sci-
ence, no matter how the term is understo o d, were mathematically trained and
oriented. Thus it is very natural that typical mathematical topics, suchasthe
e ect of certain well-de ned op erations on languages, were studied early in the
game. This tradition has continued, at least to some extent. Currentinvestigators
should not just continue the tradition for its own sake. They should rather make
sure that their topic is of some practical imp ortance, or else really signi cant
mathematically.
256 Salomaa
I met a few times the aforementioned Andrzej Mostowski, a famous repre-
sentative of the Polish scho ol. He wore very elegant suits at conferences, quite
unlike the casual so-and-so attire currently in use. He was also p olite and consid-
erate. He was chairing a session in 1962, where I gave a talk ab out in nite-valued
truth functions. I was somewhat nervous, and apparently this was noticed by
Mostowski. There were some simple questions after my talk but Mostowski did
not want to embarrass me with his dicult question. Instead, he came to me
much later and asked \I would like to knowhowyou actually obtain the decid-
ability".
In the early days of formal language theory, the motivation and the b elief
in the usefulness of the theory were quite strong. Let us lo ok at some examples
from well-known sources. They tell also what di erent p eople thought language
theory was ab out. Noam Chomsky, [3], wrote in 1959:
A language is a collection of sentences of nite length all constructed
from a nite alphab et or, where our concern is limited to syntax, a nite
vo cabulary of symb ols. Since any language L in whichwe are likely to
be interested is an in nite set, we can investigate the structure of L only
through the study of the nite devices grammars which are capable of
enumerating its sentences. A grammar of L can b e regarded as a func-
tion whose range is exactly L. Such devices have b een called \sentence-
generating grammars". A theory of language will contain, then, a sp eci-
cation of the class F of functions from which grammars for particular
languages maybe drawn. The weakest condition that can signi cantly
b e placed on grammars is that F is included in the class of general, un-
restricted Turing machines. The strongest, most limiting condition that
has b een suggtested is that each grammar b e a nite Markovian source
nite automaton. The latter condition is known to b e to o strong; if F
is limited in this way, it will not contain a grammar for English. The
former condition, on the other hand, has no interest. We learn nothing
ab out a natural language from the fact that its sentences can b e e ec-
tively displayed, i.e., that they constitute a recursively enumerable set.
The reason for this is clear. Along with a sp eci cation of the class F
of grammars, a theory of language must also indicate how, in general,
relevant structural information can b e obtained for a particular sentence
generated by a particular grammar.
This can b e viewed also as a motivation and background argument for the
Chomsky hierarchy { of course there are other sources in this resp ect more
comprehensive than [3]. By mid-60s the hierarchywas pretty well established;
the following writing by Ginsburg, [7], emphasizes the interconnection b etween
ALGO L and context-free languages. In fact, the latter were often called ALGO L-
like languages.
As is well-known, the context free languages are excellent approx-
imations to the syntactic comp onents of currently used programming
Events and Languages 257
languages such as ALGO L. Pushdown automata abbreviated pda
are devices used in parsing programming languages, for the most part in
compiling. These two comp onents are linked by the result that a set of
words is a context free language if and only if it is accepted, i.e., recog-
nized by some pda.To implement the construction either by hardware
or by software and the usage of pda, it is imp ortant to have general
classes of pda with particular prop erties. For example, it is convenient
to discuss deterministic pda since they parse rapidly. In the same spirit,
we investigate the class of pda having the prop erty that the length of
the pushdown tap e alternatively increases and decreases at most a xed
b ounded numb er of times during anysweep of the automaton. Such pda
reject words faster than an arbitrary pda.
In baseball you sometimes face a situation: all or nothing. Bases loaded,
b ottom of the ninth inning, two out. An "-small additional push on your high
ball can bring it within the reach of a sp ectator this happ ened to the Yankees
in the 1996 playo s: homerun from the hands of an out elder. Baseball b eing
the most American game, such an all-or-nothing attitude has b een visible in
the writings of many Americans concerning formal languages and related areas.
The quoted passage by Ginsburg is very optimistic: \excellent approximations of
programming languages", \implement the construction either by hardware or by
software", etc. Many similar examples of such a motivational \all"-mo o d, to b e
contrasted with the present \nothing"-mo o d, can b e given from the 60s. I just
mention pap ers by Greibach and Hop croft, [10], and Greibach and Ginsburg,
[9], the latter propagating in the Journal of the ACM the nowadays esoteric
theory of multitap e AFA abstract families of automata.
By the end of the 60s, language theory had b ecome already a very big eld.
A bibliographyby DerickWo o d, [21], from 1970 contains close to one thousand
items, although it misses most of the Russian literature, whichwas quite exten-
sive and also go o d in the early days. Some of the key p oints of language theory
are brought home by following excerpts from well-known early authors. Eachof
the excerpts is of an intro ductory character. We b egin with Michael Harrison,
[11]:
Formal language theory was rst develop ed in the mid 1950s in an
attempt to develop theories of natural language acquisition. It was so on
realized that this theory particularly the context-free p ortion was quite
relevant to the arti cial languages that had originated in computer sci-
ence. Since those days, the theory of formal languages has b een develop ed
extensively, and has several discernible trends, which include applications
to the syntactic analysis of programming languages, program schemes,
mo dels of biological systems, and relationships with natural languages.
In addition, basic research into the abstract prop erties of families of
languages has continued.
Ron Bo ok, [2], follows the same general line, emphasizing esp ecially parsing and compiling:
258 Salomaa
Formal language theory is concerned with problems of sp eci cation
of sets of strings of symb ols, i.e., languages, and prop erties of these sets.
Mathematical mo dels are constructed with the goal of faithfully repre-
senting prop erties of the languages involved and of devices which gener-
ate, accept or p erform op erations on languages. The languages studied
arise from problems in abstract automata theory, in programming lan-
guage and compiler theory, and in linguistics particularly, mathematical
mo dels for natural languages. ::: An extremely active area of formal lan-
guage theory involves the construction of mo dels for programming lan-
guages and for translator systems. Mo dels are constructed in attempt to
formalize the compiling pro cess and to formalize various parsing meth-
o ds, and resource requirements for parsing are considered.
The next twointro ductory excerpts are from Aho and Ullman, [1], and my
own \Formal Languages", [18], resp ectively:
The theory of languages is concerned with the description of lan-
guages, their recognition and pro cessing. A language may contain an
in nite numb er of strings, so at the least, one needs a nite description
of the language. Particular typ es of nite descriptions will yield use-
ful prop erties of the languages they de ne, esp ecially when the class of
languages they de ne is \small" i.e., not every language of conceivable
interest is de ned. If C is the class of languages de ned by a certain
typ e of description, one would liketo know whether memb ership in C
is preserved under various op erations. One would like to know that the
languages in class C could b e recognized quickly and simply, esp ecially
if one were attempting to develop a compiling system for a language or
languages in C . Also useful are characterizations of languages in C ,so
one can tell easily if a given language is in class C . Finally, one wants
algorithms, if they exist, to answer certain questions ab out the languages
in C .
A language, whether a natural language such as English or a pro-
gramming language suchas ALGO L, can b e considered to b e a set of
sentences, that is, nite strings of elements of some basic vo cabulary.
The de nition of a language given b elow is based on this notion and is,
consequently, general enough to include b oth natural and programming
languages. The syntactic sp eci cation of a language with nitely many
sentences can b e given, at least in principle, by listing the sentences. This
is not p ossible for languages with in nitely many sentences. The main
problem of formal language theory is to develop nite representations
for in nite languages. Such a representation may b e accomplished bya
generative device or by a recognition device. By imp osing restrictions on
the devices, di erent language families are obtained.
Events and Languages 259
Instead of trying to comment and develop further the many issues raised in
the ab ove quotations the authors themselves might not agree any more with
many p oints, I am taking a direct leap to the present. A summary of my cur-
rent views is contained in the following excerpt from the Preface to the recent
\Handb o ok of Formal Languages", [15]:
The theory of formal languages constitutes the stem or backb one of
the eld of science now generally known as theoretical computer science.
In a very true sense its role has b een the same as that of philosophy
with resp ect to science in general: it has nourished and often initiated a
numb er of more sp ecialized elds. In this sense formal language theory
has b een the origin of many other elds. However, the historical devel-
opment can b e viewed also from a di erent angle. The origins of formal
language theory,aswe knowittoday, come from di erent parts of hu-
man knowledge. This also explains the wide and diverse applicabilityof
the theory.
Thereafter, various origins of language theory are discussed: mathematics,
linguistics, diverse mo deling situations. The reader is invited to read more ab out
them in the Handb o ok itself !
Rational theories and irregular b ehavior
Regular languages or events are called rational by the French scho ol. This
two-fold terminology makes much sense. The languages come from strictly nite
computing devices, so they should b e very regular indeed. The term \regular"
was apparently coined by Kleene. That regular languages are the same as rational
ones can be argued on b oth linguistic and mathematical grounds. The Latin
words \regula" and \ratio" are not so far apart as regards their meaning and
usage. Regular languages emerge as supp orts of rational power series, where
\rational" is understo o d in the mathematical meaning of rational functions.
Terminology aside, the theory of regular languages constitutes nowadays a
comprehensive blo ck of knowledge. The theory is pleasing in the sense that prac-
tically everything is decidable. However, the theory ab ounds with challenging
problems. You are almost sure that your problem, or at least a closely related
problem, has b een settled somewhere in the early literature but cannot nd
the place. Let us take an example. Ab ove, at the b eginning of Section 2, I men-
tioned quasi-uniform events but did not tell what they are. They are catenations
R R :::R of regular languages R , where each language with an o dd index is
0 1 2i
a singleton letter and each language with an even index is fundamental that is,
equals B for some subset B of the alphab et. Given a regular language, howdo
you decide whether or not it is a quasi-uniform event? The problem is apparently
decidable but some details are not so clear. We are dealing with the p ossibility
of a representation of a certain typ e, where some of the B -parts may reduce to
the emptyword. Think ab out it! You may also consult the already quoted pap er
by Rozenb erg, [13], if you can get hold of it.
260 Salomaa
My own acquaintance with regular languages { they were exclusively called
\events" that time { stems from a time ten years prior to the publication of the
quoted pap er [13] ab out quasi-uniform events. During the spring 1957 I was a
student in John Myhill's seminar in Berkeley, where topics were chosen from the
newly app eared red-cover Princeton b o ok Automata Studies, [20]. For instance,
the Kleene basic pap er ab out \the representation of events in nerve nets" is in
this b o ok. Myhill gave also some lectures himself. He was very impressive, to
say the least. But he was also out of this world; o ccasionally he was taken to a
sanatorium. His b ehaviour could b e very irregular. Once he did not showupat
all. We started searching him and found him in a wrong ro om. He was lecturing,
the b oard was already half-full, but he had apparently not noticed that there
was no audience!
Myhill's sp each was not easy to understand, at least not for me. It was
referred to as the \Birmingham accent". My work in his seminar, carried out with
Howard Jackson, was ab out self-repro ducing automata. The automata worked in
the plane, where necessary comp onents, suchaspower supplies, were randomly
scattered around. The nal construction was quite detailed, with instructions
for each sp eci c con guration, a welding op eration and so forth. Myhill was
quite pleased and started to talk ab out a publication. But he had his go o d days
and bad days. The day when wewere supp osed to talk ab out the details of the
publication happ ened to b e a bad one, and so nothing came out of the matter.
During one of his lectures Myhill formulated a theorem ab out regular events.
Apparently nob o dy quite understo o d what the statementofthe theorem was.
At the end of the lecture, Myhill asked us to prove the converse of the theorem
by the next time. He was then very upset when nob o dy had done it; apparently
nob o dy knew what was to b e shown. Later on I found out that the theorem and
its converse constituted what is now known as the Myhill-Nero de Theorem.
It was not b efore 1963 that I regained myinterest in regular languages. I just
attacked some of the problems that were around in the literature { initially I
had nob o dy in Turku to talk with ab out such problems. Those days the Russian
literature ab out regular events was very go o d { p erhaps b etter than the western
literature. My b o ok [17] contains the relevant references. Many of the problems
investigated were closely connected with switching circuits and mightnow seem
somewhat unusual, out of place or esoteric. I will mention some of the problems
Iwas working with in this area in the rst half of the 60s.
Take a regular language R . Following [17], I denote by w R the small-
est number of states in any deterministic automaton accepting R , and by
w R the smallest number of nal states in any deterministic automaton ac-
1
cepting R . For instance, if R is the nite language consisting of the words
k
x ;x x ;x x x ;:::;x x x :::x , where the words x ;:::;x are nonempty,
1 1 2 1 2 3 1 2 3 k 2 k
then a di erent nal state is needed for each of the k words, implying that
w R =k . A language R is termed irreducible if w R equals the cardinality
1 k 1
of the language. Clearly, all languages R are irreducible. It is a bit more dif-
k
cult to see that there are no other irreducible regular languages, [17]. There
certainly are other irreducible nonregular languages. Then one starts to talk
Events and Languages 261
ab out automata with in nitely many states. In the 60s considerations were of-
ten extended to nonconstructive cases, for instance, in [6] and [17]. In the early
pap ers dealing with w R and w R itwas not at all clear that b oth numb ers
1
are actually achieved in the same minimal automaton. No trade-o is p ossible:
w R cannot b e made smaller by allowing w R to b e larger than the minimal
1
value. I develop ed a metho d, based on a maximal usage of the distributivelaw,
to reduce a nite language into a normal form from which b oth of the numb ers
w R and w R can b e readily seen. The details are explained in [17].
1
I did also some work concerning languages over the one-letter alphabet. Many
of the p eople interested in nite automata were also working with probabilistic
automata. This was certainly true of myself and, as a result, [17] contains some
40 pages ab out probabilistic automata. Nasu and Honda had given an example of
a nite probabilistic automaton accepting a non-context-free language, although
all the probabilities and the cut-p oint for the acceptance are rational numb ers.
Could the same b e done for one-letter languages? The answer is p ositive but the
automaton must have at least three states. These problems led me to consid-
erations dealing with rational p ower series and characteristic values of p ositive
matrices, very similar to the ones I needed a decade later for growth functions
of L systems.
A problem of sp ecial interest concerned the characterization of equations b e-
tween regular expressions. As long as you are dealing with union and catenation
only,everything is clear: you just take the appropriate asso ciative, commutative
and distributive laws. The problems b egin with the Kleene star. How do you
characterize the relations arising from comp ositions of all three op erations, es-
p ecially in view of the fact that in the general case you need unb oundedly many
nested stars? Even some relatively simple equations, suchas
a [ ab [ ba =ba [ a ab a
might b e hard to deduce from any set of basic equations you have in mind.
In fact, V. N. Redko from Ukraine showed in the early 60s that no nite
set of equations can serve as a basis for all p ossible equations. The core of the
argument dealt with equations of the form
2 n 1 n
a = [ a [ a [ ::: [ a a ;
valid for all n.From the p oint of view of automata, the equation tells you howto
make a lo op longer. Let n assume prime values in such equations. Then one can
argue that in nitely many equations are needed to generate all of them b ecause,
for any prime p, equations with primes smaller than p are unable to pro duce the
equation with p.
But could some rule stronger than the customary logic of equations do the
trick? I was working with the following rule for \solving equations". Assume
that ; ; are regular expressions and that the language denoted by do es not
contain the emptyword. The latter condition can b e expressed purely syntac-
tically, following the recursive de nition of a regular expression. Then from the
equation = [ you may conclude the equation = .
262 Salomaa
Now Redko's problem do es not arise b ecause the desired equation may be
concluded from the equation
n 2 n 1
a = a a [ [ a [ a [ ::: [ a ;
the latter, in turn, b eing a consequence of the basic expansion a = a a [ ,
characterizing the Kleene star.
The validity of the rule is a straightforward matter. The assumption concern-
ing the emptyword is necessary b ecause, otherwise, the solution is not necessarily
unique. I was able to show in 1964 that, using the rule mentioned, all equations
can b e deduced from 11 simple equations. Essentially, the result is the same as
that by John Brzozowski concerning the niteness of the numb er of derivatives.
Arguments similar in spirit were given also by V. Bo dnarchuk, M. A. Spivak
and S. Aanderaa. References and details can b e found in [17]. Regular expres-
sion equations have later on turned out to b e rather imp ortant and have p opp ed
up in various algebraic and semantic considerations. For instance, John Conway,
Robin Milner, Vaughn Pratt and Dexter Kozen have made imp ortant contribu-
tions in the area. Even at the time of this writing, I am handling the refereeing
of a pap er, where a problem stemming from [17] is solved.
When I was working with axiom systems for regular expression equations,
I did not yet know p ersonally anybody in the area. I was corresp onding with
Bob McNaughton, Azaria Paz and Patrick Fischer ab out these questions. When I
moved to Canada in 1966, I very so on met John Brzozowski, Neil Jones, Michael
Harrison, Azaria Paz and many others.
Stars and Schutzenb erger
Since the foremost gure of the French scho ol, Marco Schutzenb erger, passed
away in 1996, I feel it very appropriate to discuss in a separate section some
matters connected with him. This ts very well also as a continuation of the
discussion concerning regular languages. I already said that if you see some fresh-
lo oking challenging problem ab out regular languages, you can be almost sure
that the problem has already b een settled somewhere. If you ask a Frenchman,
you very often get the answer that the solution is at least implicitly presented
in such and such old pap er bySchutzenb erger.
As a p erson Schutzenb erger was most remarkable and memorable. I always
had the feeling that I lagged two steps b ehind in understanding his arguments
and jokes. He was laughing very much, and often his laughter gave out a diab olic
impression. Yet I got convinced that he wasavery considerate p erson.
For the rst time I met Schutzenb erger in March 1971 in Paris. We discussed
only technical matters. Schutzenb erger acted as an \informant" for Samuel Eilen-
b erg. He wanted to know all kinds of things ab out ! -words, most of which I could
say nothing ab out.
Later on I met him at numerous conferences. Schutzenb erger was also an
invited sp eaker at the Turku ICALP.Hewas in an esp ecially go o d mo o d during
a 1975 spring scho ol ab out syntactic monoids in Vic sur Cer. Because of some
Events and Languages 263
reason, he did not attend the lectures in the ro om itself but was lying in the
corridor and shouting his comments through the op en do or.
Schutzenb erger's lectures were by no means easy to follow, certainly not
for me. Seymour Ginsburg put it as follows. \It do esn't make any di erence
whether Marco lectures in English or in French. I don't understand it anyway!"
But it made a di erence. Usually at a conference, if Schutzenb erger gave his
talk in French, so did also all the other French participants. This happ ened at
an IBM conference in Madrid. Again Seymour was ready to advise me. \The
French give their talks in French, the Spanish in Spanish. Why don't you talk in
Finnish?" That's what I did. But it was only a ten-minute p ortion of my lecture
and, b esides, quite an indep endent topic. Yet it was hard to get through at all,
b ecause the protests from the huge audience grew enormous.
Schutzenb erger could make his p oint. It was told that he submitted the
same pap er to an American conference in three consecutiveyears. The technical
content remained the same, only the background material and, most imp ortantly,
the references were changed. At the third try, the pap er was nally accepted.
Schutzenb erger commented that the Catholic Church has much fairer metho ds
and p olicy than American program committees, b ecause the former tells op enly
who the saints are at any given moment of time.
It takes a lot of work to read through Schutzenb erger's pap ers { but it is
usually worth the trouble. The argumentation is very compact yet reliable. The
author do es quite little to aid the reader. Most of the time, the latter has to
rethink the pro of. Or mayb e he hasn't if his thoughts do not lag two steps
b ehind. I will illustrate this with the basic notion from the solution of the
star height problem. In doing so I will compare the original pap er [5] with the
exp osition in my b o ok [17], where an interested reader will nd all the details.
In fact, this is one of the many star height problems that were around in the
60s { some of them remain still unsolved. The star height of a regular expression
is the maximum numb er of nested stars in it. To get the star height of a regular
language, you take the most economical regular expression for the language, that
is, a regular expression with the least p ossible star height. Sometimes you can
make rather surprising reductions. For instance, the star heightof the regular
language ab c is 1 rather than 2, b ecause the same language is denoted also
by the regular expression [ ab [ ca c.
Are there languages over the binary alphab et fa; bg with an arbitrary given
star height? A p ositive answer was given in [5] already in the middle 60s. In
fact, there is also a somewhat older unpublished pap er by McNaughton. What
would a language of star height q b e, given q 1?
Schutzenb erger de ned such a language M as follows. Let h b e the morphism
q
q q
of fa; bg into the additive group f0; 1;:::;2 1g of the integers mo dulo 2