TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 359, Number 2, February 2007, Pages 605–618 S 0002-9947(06)03958-4 Article electronically published on June 13, 2006
GROWTH AND ERGODICITY OF CONTEXT-FREE LANGUAGES II: THE LINEAR CASE
TULLIO CECCHERINI–SILBERSTEIN
To Wolfgang Woess on his 50th anniversary
Abstract. A language L over a finite alphabet Σ is called growth-sensitive if forbidding any non-empty set F of subwords yields a sub-language LF whose exponential growth rate is smaller than that of L. Say that a context-free grammar (and associated language) is ergodic if its dependency di-graph is strongly connected. It is known that regular and unambiguous non-linear context-free languages which are ergodic are growth-sensitive. In this note it is shown that ergodic unambiguous linear languags are growth-sensitive, closing the gap that remained open.
1. Introduction Let L be a language over the alphabet Σ, that is, a subset of the free monoid Σ∗ of all finite words over Σ.Wewriteε for the empty word and Σ+ = Σ∗ \{ε}. For a word w ∈ Σ∗,itslength (number of letters) is denoted by |w|.Thegrowth rate of L ⊆ Σ∗ is the number 1 (1.1) γ(L) = lim sup |{w ∈ L : |w|≤n}| n . n→∞ Also recall that L has exponential growth if γ(L) > 1 and it has sub-exponential growth otherwise, namely, if γ(L)=1. A language L over Σ is growth-sensitive if γ(LF ) <γ(L) for any non-empty F ⊂ Σ∗ consisting of subwords of elements of L,where LF = {w ∈ L :no v ∈ F is a subword of w}. A context-free grammar is a quadruple C =(V, Σ, P,S), where V is a finite set of variables, disjoint from the finite alphabet Σ,thevariableS is the start symbol, and P ⊂ V × (V ∪ Σ)∗ is a finite set of production rules. We write T u or ∗ (T u) ∈ P if (T,u) ∈ P.Forv, w ∈ (V ∪ Σ) ,wewritev =⇒ w if v = v1Tv2 ∗ ∗ and w = v1uv2,whereT u, v1 ∈ (V ∪ Σ) and v2 ∈ Σ .Arightmost derivation ∗ is a sequence v = w0,w1,...,wk = w ∈ (V ∪ Σ) such that wi−1 =⇒ wi (the wi’s ∗ are called sentential forms); we then write v =⇒ w.ForT ∈ V, we consider the ∗ ∗ language LT = {w ∈ Σ : T =⇒ w}.Thelanguage generated by C is L(C)=LS.
Received by the editors November 4, 2004. 2000 Mathematics Subject Classification. Primary 68Q45; Secondary 05A16, 05C20, 68Q42. Key words and phrases. Context-free grammar, linear language, dependency di-graph, ergod- icity, ambiguity, growth, higher block languages, automaton, bilateral automaton.