cmp-lg/9405025 26 May 94 as has recognized the rst symbol b oth to the time complexityamounts and to much the nondeterminism which space complexity is detrimental b egin with the samegrammar few contains grammars symbols e g vantage algorithm very e cient parsing algorithm related to Tomita s e cient which cannot handle leftoften recursion used and as is an generallyat alternative the less to same top down time TD Since is parsing LC able parsing to is deal a with very left simple recursion parsing technique it and is where of lo okahead can handle the class of LC which In this pap er we relate a number of parsing algorithms puter science has b een used in di erent guises in various areas of com Left corner represent the same sub derivation rithms parsing search NWO under grant p ossible prop erty that theing table namely will how to contain obtain asquestion a which little parsing has entries algorithm come with up as of the others in the areap of ortunity to tabular improve pars some algorithms based on features underlying ideas We show that these Nondeterministic By relating existing ideas we hop e to provide an op next Supp orted by the Dutch Organisation for Scienti c Re AN have tabular theory however step is but A second purp ose of this pap er is to answer a and not LC b een OPTIMAL without predict Deterministic LC parsing with Earley s the and algorithms A parsing developed which Introduction many rules empty string LC algorithms are based on the same which Abstract all the University of Nijmegen Department of parsing is algorithm b ecomes 1 aforementioned is j p ossibility orooiveld Toerno include and in a 2 parsing whose X very j a noticeable of such an the After deterministic TABULAR parallel It di erent that right hand foundation of a strategy has k an grammars rules two Mark Jan one algorithm LC k when markjan cs kun nl areas symbols it will entries parser disad which algo sides This ED the of Nijmegen The Netherlands Nederhof PARSING tive in case we havethe e g left hand the sides rules of the rules are the same neous pro cessing where again then a In case the stackthe algorithm is grammar very can nondeterministic b e for handled almost deterministically of sub derivations that each state representsalization the computation This of can a bnumber e number of informally explained entries by increases the the the fact number of states may lead to an increasing as a tabular realisation ofpaths nondeterministic LR parsing allows sharing of computation b etween di erent search cussed parsing algorithms mentioned ab ove have not b een dis each tabular parsing which canand b e ELR describ parsing ed are as obvious follows solutions to a problem of lead purp ose symbols realisation tions b etween these algorithms then rithm represent b een describ ed ingorithm their from own the right existingof literature tabular but parsing they algorithms havemon and not pre xes a parallel parsinggorithms al capable of simultaneous pro cessing ELR of and all common pre x com CP therefore parsing go which one are step al further and discuss extended LR of its tabular realisation A rst attempt to solve this problem is to use predic certain At A To In LR second to PLR parsing will not improve the general parsing it this the For example Tomita s algorithm can b e seen which together e cient is PLR of o ccurring Computer Science b est an grammar p oint inevitable using this purp ose algorithm working overlapping p owerful also ELR and CP parsing are the foundation is not the empty string but now of parsing pap er we in tabular a of a common pre x the If the number of states o ccur parse on the use however of this ALGORITHM that author s is the algorithms working existing the parse table of the tabular re parsing to as PLR table collection at stack term make pap er entries A sophistication some parsing knowledge where literature on algorithms of state explicit is in p oint a of the to 1 stack the the provided that allows to sub derivations and e ciency show that original indicate some the is increased the parse parse on B The there which However provided simulta connec A a various states stack table main table algo is For CP We the in B 2 a
notation of rules A A with the same which may lead to work b eing rep eated during parsing
1 2
lhs is often simpli ed to A j j Furthermore the parse forest a compact representa
1 2
A rule of the form A is called an epsilon rule tion of all parse trees which is output by a tabular
algorithm may in this case not b e optimally dense
We assume grammars do not have epsilon rules unless
stated otherwise
We conclude that we have a tradeo b etween the case