Backtracking Recursive Descent Parser for the following grammar

Question

I am trying to figure out some details involving parsing expression grammars, and am stuck on the following question:

For the given grammar:

a = b Z
b = Z Z | Z

(where lower-case letters indicate productions, and uppercase letters indicate terminals). Is the production "a" supposed to match against the string "Z Z"?

Here is the pseudo-code that I've seen the above grammar translated to, where each production is mapped to a function that returns two values: the first indicates whether the parse succeeded, and the second gives the resulting position in the stream after the parse.

defn parse-a (i:Int) -> [True|False, Int] :
   val [r1, i1] = parse-b(i)
   if r1 : eat("Z", i1)
   else : [false, i]

defn parse-b1 (i:Int) -> [True|False, Int] :
   val [r1, i1] = eat("Z", i)
   if r1 : eat("Z", i1)
   else : [false, i]

defn parse-b2 (i:Int) -> [True|False, Int] :
   eat("Z", i)

defn parse-b (i:Int) -> [True|False, Int] :
   val [r1, i1] = parse-b1(i)
   if r1 : [r1, i1]
   else : parse-b2(i)

The above code fails when asked to parse the production "a" on the input "Z Z". I believe this is because the parsing function for "b" is incorrect: it greedily consumes both Z's in the input and succeeds, leaving nothing for the trailing Z in "a" to match. Is this what a parsing expression grammar is supposed to do? The pseudocode in Ford's thesis seems to indicate this.
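To make the behaviour described above easy to reproduce, here is a minimal runnable sketch of the same translation in Python. It is an illustration under assumptions, not the code from Ford's thesis: the input is assumed to be a list of token strings passed explicitly (the pseudocode leaves the stream implicit), and the names `eat`, `parse_a`, `parse_b` and `matches_a` are hypothetical helpers chosen for this example.

```python
from typing import List, Tuple

Result = Tuple[bool, int]  # (succeeded?, position after the parse)


def eat(tok: str, tokens: List[str], i: int) -> Result:
    """Succeed and advance by one if tokens[i] == tok; otherwise fail at i."""
    if i < len(tokens) and tokens[i] == tok:
        return True, i + 1
    return False, i


def parse_b(tokens: List[str], i: int) -> Result:
    """b = Z Z / Z, with the alternatives tried strictly in order."""
    ok, j = eat("Z", tokens, i)
    if ok:
        ok, j = eat("Z", tokens, j)
        if ok:
            return True, j  # first alternative matched: commit to it
    # Only reached when the first alternative failed.
    return eat("Z", tokens, i)


def parse_a(tokens: List[str], i: int) -> Result:
    """a = b Z. There is no way to re-enter b and try its other alternative."""
    ok, j = parse_b(tokens, i)
    if ok:
        ok, k = eat("Z", tokens, j)
        if ok:
            return True, k
    return False, i


def matches_a(tokens: List[str]) -> bool:
    """True if production a consumes the whole token list."""
    ok, end = parse_a(tokens, 0)
    return ok and end == len(tokens)
```

With this sketch, `matches_a(["Z", "Z"])` is false because `parse_b` has already consumed both tokens, while `matches_a(["Z", "Z", "Z"])` is true.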

Thanks very much.

-Patrick

rici · Accepted Answer · 2015-03-28T02:39:49.583

In PEGs, disjunctions (alternatives) are indeed ordered. In Ford's thesis, the operator is written / and called "ordered choice", which distinguishes it from the | disjunction operator.

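That makes PEGs fundamentally different from CFGs. In particular, given PEG rules a -> b Z and b -> Z Z / Z, a will not match Z Z: the first alternative of b, Z Z, succeeds and consumes both tokens, and once an alternative of an ordered choice has succeeded the later alternatives are never tried, even if the rest of a then fails.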
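For contrast, here is a minimal sketch of how the same two rules behave when | is read as unordered CFG-style choice with full backtracking. It is an illustration only, not taken from Ford's thesis or from the question: the token-list input and the names `eat_all`, `parse_b_all`, `parse_a_all` and `cfg_matches_a` are assumptions made for this example. Each function is a generator that yields every position at which its rule could end, so the caller can fall back to a shorter match of b.

```python
from typing import Iterator, List


def eat_all(tok: str, tokens: List[str], i: int) -> Iterator[int]:
    """Yield i + 1 if tokens[i] == tok; yield nothing otherwise."""
    if i < len(tokens) and tokens[i] == tok:
        yield i + 1


def parse_b_all(tokens: List[str], i: int) -> Iterator[int]:
    """b = Z Z | Z: yield the end position of every alternative that matches."""
    for j in eat_all("Z", tokens, i):
        yield from eat_all("Z", tokens, j)  # alternative Z Z
    yield from eat_all("Z", tokens, i)      # alternative Z


def parse_a_all(tokens: List[str], i: int) -> Iterator[int]:
    """a = b Z: try the trailing Z after each possible end position of b."""
    for j in parse_b_all(tokens, i):
        yield from eat_all("Z", tokens, j)


def cfg_matches_a(tokens: List[str]) -> bool:
    """True if some combination of alternatives consumes the whole input."""
    return any(end == len(tokens) for end in parse_a_all(tokens, 0))
```

Under this reading, `cfg_matches_a(["Z", "Z"])` is true: the Z Z alternative of b leaves nothing for the trailing Z, so the parser backtracks into b, takes the single-Z alternative, and the trailing Z then matches. A PEG parser never takes that second step.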

edited Mar 28 '15 at 02:39

answered Mar 28 '15 at 02:25

score 0 · Answer 2 · answered Mar 30 '15 at 16:30

Thanks for your reply Rici.

I re-read Ford's thesis much more closely, and it confirms what you said. The PEG / operator is both ordered and greedy, so the rule presented above is supposed to fail on the input "Z Z".

-Patrick

Patrick Li
