Provide a context free grammar for the following languages:

Question

a) L = {1^i - 1^j = 1^(i-j) | i-j>=0, i,j>=0}  
b) L = {a^i b^j c^k | k!=i+j, i,j,k>=0

I can't seem to figure out how to implement these procedures. I understand the concept, I just can't seem to work it out. Any help would be great! I've been staring at this stuff for approximately 6 hours, and have gotten nowhere.

Close voters: I know this question is (a) homework and (b) not a programming problem and (c) lacking any obvious attempt at a solution. However, it comes up over and over again with variations, and I'm trying to provide a canonical duplicate reference. If you feel that this isn't worth the effort, go ahead and close the question. — rici, Oct 17 '15 at 04:19
Yes it is homework, but I honestly have been trying to figure this out. I've made several attempts, but none that pan out. — Tiffany, Oct 17 '15 at 05:18

rici · Answer 1 · 2015-10-17T04:51:56.167

There are really only a few tricks to solving problems like this, and the same ones come up over and over again. However, the only way to really get them into your head is to solve a few problems, so this answer won't actually solve the homework assignment in the OP. I hope it provides some ideas.

Context-free languages can be composed from other context free languages, and it doesn't really require much thought to see how to write the composition once you know what the composition is. In the following examples, I don't distinguish between non-terminals and context-free languages, because a non-terminal actually defines a context-free language. For example, if L₁ and L₂ are two CFGs, with productions

L₁ → …
L₂ → …

then the union L = L₁∪ L₂ is simply:

L → L₁
L → L₂
L₁ → …
L₂ → …

while the concatenation language M = L₁L₂ is:

M → L₁ L₂
L₁ → …
L₂ → …

Now, here are a couple of useful compositions. First, parenthetic balancing. Context-free languages can't count, except that they can count up and then count down. So (ⁿ[^m]^m)ⁿ is a context-free language, and it's easy to see that it can be composed by starting with

L₁ = [^m]^m

and then defining

L₂ = (ⁿL₁)ⁿ

using the following simple productions:

L₁ → ε
L₁ → ( L₁ )
L₂ → L₁
L₂ → [ L₁ ]

I could have used a, b, c and d instead of the parentheses and brackets, but I think the intent is a bit clearer when you use parentheses.

So that's how to do equality counts. What about inequality? Let's start with a very simple language: the Kleene ⁺. a⁺ is "one or more as", and the language is straightforward:

L → a
L → L a

Now consider {aⁿb^m | n ≠ m}. We can rewrite that as a union:

{aⁿb^m | n>m} ∪ {aⁿb^m | m>n}

since if m≠n then exactly one of m>n and n>m must be true.

Now look at {aⁿb^m | n>m}. Since n is strictly greater than m, we can rewrite aⁿb^m as a^m-na^mb^m. But we don't really care what n-m is, just that it's at least one. So we can use the Kleene ⁺ to make a^m-na^mb^m and that is obviously:

L₁ → a
L₁ → L₁ a
L₂ → ε
L₂ → a L₂ b
L → L₁ L₂

The m>n case is very similar, and to get m≠n we just need to find the union of those two languages.

I hope all that was clear.

It's common to provide puzzles like this with odd little algebraic identities. To solve them, you just need to reduce the formulas to the small number of cases shown above, using decompositions like a^i+j = aⁱa^j, which is not exactly rocket science.

For example, another SO question asked about the language {aⁿb^mc^2n+m | n,m > 0}.

Solving it is simple. First, we want to parenthetically match b^m, so we need to rewrite c^2n+m as c^mc²ⁿ. That leaves us parenthetically matching aⁿ with c²ⁿ; to make the two repetition counts the same, we need to change c²ⁿ to ccⁿ.

Having rewritten the original as aⁿb^mc^mccⁿ, it becomes clear that the language is:

L₁ → ε
L₁ → b L₁ c
L₂ → L₁
L₂ → a L₁ cc

which is essentially identical to the parenthetic balancing example way at the top of this answer.

For b) I've come up with: S -> A | X A -> B | C B -> Bc | aBc | C C -> Cc | bCc | λ X -> aY | Yb Y -> aY | Yb | aYb | Z Z -> bZc | λ Does that look right? — Tiffany, Oct 18 '15 at 02:15
@Tifffany: It's clear that Z is b^n c^n. What language is Y? And X? C is b^n c^n c*, but what is B? (By the way, Since B -> C, there is not a lot of point writing both A -> B | A -> C. From A->B and B->C you can derive A->C. That means that A is identical to B, so one of them is redundant.) You should be able to write down the language for each non-terminal, which will make it clear that the end result is (or in this case is not) what you're looking for. — rici, Oct 18 '15 at 06:10

Provide a context free grammar for the following languages:

1 Answers1