Questions tagged [tatsu]

Use the [tatsu] tag for all questions related to the TatSu or Grako parser generators.

TatSu (the successor to Grako) is a tool that takes grammars in a variation of EBNF as input, and outputs memoizing (Packrat) PEG parsers in Python.

TatSu can also compile a grammar stored in a string into a tatsu.grammars.Grammar object that can be used to parse any given input, much like the re module does with regular expressions.

46 questions

vote

2 answers

TatSu: How to optimize the following grammar logic for faster parse time?

I have the following grammar in TatSu. To reduce parse time, I implemented cut operations (i.e., commit to a particular rule option once a particular token is seen). However, I still see long runtimes. On a file with about 830K lines, it takes…

tatsu

asked Nov 19 '19 at 04:37

user4979733

3,181
4
26
41

vote

1 answer

Is there a way to do context sensitive parsing in tatsu

context sensitive '%' ..... eol comments I'm starting with the grammar for PDF described here https://github.com/caradoc-org/caradoc/blob/master/doc/grammar/grammar.pdf which seems to lack the definition of eol comments. PDF has end of line…

tatsu

asked Nov 03 '19 at 09:05

Robin Becker

vote

1 answer

How to include a literal '#' in a Tatsu grammar?

I can't get Tatsu to parse a grammar that includes a literal '#'. Here is a minimal example: G = r''' atom = /[0-9]+/ | '#' atom ; ''' p = tatsu.compile(G) p.parse('#345', trace=True) The parse throws a FailedParse exception. The trace…

tatsu

asked May 30 '19 at 01:28

RootTwo

4,288
1
11
15

vote

2 answers

Alphabetic characters not recognized in tatsu parse

I have defined a very simple grammar, but tatsu does not behave as expected. I have added a "start" rule and terminated it with a "$" character, but I still see the same behavior. If I define the "fingering" rule with a regular expression (digit =…

tatsu

asked Apr 20 '19 at 00:38

David Randolph

vote

1 answer

Tatsu Parsing Performance

I've implemented a grammar in Tatsu for parsing a description of a quantum program Quipper ASCII (link). The parser works but is slow for the files I'm looking at (about 10kB-1MB size, see the resources directory). It takes approximately 10-30…

performance tatsu

asked Feb 27 '18 at 19:30

Eddie Schoute

vote

1 answer

How to get concise syntax error messages from grako/TatSu

If the input to a grako/tatsu generated parser has a syntax error, such as 3 + / 3 to the calc.py examples, one gets a long list of Python calling sequences in addition to the relevant 3 + / 3 ^ I could use try - except constructions but then…

error-handling exception grako tatsu

asked Feb 27 '18 at 09:46

koskenni

vote

2 answers

Is it possible to use a different lexer?

I would like to use a different lexer for tatsu, yet use tatsu's parser. Is this possible? For example, in the grammar: expr = NUM | ID | (expr '+' expr) ; is it possible to use an alternative lexer to provide NUM and ID?

tatsu

asked Jul 20 '17 at 09:35

user2629532

vote

1 answer

Cannot define rule priority in grako grammar for handling special tokens

I am trying to analyze some documents by a grammar generated via Grako that should parse simple sentences for further analysis but face some difficulties with some special tokens. The (Grako-style) EBNF looks like: abbr::str = "etc." |…

python grammar grako tatsu

asked Dec 22 '16 at 10:47

voidpointercast

votes

1 answer

How to use #include in TatSu grammar files?

The #include pragma with relative path does not work. With a grammar file containing ... #include :: "secondary.ebnf" and code to compile it with open("/full/path/to/main.ebnf") as source: psr = tatsu.compile(source.read()) I'm getting…

tatsu

asked Aug 31 '23 at 09:58

volferine

votes

2 answers

Matching the hash character in Tatsu

I am getting an exception attempting to parse the # character using Tatsu: import tatsu grammar = r''' @@comments :: // @@eol_comments :: // start = '#' ; ''' print(tatsu.__version__) parser = tatsu.compile(grammar) ast = parser.parse('#',…

python tatsu

asked Jun 07 '23 at 23:27

Patrick

votes

0 answers

Unexpected output from TatSu parser

The below TatSu grammar (TatSu 5.8.3, Python 3.11) creates an unexpected output from the given input: I expected a nested xxx yy, but the brackets [] are completely ignorded: @@grammar :: Test @@whitespace :: /[\t ]+/ start = script ; script =…

parsing grammar ebnf tatsu

asked Mar 21 '23 at 18:41

Painter

votes

1 answer

Tatsu Parser, unclear why it isn't moving to the next rule in the line?

I am writing a code parser/formatter for a language that doesn't have one, OSTW (Overwatch higher level language for workshop code). So that I can be lazy and have pretty code. I am pretty new to this idea, so if tatsu is a poor choice for this…

tatsu

asked Feb 17 '23 at 20:33

Mriswithe

votes

1 answer

tatsu.exceptions.FailedParse while using a C BNF grammar adapted to Tatsu

tatsu.exceptions.FailedParse: (52:24) expecting one of: "'" '"' : declarator = {pointer}? direct_declarator ; ^ I found a C BNF grammar here:…

parsing lexer bnf ebnf tatsu

asked Dec 24 '22 at 15:54

jokoon

6,207
11
48
85

votes

1 answer

Parsing unique but unordered named blocks

I have a DSL where a file consists of multiple named blocks. Ideally, each block should occur only once, but the order doesn't matter. How do I write a parser that ignores block order, but gives syntax errors if the same block is repeated?

parsing dsl tatsu

asked Aug 11 '22 at 20:07

shader

votes

1 answer

Is there a Tatsu or any PEG-format grammar available for the [g]awk language syntax?

As the subject asks, does anyone know of an existing Tatsu grammar (or at least a PEG-format grammar) for the [g]awk language? I did already browse all existing Tatsu examples that I could find, and searched extensively around the net for any…

awk grammar peg tatsu

asked Mar 28 '22 at 20:50

pjfarley3

Prev 1

3 4 Next