
Parsing very large tokens

Zemantics - 2013-09-15 (updated 2013-09-16)
  • Zemantics - 2013-09-15

    First of all, thanks for a great project. I've written a Turtle file parser using QUEX. In an edge case, the tokens in a Turtle file could become very large, even gigabytes, if someone wanted to store a BLOB or something like that. Is there a way of parsing such a token in smaller pieces, e.g. 4 KB at a time, like a stream, to avoid exhausting internal memory?


    Last edit: Zemantics 2013-09-15
  • Frank-Rene Schäfer

    Can you try to use the 'accumulator' in a dedicated mode?
    You might have to break your token pattern into three:

      BEGIN, REPEAT and/or END
    

    Then, for example:

      mode GENERAL : {
          {BEGIN}        { GOSUB(EAT); }
      }
      mode EAT : {
          {REPEAT}       { self_accumulator_add(Lexeme, LexemeEnd); }
          \Not{{REPEAT}} { GOUP(); }
      }
    

    or, replace REPEAT with \Not{END}.
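
    To keep memory bounded, which was the original concern, the {REPEAT} action can flush the accumulator as a partial token once roughly 4 KB have been collected, so the BLOB reaches the consumer as a stream of chunk tokens rather than one giant lexeme. Below is a minimal sketch for a Turtle long string ("""..."""), not from the thread itself: it assumes quex's accumulator calls self_accumulator_add()/self_accumulator_flush(), the LexemeL length macro, and the body/init class-extension sections; the token ids BLOB_CHUNK and BLOB_END are made up for the example.

      token {
          BLOB_CHUNK;    /* hypothetical: one ~4 KB piece of the large token */
          BLOB_END;      /* hypothetical: final piece, closes the large token */
      }
      body {
          size_t  blob_byte_n;   /* bytes accumulated since the last flush */
      }
      init {
          self.blob_byte_n = 0;
      }
      define {
          TRIPLE_QUOTE   "\"\"\""
      }
      mode GENERAL : {
          {TRIPLE_QUOTE}  { GOSUB(EAT); }   /* BEGIN of a long string literal */
      }
      mode EAT : {
          {TRIPLE_QUOTE} {                  /* END: flush the remainder */
              self_accumulator_flush(QUEX_TKN_BLOB_END);
              self.blob_byte_n = 0;
              GOUP();
          }
          [^"]+|"\"" {                      /* REPEAT: content, incl. lone quotes */
              self_accumulator_add(Lexeme, LexemeEnd);
              self.blob_byte_n += LexemeL;
              if( self.blob_byte_n >= 4096 ) {   /* ~4 KB: emit a partial chunk */
                  self_accumulator_flush(QUEX_TKN_BLOB_CHUNK);
                  self.blob_byte_n = 0;
              }
          }
      }

    On the receiving side the chunks can then be written out as they arrive instead of being concatenated, along the lines of the usual quex receive loop (qlex being the generated analyzer instance, write_chunk() a placeholder for whatever sink you use):

      quex::Token*  token_p = 0x0;
      do {
          qlex.receive(&token_p);
          switch( token_p->type_id() ) {
          case QUEX_TKN_BLOB_CHUNK:             /* partial content: stream it */
          case QUEX_TKN_BLOB_END:               /* last piece of the BLOB */
              write_chunk(token_p->get_text()); /* placeholder sink */
              break;
          default: break;
          }
      } while( token_p->type_id() != QUEX_TKN_TERMINATION );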

