jflex-users Mailing List for JFlex (Page 6)

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

Peter,

thanks for the answer.

I already tried that (including \r\n to allow for multi-line) and basically that 
is working fine. The problem is, that the input string "PRIMARY KEYS" will 
return the wrong tokens: the first one being "PRIMARY KEY" and second one "S" - 
which is not correct.

I tried adding a whitespace after the last word "PRIMARY"{WS}"KEY"{WS} but then 
a token at the very end of the input string is not recognized, so I tried adding 
a word boundary "PRIMARY"{WS}"KEY"\b but that doesn't work either. It seems that 
the generated scanner handles the \b differently then Pattern class from Java.

Regards
Thomas

Peter L. Bird wrote on 09.07.2009 01:37:
> Thomas,
> 
> Here are pattern definitions that will get you a multi-word keyword to 
> be processed as a single token:
> 
> WS = [ \t]+
> 
> primaryKeyword = "PRIMARY"{WS}"KEY"
> 
> 
> This won't allow the keyword to be split across newline boundaries.  To 
> support that, you may need to use exclusive states.
> 
> peter
> 
> 
> At 05:32 PM 7/8/2009, Thomas Kellerer wrote:
>> Hi,
>>
>> I'm using a modified version of Stephen Ostermillers SQL Lexer that is 
>> part of
>> his syntax highlighting package (http://ostermiller.org/syntax)
>>
>> I want to add keywords that consists of more than one word e.g. 
>> "PRIMARY KEY"
>>
>> Currently the lexer definition contains something like this:
>>
>> keyword=("SELECT"|"UPDATE"|"DELETE")
>>
>> (apparently a very simplified version)
>>
>> Now I'm trying to add e.g. "PRIMARY KEY" as a single keyword to this 
>> list, but
>> the following will not work properly
>>
>> keyword=("SELECT"|"UPDATE"|"DELETE"|"PRIMARY KEY")
>>
>> as there may be any number of whitespace between PRIMARY and KEY and e.g.
>> PRIMARY    KEY would not be recognized by the Lexer. According to the
>> documentation (and my tests) the following will not work as well:
>>
>> keyword=("SELECT"|"UPDATE"|"DELETE"|"PRIMARY[ \t\r\n]+KEY")
>>
>> because the escape syntax is not interpreted in quoted strings.
>>
>> Leaving out the quotes is not working correctly either as e.g. PRIMARY 
>> KEYPOINT
>> will recognize PRIMARY KEY as a token as well.
>>
>> So my question is: how do I need to define multi-word keywords so that
>> whitespace is accepted but "partial" matches are processed correctly?
>>
>> I'm sorry, I'm using this somehow as a black-box as I'm not really 
>> experienced
>> with building lexers and all the theory behind it. So I'm maybe doing 
>> it the
>> wrong way or missing something very obvious.
>>
>> Any help is greatly appreciated
>>
>> Thanks
>> Thomas
>>
>>
>>
>> ------------------------------------------------------------------------------ 
>>
>> Enter the BlackBerry Developer Challenge
>> This is your chance to win up to $100,000 in prizes! For a limited time,
>> vendors submitting new applications to BlackBerry App World(TM) will have
>> the opportunity to enter the BlackBerry Developer Challenge. See full 
>> prize
>> details at: http://p.sf.net/sfu/Challenge
>> -- 
>> jflex-users mailing list
>> https://lists.sourceforge.net/lists/listinfo/jflex-users
> 

2001	Jan	Feb	Mar (2)	Apr	May	Jun	Jul (1)	Aug (5)	Sep (1)	Oct (5)	Nov	Dec (6)
2002	Jan (3)	Feb (12)	Mar (14)	Apr	May	Jun	Jul	Aug	Sep	Oct (3)	Nov (3)	Dec (6)
2003	Jan (8)	Feb (5)	Mar (7)	Apr (2)	May (5)	Jun	Jul (5)	Aug (4)	Sep (7)	Oct	Nov (21)	Dec (7)
2004	Jan (6)	Feb (5)	Mar	Apr (1)	May (10)	Jun (1)	Jul	Aug (1)	Sep (4)	Oct	Nov (2)	Dec (2)
2005	Jan (13)	Feb (2)	Mar (6)	Apr (4)	May (2)	Jun	Jul (4)	Aug (12)	Sep (3)	Oct (6)	Nov (1)	Dec
2006	Jan (7)	Feb (3)	Mar (11)	Apr (5)	May (1)	Jun (2)	Jul (2)	Aug	Sep (13)	Oct	Nov (3)	Dec (6)
2007	Jan (1)	Feb (4)	Mar (2)	Apr	May (4)	Jun (11)	Jul (2)	Aug (4)	Sep	Oct	Nov	Dec (2)
2008	Jan (1)	Feb (4)	Mar (7)	Apr	May (8)	Jun (1)	Jul (2)	Aug (4)	Sep (3)	Oct	Nov	Dec
2009	Jan (3)	Feb (10)	Mar (6)	Apr	May (6)	Jun (8)	Jul (7)	Aug	Sep	Oct	Nov (3)	Dec (4)
2010	Jan	Feb	Mar	Apr (15)	May	Jun (7)	Jul	Aug (5)	Sep	Oct	Nov	Dec
2011	Jan	Feb	Mar	Apr (7)	May (2)	Jun	Jul (2)	Aug (4)	Sep (3)	Oct	Nov	Dec
2012	Jan	Feb (1)	Mar (3)	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2013	Jan (2)	Feb	Mar	Apr	May (2)	Jun (2)	Jul	Aug (6)	Sep	Oct	Nov (3)	Dec
2014	Jan (8)	Feb (3)	Mar (5)	Apr	May (7)	Jun (1)	Jul	Aug	Sep	Oct	Nov (4)	Dec
2015	Jan (2)	Feb	Mar (3)	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov (2)	Dec
2016	Jan (1)	Feb (3)	Mar (3)	Apr (2)	May (7)	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2017	Jan	Feb (1)	Mar	Apr	May (1)	Jun	Jul	Aug	Sep (1)	Oct	Nov	Dec
2019	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct (4)	Nov	Dec (1)

jflex-users Mailing List for JFlex (Page 6)

The fast lexer generator for Java

jflex-users — from users for users, help, problems, discussions