Thread: Re: [CEDET-devel] semantic lexer for python
From: <pon...@ne...> - 2002-05-29 08:48:48
Hi Richard & Eric,

[...]

> Yes in most cases. In Dave's code however, INDENT tokens
> may be generated by the empty string at the beginning of
> lines without leading white spaces!

That is what I tried to achieve ;-)

> Also a key difference between INDENT and whitespace tokens is
> that Dave's INDENT token does not consume any input
> characters! Dave stuck in an entry in the middle of `cond'
> clauses that may generate INDENT tokens, but it does not
> move the current point. There is no infinite recursion,
> because the cond clause Dave added always evaluates to `nil'
> so that it goes on to the next cond clause *always*. I had
> to look at the code for a couple of minutes before I
> understood what was going on. Completely legal code, but
> unusual use of the `cond' form. I have no problem with the
> code so long as we add a comment in capital letters what is
> going on.

You're right! It is an unusual use of `cond' that should be
emphasized. Unless you (or Eric) have a better idea on how to
implement that ;-)

> Despite the fact that Dave turned on both
> semantic-flex-enable-indents and
> semantic-flex-enable-whitespace in his sample code, the two
> are independent features, i.e., they can be turned on/off
> independently. I say this, because my first concern when I
> saw Dave's code was that whitespace tokens need to be turned
> on to turn on the INDENT tokens. After studying his code,
> they seem to be independent.

Yes, these two options are completely independent. In my example I
just wanted to illustrate that INDENT tokens just match the empty
string at the beginning of a line and don't prevent handling white
space if needed!

[...]

> If speed becomes an issue, it may make sense to implement
> part of semantic in C. I don't know that we have reached
> that point yet with regard to python.

[...]
I don't know python at all, but it seems that its design clearly
exhibits the limits of Emacs, which is designed to work well with
language syntaxes based on classic parenthesized block structures.
Probably because of the Lisp inheritance ;-)

So, in the case of python, I think it will be difficult for
semantic-flex to easily produce the nice 'semantic-list tokens needed
to recursively parse sub-parts of code. Those tokens allow a simple
but general, robust and efficient mechanism to skip code with invalid
syntax without breaking the parser or cluttering up the (LALR) grammar
with a lot of error recovery rules that are difficult to tune.

I agree with Eric that syntax tables are mainly oriented toward
navigation, particularly through parenthesized blocks of code. So, in
the case of python, because of the above orientation (limitation?), it
will be nearly impossible to use powerful navigation tools like
`up-list', `down-list', etc., heavily used by the semantic-ctxt stuff.
A lot of semantic-ctxt functions will probably need to be overridden
by specific code, certainly less efficient than the built-in Emacs
ones :-(

I don't think that writing parts of the Semantic lexer/parser tools in
C will improve Emacs' design. Maybe we could submit a Request For
Enhancement to the Emacs developers, so Emacs could take into account
new language concepts like python's indentation?

Sincerely,
David

__________________________________________________________________
Your favorite stores, helpful shopping tools and great gift ideas.
Experience the convenience of buying online with Shop@Netscape!
http://shopnow.netscape.com/

Get your own FREE, personal Netscape Mail account today at
http://webmail.netscape.com/
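[Editorial note: the zero-width INDENT idea discussed above -- a token
generated by the empty string at the beginning of a line, consuming no
input -- can be pictured in a few lines of Python. This is purely an
illustration, not semantic code, and the token names are made up.]

```python
import re

def line_tokens(line):
    """Lex one line, emitting a zero-width ('indent', n) token first.
    Like Dave's always-nil cond clause, producing it does not move the
    scan position: it matches the empty string at beginning of line,
    even when there is no leading white space."""
    indent = len(line) - len(line.lstrip(' '))
    toks = [('indent', indent)]          # consumes no characters
    for m in re.finditer(r'\w+|[^\w\s]', line):
        word = m.group()
        toks.append(('symbol' if (word[0].isalnum() or word[0] == '_')
                     else 'punctuation', word))
    return toks
```

So `line_tokens("x = 1")` starts with `('indent', 0)` even though the
line has no leading white space, and the remaining tokens are then
produced from the same, un-advanced position.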
From: <pon...@ne...> - 2002-05-29 11:19:38
To continue on this subject, and to illustrate how a wisent lexer
could be written for python, based on my previous hack of
`semantic-flex' I just wrote the following basic piece of code
(untested) ;-)

(defvar wisent-python-last-indent nil
  "The last level of indentation encountered so far.
Should be reset before starting a new parse task.")

(defun wisent-python-lexer ()
  "Return the next python's lexical token available.
Filter any `semantic-flex' 'indent tokens available to produce (INDENT
N) or (DEDENT N) lexical tokens needed to parse python code.  Other
`semantic-flex' tokens are handled in a normal way by `wisent-flex'."
  (let (wlex curr-indent last-indent)
    ;; Digest `semantic-flex' 'indent tokens
    (while (and (not wlex) (eq (caar wisent-flex-istream) 'indent))
      (setq curr-indent (cdar wisent-flex-istream)
            last-indent (or wisent-python-last-indent 0)
            wisent-python-last-indent curr-indent
            wisent-flex-istream (cdr wisent-flex-istream))
      (cond
       ;; No indentation change
       ((= curr-indent last-indent))    ; just eat the 'indent token
       ;; Indentation increased
       ((> curr-indent last-indent)
        ;; Return an INDENT lexical token
        (setq wlex (list 'INDENT (- curr-indent last-indent))))
       ;; Indentation decreased
       (t
        ;; Pop indentation stack
        (setq wlex (list 'DEDENT (- last-indent curr-indent))))))
    ;; In all cases return the next lexical token found
    (or wlex (wisent-flex))))

What do you think?

David

__________________________________________________________________
Your favorite stores, helpful shopping tools and great gift ideas.
Experience the convenience of buying online with Shop@Netscape!
http://shopnow.netscape.com/
From: Richard Y. K. <ry...@ds...> - 2002-05-30 04:51:30
David,

Thanks for the effort.  I still have not studied wisent-flex, but
wisent-python-last-indent does not look right.  Python indentations
need to be stored on a stack so that we know how many DEDENT tokens to
generate!  For example,

  def f(x):
      simple_statement
      compound_statement:
          simple_statement
          simple_statement
  def g(x):    # Two DEDENT tokens are needed here!
      ...

I quote from <http://www.python.org/doc/current/ref/indentation.html>:

  The indentation levels of consecutive lines are used to generate
  INDENT and DEDENT tokens, using a stack, as follows.

  Before the first line of the file is read, a single zero is pushed
  on the stack; this will never be popped off again.  The numbers
  pushed on the stack will always be strictly increasing from bottom
  to top.  At the beginning of each logical line, the line's
  indentation level is compared to the top of the stack.  If it is
  equal, nothing happens.  If it is larger, it is pushed on the stack,
  and one INDENT token is generated.  If it is smaller, it must be one
  of the numbers occurring on the stack; all numbers on the stack that
  are larger are popped off, and for each number popped off a DEDENT
  token is generated.  At the end of the file, a DEDENT token is
  generated for each number remaining on the stack that is larger than
  zero.

Unless I'm missing something fundamental, your code below will not
generate the proper number of DEDENT tokens.

>>>>> "DP" == David Ponce <pon...@ne...> writes:

DP> To continue on this subject, and to illustrate how a
DP> wisent lexer could be written for python, based on my
DP> previous hack of `semantic-flex' I just wrote the
DP> following basic piece of code (untested) ;-)

[...]

DP> What do you think?

DP> David
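[Editorial note: the algorithm Richard quotes is short enough to
transcribe directly. Below is a minimal Python sketch of it -- an
illustration only; it assumes spaces-only indentation and ignores
blank lines, tabs and line continuations. On the `def g(x)' example
above it emits the two consecutive DEDENT tokens that a single
last-indent variable cannot produce.]

```python
def indent_tokens(lines):
    """Yield INDENT/DEDENT tokens for a sequence of logical lines,
    maintaining the stack exactly as the Python reference describes."""
    stack = [0]  # a single zero, pushed first and never popped
    for line in lines:
        level = len(line) - len(line.lstrip(' '))
        if level > stack[-1]:
            stack.append(level)       # larger: push, one INDENT
            yield 'INDENT'
        else:
            while stack[-1] > level:  # smaller: pop every larger number,
                stack.pop()           # one DEDENT per popped number
                yield 'DEDENT'
            assert stack[-1] == level, "inconsistent dedent"
    while stack[-1] > 0:              # end of file: close what is open
        stack.pop()
        yield 'DEDENT'
```

Feeding it the example above yields INDENT, INDENT, then two DEDENT
tokens in a row at `def g(x):`, then INDENT and a final DEDENT at end
of file.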
From: <pon...@ne...> - 2002-05-31 09:06:44
Attachments:
sem-flex2.el
Hi Eric & Richard,

Attached you will find the latest version of `semantic-flex' including
Richard's newline enhancement plus the following:

- newlines are also correctly handled when `semantic-ignore-comments'
  is disabled.

- I moved BOL detection before processing `semantic-flex-extensions'
  because BOL tokens must always be inserted before any other tokens
  found on the same line.

  Maybe a more general approach to keep the ordering of tokens
  consistent would be to replace the last

    (nreverse ts)

  by

    (sort ts #'(lambda (t1 t2) (<= (cadr t1) (cadr t2))))

  Or even better, to also merge whitespace and comment tokens here
  too?

Anyway, I think this new version is more consistent in the way it
handles the various `semantic-flex-enable-...' options.

Here is a small example of the different results from `semantic-flex'
depending on which options are enabled.  The input is the following
small piece of Java code.

  /**
   * Describe variable <code>x</code> here.
   */
  if (ok)
    int[] x = {1,2,3}; // end

1- Want all

  (let ((semantic-ignore-comments nil)
        (semantic-flex-enable-bol t)
        (semantic-flex-enable-newlines t)
        (semantic-flex-enable-whitespace t))
    (semantic-flex-buffer))

  ((bol 1 . 1) (comment 1 . 50) (newline 50 . 51) (bol 51 . 51)
   (IF 51 . 53) (whitespace 53 . 54) (semantic-list 54 . 58)
   (newline 58 . 59) (bol 59 . 59) (whitespace 59 . 61) (INT 61 . 64)
   (semantic-list 64 . 66) (whitespace 66 . 67) (symbol 67 . 68)
   (whitespace 68 . 69) (punctuation 69 . 70) (whitespace 70 . 71)
   (semantic-list 71 . 78) (punctuation 78 . 79) (newline 79 . 80)
   (bol 80 . 80) (comment 80 . 86) (newline 86 . 87) (bol 87 . 87))

2- Ignore comments (returns them as whitespace)

  (let ((semantic-flex-enable-bol t)
        (semantic-flex-enable-newlines t)
        (semantic-flex-enable-whitespace t))
    (semantic-flex-buffer))

  ((bol 1 . 1) (whitespace 1 . 50) (newline 50 . 51) (bol 51 . 51)
   (IF 51 . 53) (whitespace 53 . 54) (semantic-list 54 . 58)
   (newline 58 . 59) (bol 59 . 59) (whitespace 59 . 61) (INT 61 . 64)
   (semantic-list 64 . 66) (whitespace 66 . 67) (symbol 67 . 68)
   (whitespace 68 . 69) (punctuation 69 . 70) (whitespace 70 . 71)
   (semantic-list 71 . 78) (punctuation 78 . 79) (newline 79 . 80)
   (bol 80 . 80) (whitespace 80 . 86) (newline 86 . 87) (bol 87 . 87))

3- Want newline & whitespace

  (let ((semantic-flex-enable-newlines t)
        (semantic-flex-enable-whitespace t))
    (semantic-flex-buffer))

  ((whitespace 1 . 50) (newline 50 . 51) (IF 51 . 53)
   (whitespace 53 . 54) (semantic-list 54 . 58) (newline 58 . 59)
   (whitespace 59 . 61) (INT 61 . 64) (semantic-list 64 . 66)
   (whitespace 66 . 67) (symbol 67 . 68) (whitespace 68 . 69)
   (punctuation 69 . 70) (whitespace 70 . 71) (semantic-list 71 . 78)
   (punctuation 78 . 79) (newline 79 . 80) (whitespace 80 . 86)
   (newline 86 . 87))

4- Just whitespace

  (let ((semantic-flex-enable-whitespace t))
    (semantic-flex-buffer))

  ((whitespace 1 . 50) (IF 51 . 53) (whitespace 53 . 54)
   (semantic-list 54 . 58) (whitespace 59 . 61) (INT 61 . 64)
   (semantic-list 64 . 66) (whitespace 66 . 67) (symbol 67 . 68)
   (whitespace 68 . 69) (punctuation 69 . 70) (whitespace 70 . 71)
   (semantic-list 71 . 78) (punctuation 78 . 79) (whitespace 80 . 87))

5- The default

  (semantic-flex-buffer)

  ((IF 51 . 53) (semantic-list 54 . 58) (INT 61 . 64)
   (semantic-list 64 . 66) (symbol 67 . 68) (punctuation 69 . 70)
   (semantic-list 71 . 78) (punctuation 78 . 79))

6- etc.!

Eric, if you agree, maybe we could check it in so Richard could use it
for his semantic-python stuff?

Sincerely,
David
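[Editorial note on the `sort' idea above: ordering tokens by their
start position is easy to express -- this Python transcription of the
comparator is an illustration, using (type, start, end) triples
mirroring the `(bol 1 . 1)' listings above -- but a stable sort leaves
tokens with equal start positions in generation order, so sorting
alone cannot guarantee that a bol token precedes a comment starting at
the same position; BOL detection still has to run first.]

```python
# Tokens as (type, start, end), mirroring the (bol 1 . 1) style above.
tokens = [('comment', 1, 50), ('bol', 1, 1), ('newline', 50, 51),
          ('bol', 51, 51), ('IF', 51, 53)]

# The elisp (sort ts #'(lambda (t1 t2) (<= (cadr t1) (cadr t2))))
# becomes a key-based sort on the start position:
ordered = sorted(tokens, key=lambda tok: tok[1])

# Python's sort is stable, so ('comment', 1, 50) is still emitted
# before ('bol', 1, 1): generation order decides ties.
```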
From: Eric M. L. <er...@si...> - 2002-05-31 12:35:26
>>> pon...@ne... (David Ponce) seems to think that:
>Hi Eric & Richard,
>
>Attached you will find the latest version of `semantic-flex'
>including Richard's newline enhancement plus the following:
>
>- newlines are also correctly handled when `semantic-ignore-comments'
>  is disabled.
>
>- I moved BOL detection before processing `semantic-flex-extensions'
>  because BOL tokens must always be inserted before any other tokens
>  found on the same line.
>
>  Maybe a more general approach to keep the ordering of tokens
>  consistent would be to replace the last
>
>    (nreverse ts)
>
>  by
>
>    (sort ts #'(lambda (t1 t2) (<= (cadr t1) (cadr t2))))
>
>  Or even better, to also merge whitespace and comment tokens here
>  too?

nreverse is going to be the fastest method of re-ordering the tokens.
If I remember correctly, Emacs uses quicksort, and quicksort is least
efficient on a fully-ordered list ( O(n^2) ).  In addition, Emacs is
forced to use nthcdr, which adds O(log(n)) to the mix (an extra scan
every time it divides).  Thus, the grand total (if I did my analysis
correctly) is O(n^2 log(n)) for our case.  To our detriment, lexical
token lists are very long.

If this final reverse/sort is a small portion of the overall time
spent analyzing (certainly possible), then I don't have a problem
using something else though. ;)

>Anyway I think this new version is more consistent in the way it
>handles the various `semantic-flex-enable-...' options.

[ ... ]

I think your resolution to his problem is very good.  When you are
comfortable with it, please check it in.

Thanks
Eric

--
Eric Ludlam:  za...@gn..., er...@si...
Home: www.ultranet.com/~zappo   Siege: www.siege-engine.com
Emacs: http://cedet.sourceforge.net   GNU: www.gnu.org
From: Richard Y. K. <ry...@ds...> - 2002-05-31 18:18:47
>>>>> "EL" == Eric M Ludlam <er...@si...> writes:

EL> [ ... ]
EL> I think your resolution to his problem is very good.  When you are
EL> comfortable with it, please check it in.

Wait a minute!

I have no problem with Dave's fine work.  I have a problem with my own
change, which may have unfortunate side effects!

In the part of semantic-flex that deals with comments, I replaced the
second (forward-comment 1) with

  (if (and semantic-flex-enable-newlines
           (bolp))
      (backward-char 1))

Should this be instead

  (if (and semantic-flex-enable-newlines
           (bolp))
      (backward-char 1)
    (forward-comment 1))

i.e., add back (forward-comment 1) so that the old behavior is not
modified too much?

What I did was a quick "fix" in order to test other pieces of code.  I
would certainly like both of you to review this thoroughly before it
gets checked in.
From: David P. <da...@dp...> - 2002-05-31 18:47:49
Hi,

[...]

> nreverse is going to be the fastest method of re-ordering the
> tokens.  If I remember correctly, Emacs uses quicksort, and quicksort
> is least efficient on a fully-ordered list ( O(n^2) ).  In addition,
> Emacs is forced to use nthcdr, which adds O(log(n)) to the mix (an
> extra scan every time it divides).  Thus, the grand total (if I did
> my analysis correctly) is O(n^2 log(n)) for our case.  To our
> detriment, lexical token lists are very long.
[...]

Good point!  For now there is no reason not to use `nreverse', so
there is no need to change the code of `semantic-flex'.

[...]

> I think your resolution to his problem is very good.  When you are
> comfortable with it, please check it in.

Thanks!  I checked it in so Richard can use it :)  I also updated the
manual accordingly and fixed some documentation inaccuracies ;-)
Following is a patch.  If you agree I could check it in too.

David

*** semantic.texi.ori	Mon May 13 07:48:21 2002
--- semantic.texi	Fri May 31 20:03:15 2002
***************
*** 413,419 ****
--- 413,425 ----
  the value of @var{semantic-flex-make-extensions} which may generate
  @code{shell-command} tokens.
+ 
+ @anchor{Default syntactic tokens}
+ @subsection Default syntactic tokens if the lexer is not extended.
  @table @code
+ @item bol
+ Empty string matching a beginning of line.
+ This token is produced only if the user set
+ @var{semantic-flex-enable-bol} to non-@code{nil}.
  @item charquote
  String sequences that match @code{\\s\\+}.
  @item close-paren
***************
*** 425,431 ****
  They are produced only if the user set
  @var{semantic-ignore-comments} to @code{nil}.
  @item newline
! Characters matching @code{\\s-*\\(\n\\)}.
  This token is produced only if the user set
  @var{semantic-flex-enable-newlines} to non-@code{nil}.
--- 431,437 ----
  They are produced only if the user set
  @var{semantic-ignore-comments} to @code{nil}.
  @item newline
! Characters matching @code{\\s-*\\(\n\\|\\s>\\)}.
  This token is produced only if the user set
  @var{semantic-flex-enable-newlines} to non-@code{nil}.
***************
*** 447,452 ****
--- 453,464 ----
  matching end.
  @item symbol
  String sequences that match @code{\\(\\sw\\|\\s_\\)+}.
+ @item whitespace
+ Characters that match `\\s-+' regexp.
+ This token is produced only if the user set
+ @var{semantic-flex-enable-whitespace} to non-@code{nil}.  If
+ @var{semantic-ignore-comments} is non-@code{nil} too comments are
+ considered as whitespaces.
  @end table
  
  @node Lexer Options, Keywords, Lexer Output, Lexing
***************
*** 456,461 ****
--- 468,484 ----
  functions, there are ways for you to extend or customize the lexer.
  Three variables shown below serve this purpose.
  
+ @defvar semantic-flex-unterminated-syntax-end-function
+ Function called when unterminated syntax is encountered.
+ This should be set to one function.  That function should take three
+ parameters.  The @var{SYNTAX}, or type of syntax which is unterminated.
+ @var{SYNTAX-START} where the broken syntax begins.
+ @var{FLEX-END} is where the lexical analysis was asked to end.
+ This function can be used for languages that can intelligently fix up
+ broken syntax, or the exit lexical analysis via @dfn{throw} or @dfn{signal}
+ when finding unterminated syntax.
+ @end defvar
+ 
  @defvar semantic-flex-extensions
  Buffer local extensions to the lexical analyzer.
  This should contain an alist with a key of a regex and a data element of
***************
*** 497,509 ****
  Only set this on a per mode basis, not globally.
  @end defvar
  
! @defvar semantic-flex-unterminated-syntax-throw-symbol
! Symbol specifying what to @dfn{throw} upon finding unterminated syntax.
! Lists and strings, could be unterminated.  This provides something that
! can be @code{thrown} from the lexical analysis phase for tools that wish
! to take special care when problems arise during a parse.
! Set this variable in a @dfn{let} statement, then wrap lexical or parsing
! calls in @dfn{catch}.
  @end defvar
  
  @node Keywords, Keyword Properties, Lexer Options, Lexing
--- 520,567 ----
  Only set this on a per mode basis, not globally.
  @end defvar
  
! @defvar semantic-flex-enable-whitespace
! When flexing, report @code{'whitespace} as syntactic elements.
! Useful for languages where the syntax is whitespace dependent.
! Only set this on a per mode basis, not globally.
! @end defvar
! 
! @defvar semantic-flex-enable-bol
! When flexing, report beginning of lines as syntactic elements.
! Useful for languages like python which are indentation sensitive.
! Only set this on a per mode basis, not globally.
! @end defvar
! 
! @defvar semantic-number-expression
! Regular expression for matching a number.
! If this value is @code{nil}, no number extraction is done during lex.
! This expression tries to match C and Java like numbers.
! 
! @example
! DECIMAL_LITERAL:
!     [1-9][0-9]*
!   ;
! HEX_LITERAL:
!     0[xX][0-9a-fA-F]+
!   ;
! OCTAL_LITERAL:
!     0[0-7]*
!   ;
! INTEGER_LITERAL:
!     <DECIMAL_LITERAL>[lL]?
!   | <HEX_LITERAL>[lL]?
!   | <OCTAL_LITERAL>[lL]?
!   ;
! EXPONENT:
!     [eE][+-]?[0-9]+
!   ;
! FLOATING_POINT_LITERAL:
!     [0-9]+[.][0-9]*<EXPONENT>?[fFdD]?
!   | [.][0-9]+<EXPONENT>?[fFdD]?
!   | [0-9]+<EXPONENT>[fFdD]?
!   | [0-9]+<EXPONENT>?[fFdD]
!   ;
! @end example
  @end defvar
  
  @node Keywords, Keyword Properties, Lexer Options, Lexing
***************
*** 1033,1064 ****
  will explicitly match one period when used in the above rule.
  
! Default syntactic tokens (If the lexer is not extended) are:
! 
! @table @code
! @item newline
! A newline if @var{semantic-flex-enable-newline} is non-nil.
! @item symbol
! A symbol for the language, usually comprising alpha numeric
! characters, and _.
! @item number
! A number for the language.  You can specify a number format with
! the variable @var{semantic-number-expression}.
! @item charquote
! A character quoting punctuation.  Like ? in Emacs Lisp.
! @item semantic-list
! A list, delimited on either end with some parenthetical form.
! @item open-paren
! An opening parenthesis.
! @item close-paren
! A closing parenthesis.
! @item string
! A string, including starting and ending delimiters.
! @item comment
! A comment.  This can be stripped from the stream if
! @var{semantic-ignore-comments} is non-nil.
! @item punctuation
! Punctuation characters, such as operators, period, and coma.
! @end table
  
  @node Optional Lambda Expression, Examples, Rules, BNF conversion
  @section Optional Lambda Expressions
--- 1091,1098 ----
  will explicitly match one period when used in the above rule.
  
! @xref{Default syntactic tokens}.
! 
  @node Optional Lambda Expression, Examples, Rules, BNF conversion
  @section Optional Lambda Expressions
***************
*** 1186,1192 ****
  ( "A" "B" )
  @end example
  
! @node Style Guide ,  , Examples, BNF conversion
  @section Semantic Token Style Guide
  
  In order for a generalized program using Semantic to work with
--- 1220,1226 ----
  ( "A" "B" )
  @end example
  
! @node Style Guide, , Examples, BNF conversion
  @section Semantic Token Style Guide
  
  In order for a generalized program using Semantic to work with
***************
*** 2310,2316 ****
  For details on using these functions to get more detailed information
  about the current context: @xref{Context Analysis}.
  
! @node Making New Methods,  , Local Context, Override Methods
  @subsection Making New Methods
  
  @node Parser Hooks, Example Programs, Override Methods, Programming
--- 2344,2350 ----
  For details on using these functions to get more detailed information
  about the current context: @xref{Context Analysis}.
  
! @node Making New Methods, , Local Context, Override Methods
  @subsection Making New Methods
  
  @node Parser Hooks, Example Programs, Override Methods, Programming
***************
*** 2432,2438 ****
  during a flush when the cache is given a new value of nil.
  @end defvar
  
! 
  @node Example Programs, , Parser Hooks, Programming
  @section Programming Examples
  
  Here are some simple examples that use different aspects of the
--- 2466,2472 ----
  during a flush when the cache is given a new value of nil.
  @end defvar
  
! 
  @node Example Programs, , Parser Hooks, Programming
  @section Programming Examples
  
  Here are some simple examples that use different aspects of the
***************
*** 3213,3219 ****
  @dfn{semantic-analyze-possible-completions}.
  @end deffn
  
! @node Speedbar Analysis,  , Smart Completion, analyzer
  @comment node-name, next, previous, up
  @subsection Speedbar Analysis
--- 3247,3253 ----
  @dfn{semantic-analyze-possible-completions}.
  @end deffn
  
! @node Speedbar Analysis, , Smart Completion, analyzer
  @comment node-name, next, previous, up
  @subsection Speedbar Analysis
From: Eric M. L. <er...@si...> - 2002-05-31 19:19:19
>>> "Richard Y. Kim" <ry...@ds...> seems to think that: >>>>>> "EL" == Eric M Ludlam <er...@si...> writes: > EL> > EL> [ ... ] > EL> > EL> I think your resolution to his problem is very good. When you are > EL> comfortable with it, please check it in. > >Wait a minute! > >I have no problem with Dave's fine work. >I have problem with my own change that may have unfortunate >side affects! > >In semantic-flex that deals with comments, >I replaced the second (forward-comment 1) with > > (if (and semantic-flex-enable-newlines > (bolp)) > (backward-char 1)) > >Should this be instead > > (if (and semantic-flex-enable-newlines > (bolp)) > (backward-char 1) > (forward-comment 1)) > >i.e., add back (forward-comment 1) so that the old behavior >is not modified too much? > >What I did was a quick "fix" in order to test other pieces >of code. I certainly would like both of you to review this >thoroughly before getting checked in. [ ... ] I was merely referring to the overall solution as far as programmer API and behavior, which is why I said to only check it in after you were comfortable with it. (As opposed to fixing everything, and then waiting a week for me to get back from some vacation or other before checking something in.) Eric -- Eric Ludlam: za...@gn..., er...@si... Home: www.ultranet.com/~zappo Siege: www.siege-engine.com Emacs: http://cedet.sourceforge.net GNU: www.gnu.org |
From: Eric M. L. <er...@si...> - 2002-05-29 12:47:38
Hi,

[ ... ]

>> If speed becomes an issue, it may make sense to implement
>> part of semantic in C.  I don't know that we have reached
>> that point yet with regard to python.
>[...]
>
>So, in the case of python, I think it will be difficult for
>semantic-flex to easily produce the nice 'semantic-list tokens
>needed to recursively parse sub parts of code. ...
[ ... ]

Perhaps we can define a hook of some sort that would allow Richard to
write a piece of code for the lexer that will identify the first line
of indented code, then connect all such lines together into one giant
token called 'body, or some-such.  In this way, the ll or lalr parser
would skip over all such lines very quickly.

The trick would then be that the tagging lexer would want this
addition, but the full parser (since wisent parsers seem to come in
twos) would want it turned off.

>I agree with Eric that syntax tables are mainly oriented to navigate,
>particularly through parenthesized blocks of code. ...
[ ... ]

The Emacs Lisp and Makefile parsers both need many of these local
context functions too.  For Emacs Lisp, however, all the cool stuff
you can do with the ctxt code is already in Emacs.  With Makefiles,
much of it makes no sense.  Thus, they have not been written.

With Python, however, writing such functions would enable new commands
such as beginning/end of command to be written.  These already exist
for C and Java in cc-mode, which is probably why they haven't been
written.  These would be something good to add to senator:

  senator-[beginning|end]-of-statement
  senator-[up|down|forward|backward]-block  (better name than block?)

>I don't think that writing parts of the Semantic lexer/parser tools in
>C will improve Emacs design. ...
[ ... ]

Emacs has a built-in function called `parse-partial-sexp' which is
very interesting, and also very fast.  I've always wanted to have
something similar that stepped over a buffer from POINT one character
at a time, matching into the syntax table (exactly as
parse-partial-sexp does) but returning the location where it stopped
when there was a change in syntax, in addition to returning the type
of thing found.

There is also `skip-syntax-forward' which is pretty nifty, but you
have to know what syntax is under the cursor.  Perhaps a combination
of `char-syntax' and `skip-syntax-forward' could be used to make
things faster.  Hmmm.  I detect an experiment I may have to do.

Anyway, each time we do a regex search, that regex has to be compiled,
then interpreted.  Since we are just matching syntax table elements,
this is overkill.  Since Emacs' syntax handling is meant for
navigation, this has never been needed before.

Have fun
Eric

--
Eric Ludlam:  za...@gn..., er...@si...
Home: www.ultranet.com/~zappo   Siege: www.siege-engine.com
Emacs: http://cedet.sourceforge.net   GNU: www.gnu.org
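[Editorial note: the `char-syntax' + `skip-syntax-forward' combination
Eric is musing about amounts to splitting the buffer into maximal runs
of characters that share a syntax class. A rough Python illustration
only -- a toy three-class table stands in for the real, per-buffer and
much richer Emacs syntax tables:]

```python
from itertools import groupby

def char_syntax(ch):
    """Toy stand-in for Emacs' char-syntax: w = word constituent,
    - = whitespace, . = punctuation."""
    if ch.isalnum() or ch == '_':
        return 'w'
    if ch.isspace():
        return '-'
    return '.'

def syntax_runs(text):
    """Return (class, start, end) for each maximal run of one syntax
    class: step one character at a time and stop at each change of
    syntax, which is the primitive described above."""
    runs, pos = [], 0
    for klass, group in groupby(text, key=char_syntax):
        length = len(list(group))
        runs.append((klass, pos, pos + length))
        pos += length
    return runs
```

`syntax_runs("if (ok)")` splits into the word `if`, a whitespace run,
the open paren, the word `ok` and the close paren, with no regexp
compiled or interpreted along the way.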