phpwiki-talk Mailing List for PhpWiki (Page 304)

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

In message <147...@da...>,Arno Hollosi writes:
>The one place I can think of right now is the use of preg_match_all()
>in wiki_transform. Also, eregs don't have non-greedy matches. Can't
>remember which one, but I recall that there is at least one match
>which needs non-greediness.

Of course, "need" is always relative.  :-)

> > Perhaps we can live with [invalid HTML]?
>
>I can, because the above case will not appear very often, will it?

Not except as a result of typos and brainos.
If the wiki markup is esoteric or just wrong, I don't mind if it comes
out looking like garbage (in fact, it should).  However broken HTML makes
me nervous.  Who knows what it will come out looking like on whatever
random browser I happen to be using?  (I'll admit the world is unlikely to 
end.)

>Btw, as your FIXME states: the recursive logic does not work as
>advertised: "__''word''__" renders ok, but "''__word__''" is not
>rendered - instead __ is inserted verbatim. Just looking at the code it
>becomes clear where the "fault" lies: you are always processing $line.
>Real recursion means processing the created tokens. (I guess you are
>aware of that already) Oddly enough replacing __ with ''' makes it
>work in both cases, but that is due to the regexp and not
>because of the recursion.

You're right.

Actually, my original intent was to handle this via regexps.
My intent (not that it made it into the code) was that none of the
"''", "'''", or "__" quoted expressions are recognized unless they
contain no (untokenized) occurrence of either "''" or "__".

Ie. the regexp for the __Bold__ expressions should have been:
  "__[^_'](?:[^_']+|_(?!_)|'(?!'))+(?<!_)__"

There! Haha. Make sense?

No really, you're right.  It's broken.

> > I suppose we could eliminate the recursable logic, while keeping the
> > tokenization by applying each of the currently recursed transformations
> > twice.
>
>Apart from doing ''' before '' (otherwise '''word''' becomes '<i>word</i>')
>it does not immediately solve the problem. You need to transfrom the
>tokens and not $line as you do right now.

Of course.  Okay, so never mind... 

>So my conclusion is: recursion adds complexity (while having its benefits).
>Let's start with HTML-in-place right now, and once some time has
>passed and the dust settled, we can do the recursion stuff - we will
>then have a better understanding of the issue.
>
>[Or you write a functioning and beautiful recursion right away ;o)]

Let me search for a nicer solution for a little while more.  (A week or two.)
As I see it, there's no big rush for this, as the present 
wiki_transform works just fine.

Jeff

2000	Jan	Feb	Mar	Apr	May (1)	Jun (103)	Jul (105)	Aug (16)	Sep (16)	Oct (78)	Nov (36)	Dec (58)
2001	Jan (100)	Feb (155)	Mar (84)	Apr (33)	May (22)	Jun (77)	Jul (36)	Aug (37)	Sep (183)	Oct (74)	Nov (235)	Dec (165)
2002	Jan (187)	Feb (183)	Mar (52)	Apr (10)	May (15)	Jun (19)	Jul (43)	Aug (90)	Sep (144)	Oct (144)	Nov (171)	Dec (78)
2003	Jan (113)	Feb (99)	Mar (80)	Apr (44)	May (35)	Jun (32)	Jul (34)	Aug (34)	Sep (30)	Oct (57)	Nov (97)	Dec (139)
2004	Jan (132)	Feb (223)	Mar (300)	Apr (221)	May (171)	Jun (286)	Jul (188)	Aug (107)	Sep (97)	Oct (106)	Nov (139)	Dec (125)
2005	Jan (200)	Feb (116)	Mar (68)	Apr (158)	May (70)	Jun (80)	Jul (55)	Aug (52)	Sep (92)	Oct (141)	Nov (86)	Dec (41)
2006	Jan (35)	Feb (62)	Mar (59)	Apr (52)	May (51)	Jun (61)	Jul (30)	Aug (36)	Sep (12)	Oct (4)	Nov (22)	Dec (34)
2007	Jan (49)	Feb (19)	Mar (37)	Apr (16)	May (9)	Jun (38)	Jul (17)	Aug (31)	Sep (16)	Oct (34)	Nov (4)	Dec (8)
2008	Jan (8)	Feb (16)	Mar (14)	Apr (6)	May (4)	Jun (5)	Jul (9)	Aug (36)	Sep (6)	Oct (3)	Nov (3)	Dec (3)
2009	Jan (14)	Feb (2)	Mar (7)	Apr (16)	May (2)	Jun (10)	Jul (1)	Aug (10)	Sep (11)	Oct (4)	Nov (2)	Dec
2010	Jan (1)	Feb	Mar (13)	Apr (11)	May (18)	Jun (44)	Jul (7)	Aug (2)	Sep (14)	Oct	Nov (6)	Dec
2011	Jan (2)	Feb (6)	Mar (3)	Apr (2)	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2012	Jan (11)	Feb (3)	Mar (11)	Apr	May	Jun	Jul	Aug	Sep	Oct (1)	Nov (4)	Dec
2013	Jan	Feb	Mar	Apr (3)	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2014	Jan	Feb	Mar	Apr	May (4)	Jun	Jul	Aug	Sep	Oct	Nov (8)	Dec (1)
2015	Jan (3)	Feb (2)	Mar	Apr (3)	May (1)	Jun	Jul (1)	Aug	Sep	Oct	Nov	Dec (2)
2016	Jan	Feb (4)	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2017	Jan	Feb	Mar	Apr	May	Jun	Jul (3)	Aug	Sep	Oct	Nov	Dec
2018	Jan	Feb	Mar	Apr	May (3)	Jun (1)	Jul	Aug	Sep	Oct	Nov	Dec
2020	Jan	Feb	Mar	Apr	May (3)	Jun	Jul (5)	Aug	Sep	Oct	Nov	Dec
2021	Jan	Feb (4)	Mar	Apr	May	Jun	Jul (1)	Aug (6)	Sep (3)	Oct	Nov	Dec
2022	Jan (11)	Feb (2)	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2023	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct (1)	Nov (3)	Dec (3)
2024	Jan (7)	Feb (2)	Mar (1)	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2025	Jan	Feb	Mar	Apr (1)	May (1)	Jun	Jul	Aug	Sep	Oct	Nov	Dec