[phpXML] Re: [phpXML] bugs with character entities?
Brought to you by:
bs_php,
nigelswinson
From: <ph...@pe...> - 2001-10-08 16:26:06
|
On Saturday 07 July 2001 20:29, you wrote: > Hello, > > I have the following in an XML file: > > <title>DTD's & schema</title> > > Using <phpXML/>, I extract that string using get_content() and > display it on a web page. For some reason, the spaces around the > ampersand are removed. > > Does someone have a fix for this? Any help would be really > appreciated. This is because the parser treats entities like & as separate elements and passes them separately through the character data handler. Because phpxml trims all strings in its handler, entities also get trimmed. There is a similar problem with the embedded nodes we've been discussing (<text>brave <ex>new</ex> world</text>). It needs to do some trimming to get rid of carriage returns, tabs, etc in the file, but I rather think instead of getting rid of all of them, it would be better to replace large amounts of whitespace with one space. Alternatively, only trim spacing at the beginning and end of nodes, not in the middle. Curiously php has a (undocumented) SKIP_WHITE option which is used in the parse_into_struct example, but this doesn't seem to have any effect in the ordinary parse. > Also, there is a bug on line 125: "&apos" should be "'". To > make things more troublesome, it looks like Internet Explorer (at > least mine!) doesn't recognize ' but will recognize '. solution: don't use IE! (or simply use ' instead) -- This message has been sent through the <phpXML/> user discussion list. To unsubscribe, please visit https://sslsites.de/mailinglisten/user/us...@li.../ |