From: Stefan B. <bo...@ap...> - 2009-10-21 19:07:19
|
On 2009-10-20, Tony Rozga <ton...@ni...> wrote: > I'm using XMLUnit primarily for HTMLDocumentBuilder and > TolerantSaxDocumentBuilder (nice tools btw!). Honestly, I would have thought they had been obsoleted by things like tagsoup or jtidy. > If I build a Document from html with in it the String contents > of the Node in question have weird bytes where the space should be. There are way more space characters in Unicode than you'll ever want to know. In particular there is a "non-breaking space" character and I guess this is what you see as "weird bytes". is not a simple space and so you can't expect to get that. Stefan |