I'm confused about what happens when character data contain a multibyte character: xmlstarlet is consistently breaking up the character data into two portions of character data, as it were, at the first multibyte character.
Consider the following XML document:
<?xml version="1.0" encoding="UTF-8"?>
<elem>Höret!</elem>
encoded in UTF-8. xml pyx...
2009-05-02 09:18:33 UTC in XMLStarlet command line XML toolkit