...
- Handles no-break space (NBSP) and soft hyphen (SHY) sensibly.
- Uses Unicode internally; reads and writes 8-bit or UTF-8 encoded files.
- Recognizes preformatted paragraphs (e.g. source code, tables).
- Reformats paragraphs using either the traditional greedy line-breaking algorithm or a TeX-like optimizing algorithm.
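
To illustrate the difference between the two line-breaking strategies named in the last item, here is a minimal Python sketch. It is not this tool's implementation: the function names `greedy_wrap` and `optimal_wrap` are hypothetical, and the cost function (sum of squared trailing space, last line free) is an assumed simplification that is only TeX-like in spirit, since TeX's real algorithm also weighs stretchable glue, penalties and hyphenation.

```python
def greedy_wrap(words, width):
    """Greedy: put as many words on each line as fit, then start a new line."""
    lines, current, length = [], [], 0
    for word in words:
        # One extra column for the separating space if the line is non-empty.
        needed = len(word) + (1 if current else 0)
        if current and length + needed > width:
            lines.append(" ".join(current))
            current, length = [word], len(word)
        else:
            current.append(word)
            length += needed
    if current:
        lines.append(" ".join(current))
    return lines


def optimal_wrap(words, width):
    """Optimizing: pick breaks that minimize the total squared trailing space.

    Assumed cost model (not taken from the tool): each line except the last
    costs (width - line_length) ** 2; the last line costs nothing.
    """
    n = len(words)
    best = [float("inf")] * (n + 1)  # best[i]: minimal cost of laying out words[i:]
    nxt = [n] * (n + 1)              # nxt[i]: index of the first word of the next line
    best[n] = 0
    for i in range(n - 1, -1, -1):
        length = -1                  # cancels the space counted for the first word
        for j in range(i + 1, n + 1):
            length += 1 + len(words[j - 1])
            if length > width and j > i + 1:
                break                # line overflows; longer lines overflow too
            # A single over-long word is placed alone on its line at zero cost.
            slack = max(width - length, 0)
            cost = 0 if j == n else slack * slack
            if cost + best[j] < best[i]:
                best[i] = cost + best[j]
                nxt[i] = j
    lines, i = [], 0
    while i < n:
        lines.append(" ".join(words[i:nxt[i]]))
        i = nxt[i]
    return lines


if __name__ == "__main__":
    words = "aaa bb cc ddddd".split()
    print(greedy_wrap(words, 6))   # ['aaa bb', 'cc', 'ddddd']
    print(optimal_wrap(words, 6))  # ['aaa', 'bb cc', 'ddddd']
```

In this example the greedy version fills the first line completely and leaves "cc" alone on a very short second line, while the optimizing version accepts a shorter first line to make the paragraph more even overall.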