clean-html.sh - bash script to clean HTML.
1) Convert the stored pages in UTF-8.
2) defecating saved pages of extra spaces, tabs,
blank lines, scripts, images, meta-information.
PS: when an <pre> produces limited filtering!
! Not all characters can be transcoding UTF-8. Be careful.