clean-html.sh - bash script to clean HTML.

1) Convert the stored pages in UTF-8.
2) defecating saved pages of extra spaces, tabs,
     blank lines, scripts, images, meta-information.

     PS: when an <pre> produces limited filtering!

! Not all characters can be transcoding UTF-8. Be careful.

Project Samples

Project Activity

See All Activity >

Categories

HTML/XHTML

License

GNU General Public License version 2.0 (GPLv2)

Follow clean-html-sh

clean-html-sh Web Site

Other Useful Business Software
Level Up Your Cyber Defense with External Threat Management Icon
Level Up Your Cyber Defense with External Threat Management

See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
Try for Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of clean-html-sh!

Additional Project Details

Operating Systems

Linux

Languages

Russian

User Interface

Console/Terminal

Programming Language

Unix Shell

Related Categories

Unix Shell HTML XHTML

Registered

2012-09-26