clean-html.sh - bash script to clean HTML.

1) Converts saved pages to UTF-8.
2) Strips extra spaces, tabs, blank lines, scripts,
     images, and meta-information from saved pages.

     PS: when a page contains a <pre> element, only limited filtering is applied!

! Not all characters can be transcoded to UTF-8. Be careful.
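
The two steps described above can be sketched roughly as follows. This is an illustration only, not the actual source of clean-html.sh: the function names to_utf8 and strip_html are hypothetical, the source charset is assumed to be known in advance, and the filter assumes lowercase tags. It does not special-case <pre>, and a one-line script whose body contains "<" is not handled correctly.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the two steps; not the real clean-html.sh.

# to_utf8 FROM_CHARSET
# Convert stdin from FROM_CHARSET to UTF-8. iconv exits non-zero when it
# meets a byte sequence it cannot convert, so check the exit status.
to_utf8() {
    iconv -f "$1" -t UTF-8
}

# strip_html
# Filter stdin line by line:
#   1. drop <script>...</script> pairs that open and close on one line
#   2. drop multi-line script blocks (from the opening to the closing line)
#   3. drop <img> and <meta> tags
#   4. collapse runs of spaces and tabs into a single space
#   5. trim leading and trailing spaces
#   6. delete lines that are now empty
strip_html() {
    sed -E \
        -e 's/<script[^>]*>[^<]*<\/script>//g' \
        -e '/<script/,/<\/script>/d' \
        -e 's/<(img|meta)[^>]*>//g' \
        -e 's/[[:space:]]+/ /g' \
        -e 's/^ //; s/ $//' \
        -e '/^$/d'
}
```

A possible invocation, with WINDOWS-1251 standing in for whatever charset the saved page really uses: to_utf8 WINDOWS-1251 < page.html | strip_html > page.clean.html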

Categories

HTML/XHTML

License

GNU General Public License version 2.0 (GPLv2)

Additional Project Details

Operating Systems

Linux

Languages

Russian

User Interface

Console/Terminal

Programming Language

Unix Shell

Related Categories

Unix Shell HTML XHTML

Registered

2012-09-26