cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
Features
- Extendable framework for detection strategies
- Byte order mark detection
- ASCII detection
- Guessing strategy (jchartdet, based on the mozilla code page detection)
- XML header detection
- HTML header detection
- Command line interface for transcoding / detecting / sorting (by codepage) trees of files
- See comparison: http://fredeaker.blogspot.com/2007/01/character-encoding-detection.html
- Fast: http://tinyurl.com/cpdetector-icu-performance
License
Mozilla Public License 1.1 (MPL 1.1)Follow cpDetector
Other Useful Business Software
Your top-rated shield against malware and online scams | Avast Free Antivirus
Our antivirus software scans for security and performance issues and helps you to fix them instantly. It also protects you in real time by analyzing unknown files before they reach your desktop PC or laptop — all for free.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of cpDetector!