cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
Features
- Extendable framework for detection strategies
- Byte order mark detection
- ASCII detection
- Guessing strategy (jchartdet, based on the mozilla code page detection)
- XML header detection
- HTML header detection
- Command line interface for transcoding / detecting / sorting (by codepage) trees of files
- See comparison: http://fredeaker.blogspot.com/2007/01/character-encoding-detection.html
- Fast: http://tinyurl.com/cpdetector-icu-performance
License
Mozilla Public License 1.1 (MPL 1.1)Follow cpDetector
Other Useful Business Software
Error to trace to log to deploy. One click. No SSH.
AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of cpDetector!