cpDetector

73 Downloads (This Week)
Download cpdetector_1.0.10_binary.zip
Windows Mac Linux

Description

cpDetector is a proxy for codepage detection of documents. It delegates to multiple detector instances, each of which tries to detect the codepage using a different technique. A command-line executable is included that can sort documents by codepage.
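
The delegation idea described above can be illustrated with a minimal, self-contained sketch. It is not cpDetector's actual API: the names `DetectorChainSketch`, `Strategy`, `Proxy`, and `detectAscii` are hypothetical and chosen for illustration only. The sketch assumes that the proxy asks each registered strategy in order and returns the first definite answer.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

public class DetectorChainSketch {

    /** Hypothetical strategy interface (an assumption, not cpDetector's API). */
    interface Strategy {
        Optional<Charset> detect(byte[] data);
    }

    /** Hypothetical proxy: tries each delegate in order, returns the first result. */
    static final class Proxy implements Strategy {
        private final List<Strategy> delegates = new ArrayList<>();

        Proxy add(Strategy strategy) {
            delegates.add(strategy);
            return this;
        }

        @Override
        public Optional<Charset> detect(byte[] data) {
            for (Strategy strategy : delegates) {
                Optional<Charset> result = strategy.detect(data);
                if (result.isPresent()) {
                    return result;
                }
            }
            return Optional.empty(); // no delegate could decide
        }
    }

    /** Simple ASCII strategy: succeeds only if every byte is in 0x00..0x7F. */
    static Optional<Charset> detectAscii(byte[] data) {
        for (byte b : data) {
            if (b < 0) { // bytes >= 0x80 are negative as signed Java bytes
                return Optional.empty();
            }
        }
        return Optional.of(StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        Proxy proxy = new Proxy().add(DetectorChainSketch::detectAscii);
        byte[] sample = "plain text".getBytes(StandardCharsets.US_ASCII);
        System.out.println(proxy.detect(sample));
    }
}
```

In a design like this, cheap and exact strategies would usually be registered before heuristic ones, so that guessing only runs when nothing more reliable applies.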

cpDetector Web Site

Features

  • Extendable framework for detection strategies
  • Byte order mark detection
  • ASCII detection
  • Guessing strategy (jchardet, based on the Mozilla code page detection)
  • XML header detection
  • HTML header detection
  • Command line interface for transcoding / detecting / sorting (by codepage) trees of files
  • See comparison: http://fredeaker.blogspot.com/2007/01/character-encoding-detection.html
  • Fast: http://tinyurl.com/cpdetector-icu-performance
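
As an illustration of the byte order mark detection feature listed above, here is a minimal sketch of how such a check might work. It is not taken from cpDetector's source: the class `BomSketch` and its methods are hypothetical, and the sketch returns charset names as plain strings. It assumes the standard Unicode BOM byte sequences and checks the 4-byte UTF-32 marks before the 2-byte UTF-16 marks, because the UTF-32LE mark begins with the UTF-16LE mark.

```java
import java.util.Optional;

public class BomSketch {

    /** Returns true if data begins with the given unsigned byte values. */
    static boolean startsWith(byte[] data, int... prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if ((data[i] & 0xFF) != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    /** Maps a leading BOM to a charset name, or empty if no BOM is present. */
    static Optional<String> detectBom(byte[] data) {
        // Longer marks first: FF FE 00 00 would otherwise match UTF-16LE.
        if (startsWith(data, 0x00, 0x00, 0xFE, 0xFF)) return Optional.of("UTF-32BE");
        if (startsWith(data, 0xFF, 0xFE, 0x00, 0x00)) return Optional.of("UTF-32LE");
        if (startsWith(data, 0xEF, 0xBB, 0xBF)) return Optional.of("UTF-8");
        if (startsWith(data, 0xFE, 0xFF)) return Optional.of("UTF-16BE");
        if (startsWith(data, 0xFF, 0xFE)) return Optional.of("UTF-16LE");
        return Optional.empty();
    }

    public static void main(String[] args) {
        byte[] sample = {(byte) 0xEF, (byte) 0xBB, (byte) 0xBF, 0x61};
        System.out.println(detectBom(sample));
    }
}
```

Note that the sequence FF FE 00 00 is ambiguous in principle (it could also be a UTF-16LE mark followed by a NUL character); this sketch simply prefers UTF-32LE.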


Additional Project Details

Languages

English

Intended Audience

Developers, Information Technology

Programming Language

Java

Registered

2004-07-13