#81 Charset detection in text/html

Status: open
Owner: nobody
Labels: htdig (31)
Priority: 5
Updated: 2005-04-14
Created: 2005-04-14
Creator: Anonymous
Private: No

Automatically detecting the charset of a file and recoding it
when needed is very useful, especially for Russian
(for example, koi8-r and windows-1251).
I have a working patch that supports:
1. detecting the charset of HTML files from the 'charset' value
2. detecting the charset of TEXT files using a dictionary
3. automatic recoding when needed
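
A minimal Python sketch of items 1 and 3 above, for illustration only. It is not the patch itself (which is not attached here), and the helper names `declared_charset` and `recode`, the 1024-byte scan window, the iso-8859-1 default, and the UTF-8 target are all assumptions:

```python
import re

# Hypothetical helpers; these names do not come from ht://Dig or the patch.
# Matches charset=... in a <meta> tag, with or without quotes.
META_CHARSET = re.compile(
    rb"""charset\s*=\s*["']?\s*([A-Za-z0-9._:-]+)""", re.IGNORECASE
)

def declared_charset(raw: bytes, default: str = "iso-8859-1") -> str:
    """Return the charset declared in the first 1024 bytes, or the default."""
    match = META_CHARSET.search(raw[:1024])
    return match.group(1).decode("ascii").lower() if match else default

def recode(raw: bytes, source: str, target: str = "utf-8") -> bytes:
    """Recode raw bytes from source to target.

    Bytes that cannot be decoded are replaced rather than raising.
    An encoding name unknown to Python raises LookupError.
    """
    return raw.decode(source, errors="replace").encode(target, errors="replace")
```

For example, `recode(page, declared_charset(page))` would convert a koi8-r page that declares its charset into UTF-8 before indexing.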
TODOs:
1. import character maps from the recode command/library
2. multiple (chained) recodes (as 'recode 1..2..3' does)
3. detect the charset of an HTML file by dictionary if the
'charset' value is missing
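
Dictionary-based detection, as mentioned above, could look roughly like the following Python sketch. It is one possible approach and not necessarily how the patch works: each candidate encoding is tried, and the one whose decoding yields the most known words wins. The word list, the candidate list, and the name `guess_charset` are illustrative assumptions.

```python
# Tiny illustrative word list (assumption); a real dictionary would be much larger.
COMMON_WORDS = {"и", "в", "не", "на", "что", "это", "как", "для"}
CANDIDATES = ("koi8-r", "windows-1251")

def guess_charset(raw: bytes, candidates=CANDIDATES, words=COMMON_WORDS) -> str:
    """Return the candidate whose decoding contains the most dictionary words.

    On a tie, the earliest candidate in the list is returned.
    """
    def score(encoding: str) -> int:
        text = raw.decode(encoding, errors="replace").lower()
        # Count whitespace-separated tokens that match after trimming punctuation.
        return sum(1 for token in text.split() if token.strip(".,!?;:") in words)

    return max(candidates, key=score)
```

With so few words the guess is only reliable on text that contains several of them; short or non-Russian input falls back to the first candidate.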

Should I send a patch and continue my work, or is this already
implemented? Please e-mail me at
murzik_007@sertolovo.ru

Discussion

  • Nobody/Anonymous

    Logged In: NO

    My work is now complete. The TODOs have been done as well.

     
  • Nobody/Anonymous

    Logged In: NO

    My work is now complete, including these TODOs.

     
