Hello dv___ ther you have some ISO-8859-2 (latin2 - east europe) links.
I'll be very glad to see it in n++ :)
It's very helpfull for mostly polish (slavonic) user's
After few minutes of hard work :-) Add this to ConvertExt.enc
Czech, Slovaks and others have to add their own accents, I made only Polish national characters.
GROUP "ISO-8859-2"
TABLE "WINDOWS-1250"
{
// Ą Ć Ę Ł Ń Ó Ś Ź Ż ą ć ę ł ń ó ś ź ż
a5 c6 ca a3 d1 d3 8c 8f af b9 e6 ea b3 f1 f3 9c 9f bf
}
TABLE "ISO-8859-2"
{
// Ą Ć Ę Ł Ń Ó Ś Ź Ż ą ć ę ł ń ó ś ź ż
a1 c6 ca a3 d1 d3 a6 ac af b1 e6 ea b3 f1 f3 b6 bc bf
}
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
...I forgot - of course you have to set proper code page in Windows. This is I don't like, but this is probably Scintilla is not internally Unicode?
To DV: Do you think this is possible to add event handler on file loading and (optionally) choose "View as..." automatically? As I remember SciTe parses 2 first lines of source code and looks for string like coding="ISO-8859-2".
artur
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Can somebody tell me what is ISO-8859-2? Is it some kind of additional encoding (codepage) such as DOS (OEM)? If it is, can somebody show me the differences between his standard (WIN) encoding and his ISO-8859-x? For example, hex codes of cyrillic letters which look like 'A' and 'B' are 0xC0 and 0xC2 in WIN (cp1251) encoding, but 0xB0 and 0xB2 in ISO-8869-5 encoding.
If I see your alphabet (including capital letters) in WIN (default) and ISO-8859-x encodings, it could be possible to encode your text files from ISO-8859-x to WIN and from WIN to ISO-8859-x.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2006-07-13
Windows uses internally the ANSI set of codepages, but web pages are frequently written in ISO-8859-x codepages (ISO-8859-1, ISO-8859-2, etc). Those two sets differ a bit in the positions of the "national" chars and other special characters.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
OK, I understand. I plan to add external encoding tables to a new version of my plugin, but your alphabet in ANSI and in ISO-8859-x is needed to build the encoding tables.
As an example, there are 3 encoding tables of the first 10 cyrillic letters (octal codes):
/* CP1251 - WINDOWS */
"\340\341\342\343\344\345\270\346\347\350"
/* CP866 - DOS */
"\240\241\242\243\244\245\361\246\247\250"
Each position of these tables represents the same cyrillic character in different encodings (codepages). I.e. cyrillic 'a' is 0340 (0xE0) in cp1251, 0240 (0xA0) in cp866 and 0301 (0xC1) in koi8-r. I prefer hex codes, by the way.
If you give me such encoding tables for your alphabet (for all characters of your alhabet, including capital letters), I'll try to include them into my plugin. Now I can't say how much time it will take, but sure it can be done.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
By the way, current version of ConvertExt plugin does support external encoding tables. If you are interested, download ConvertExt v1.1 RC1 - its archive contains detailed documentation. Note: currently only one-byted encodings are supported i.e. encodings where one character is one byte. Double-byted encodings are in the future plans (not developed yet).
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I am quite keen to get a reply on that: is going notepad++ implement ISO-8859-x in its new version? I really do not want to use two different programs! I want notepadd++ only!
Best regards
Seb
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi,
VERY nice editor, but what about ISO 8859-2 encoding options? As default, for new and opened files?...
without that option useless for me :( have to stay with edit plus 2 so far...
rgds,m.
This one bugged me too, but also forced to switching to utf-8 which I see now as a very good decision ;)
Without ISO-8859-x it's rather useless in Central end West Europe :/
It's a standard codepage in Poland !
Please, it's only reason that I need other editors ;(
It will be big plus for Notepad++
+1 !
(I need ISO-8859-15)
Please !!!
Hello dv___ ther you have some ISO-8859-2 (latin2 - east europe) links.
I'll be very glad to see it in n++ :)
It's very helpfull for mostly polish (slavonic) user's
http://nl.ijs.si/gnusl/cee/iso8859-2.html
http://nl.ijs.si/gnusl/cee/charset.html
http://czyborra.com/charsets/iso8859.html#ISO-8859-2
http://www.langbox.com/codeset/iso8859-2.html
http://biega.com/ISO-8859-2.gif
After few minutes of hard work :-) Add this to ConvertExt.enc
Czech, Slovaks and others have to add their own accents, I made only Polish national characters.
GROUP "ISO-8859-2"
TABLE "WINDOWS-1250"
{
// Ą Ć Ę Ł Ń Ó Ś Ź Ż ą ć ę ł ń ó ś ź ż
a5 c6 ca a3 d1 d3 8c 8f af b9 e6 ea b3 f1 f3 9c 9f bf
}
TABLE "ISO-8859-2"
{
// Ą Ć Ę Ł Ń Ó Ś Ź Ż ą ć ę ł ń ó ś ź ż
a1 c6 ca a3 d1 d3 a6 ac af b1 e6 ea b3 f1 f3 b6 bc bf
}
Works absolutely great! Thank you very much.
After restarting, uder PlugIn menu, you have got what you (and myself) need.
...I forgot - of course you have to set proper code page in Windows. This is I don't like, but this is probably Scintilla is not internally Unicode?
To DV: Do you think this is possible to add event handler on file loading and (optionally) choose "View as..." automatically? As I remember SciTe parses 2 first lines of source code and looks for string like coding="ISO-8859-2".
artur
Can somebody tell me what is ISO-8859-2? Is it some kind of additional encoding (codepage) such as DOS (OEM)? If it is, can somebody show me the differences between his standard (WIN) encoding and his ISO-8859-x? For example, hex codes of cyrillic letters which look like 'A' and 'B' are 0xC0 and 0xC2 in WIN (cp1251) encoding, but 0xB0 and 0xB2 in ISO-8869-5 encoding.
If I see your alphabet (including capital letters) in WIN (default) and ISO-8859-x encodings, it could be possible to encode your text files from ISO-8859-x to WIN and from WIN to ISO-8859-x.
Windows uses internally the ANSI set of codepages, but web pages are frequently written in ISO-8859-x codepages (ISO-8859-1, ISO-8859-2, etc). Those two sets differ a bit in the positions of the "national" chars and other special characters.
OK, I understand. I plan to add external encoding tables to a new version of my plugin, but your alphabet in ANSI and in ISO-8859-x is needed to build the encoding tables.
As an example, there are 3 encoding tables of the first 10 cyrillic letters (octal codes):
/* CP1251 - WINDOWS */
"\340\341\342\343\344\345\270\346\347\350"
/* CP866 - DOS */
"\240\241\242\243\244\245\361\246\247\250"
/* KOI8_R */
"\301\302\327\307\304\305\243\326\332\311"
Each position of these tables represents the same cyrillic character in different encodings (codepages). I.e. cyrillic 'a' is 0340 (0xE0) in cp1251, 0240 (0xA0) in cp866 and 0301 (0xC1) in koi8-r. I prefer hex codes, by the way.
If you give me such encoding tables for your alphabet (for all characters of your alhabet, including capital letters), I'll try to include them into my plugin. Now I can't say how much time it will take, but sure it can be done.
This is function for PHP:
function plCharset($string) {
return strtr($string, "\xa5\x8c\x8f\xb9\x9c\x9f", "\xa1\xa6\xac\xb1\xb6\xbc");
}
--
KJ
By the way, current version of ConvertExt plugin does support external encoding tables. If you are interested, download ConvertExt v1.1 RC1 - its archive contains detailed documentation. Note: currently only one-byted encodings are supported i.e. encodings where one character is one byte. Double-byted encodings are in the future plans (not developed yet).
Hi there!
I am quite keen to get a reply on that: is going notepad++ implement ISO-8859-x in its new version? I really do not want to use two different programs! I want notepadd++ only!
Best regards
Seb
Don't you like implementation of this feature as a plugin? You can describe your CP125x and ISO-8859-x in this file: "ConvertExt.enc".