Menu

Nice editor, but where ISO-8859-2 encoding?!

2006-05-16
2012-11-13
  • Nobody/Anonymous

    Hi,

    VERY nice editor, but what about ISO 8859-2 encoding options? As default, for new and opened files?...

    without that option useless for me :( have to stay with edit plus 2 so far...

    rgds,m.

     
    • Anonymous

      Anonymous - 2006-05-18

      This one bugged me too, but also forced to switching to utf-8 which I see now as a very good decision ;)

       
      • Nobody/Anonymous

        Without ISO-8859-x it's rather useless in Central end West Europe :/

         
        • Nobody/Anonymous

          It's a standard codepage in Poland !
          Please, it's only reason that I need other editors ;(

           
          • Nobody/Anonymous

            It will be big plus for Notepad++

             
            • Nobody/Anonymous

              +1 !

              (I need ISO-8859-15)

              Please !!!

               
        • Sławowid Wojsławicki

          Hello dv___ ther you have some ISO-8859-2 (latin2 - east europe) links.
          I'll be very glad to see it in n++ :)
          It's very helpfull for mostly polish (slavonic) user's

          http://nl.ijs.si/gnusl/cee/iso8859-2.html
          http://nl.ijs.si/gnusl/cee/charset.html
          http://czyborra.com/charsets/iso8859.html#ISO-8859-2
          http://www.langbox.com/codeset/iso8859-2.html
          http://biega.com/ISO-8859-2.gif

           
          • Nobody/Anonymous

            After few minutes of hard work :-) Add this to ConvertExt.enc
            Czech, Slovaks and others have to add their own accents, I made only Polish national characters.

            GROUP "ISO-8859-2"

                TABLE "WINDOWS-1250"
                {
                //    Ą  Ć  Ę  Ł  Ń  Ó  Ś  Ź  Ż  ą  ć  ę  ł  ń  ó  ś  ź  ż
                    a5 c6 ca a3 d1 d3 8c 8f af b9 e6 ea b3 f1 f3 9c 9f bf
                }

                TABLE "ISO-8859-2"
                {
                //    Ą  Ć  Ę  Ł  Ń  Ó  Ś  Ź  Ż  ą  ć  ę  ł  ń  ó  ś  ź  ż
                    a1 c6 ca a3 d1 d3 a6 ac af b1 e6 ea b3 f1 f3 b6 bc bf
                }

             
            • Nobody/Anonymous

              Works absolutely great! Thank you very much.
              After restarting, uder PlugIn menu, you have got what you (and myself) need.

               
            • Nobody/Anonymous

              ...I forgot - of course you have to set proper code page in Windows. This is I don't like, but this is probably Scintilla is not internally Unicode?
              To DV: Do you think this is possible to add event handler on file loading and (optionally) choose "View as..." automatically? As I remember SciTe parses 2 first lines of source code and looks for string like coding="ISO-8859-2".

              artur

               
    • DV

      DV - 2006-07-10

      Can somebody tell me what is ISO-8859-2? Is it some kind of additional encoding (codepage) such as DOS (OEM)? If it is, can somebody show me the differences between his standard (WIN) encoding and his ISO-8859-x? For example, hex codes of cyrillic letters which look like 'A' and 'B' are 0xC0 and 0xC2 in WIN (cp1251) encoding, but 0xB0 and 0xB2 in ISO-8869-5 encoding.
      If I see your alphabet (including capital letters) in WIN (default) and ISO-8859-x encodings, it could be possible to encode your text files from ISO-8859-x to WIN and from WIN to ISO-8859-x.

       
    • Anonymous

      Anonymous - 2006-07-13

      Windows uses internally the ANSI set of codepages, but web pages are frequently written in ISO-8859-x codepages (ISO-8859-1, ISO-8859-2, etc). Those two sets differ a bit in the positions of the "national" chars and other special characters.

       
    • DV

      DV - 2006-07-13

      OK, I understand. I plan to add external encoding tables to a new version of my plugin, but your alphabet in ANSI and in ISO-8859-x is needed to build the encoding tables.
      As an example, there are 3 encoding tables of the first 10 cyrillic letters (octal codes):

      /* CP1251 - WINDOWS */
      "\340\341\342\343\344\345\270\346\347\350"

      /* CP866 - DOS */
      "\240\241\242\243\244\245\361\246\247\250"

      /* KOI8_R */
      "\301\302\327\307\304\305\243\326\332\311"

      Each position of these tables represents the same cyrillic character in different encodings (codepages). I.e. cyrillic 'a' is 0340 (0xE0) in cp1251, 0240 (0xA0) in cp866 and 0301 (0xC1) in koi8-r. I prefer hex codes, by the way.

      If you give me such encoding tables for your alphabet (for all characters of your alhabet, including capital letters), I'll try to include them into my plugin. Now I can't say how much time it will take, but sure it can be done.

       
      • Nobody/Anonymous

        This is function for PHP:

        function plCharset($string) {
          return strtr($string, "\xa5\x8c\x8f\xb9\x9c\x9f", "\xa1\xa6\xac\xb1\xb6\xbc");
        }

        --
        KJ

         
    • DV

      DV - 2006-12-11

      By the way, current version of ConvertExt plugin does support external encoding tables. If you are interested, download ConvertExt v1.1 RC1 - its archive contains detailed documentation. Note: currently only one-byted encodings are supported i.e. encodings where one character is one byte. Double-byted encodings are in the future plans (not developed yet).

       
    • Nobody/Anonymous

      Hi there!

      I am quite keen to get a reply on that: is going notepad++ implement ISO-8859-x in its new version? I really do not want to use two different programs! I want notepadd++ only!

      Best regards
      Seb

       
    • DV

      DV - 2006-12-18

      Don't you like implementation of this feature as a plugin? You can describe your CP125x and ISO-8859-x in this file: "ConvertExt.enc".

       
MongoDB Logo MongoDB