compatibility characters among East Asian logographs

TEI produces the TEI Guidelines and associated software

Brought to you by: bleekere, ebeshero, hcayless, heb, and 7 others

#736 compatibility characters among East Asian logographs

Milestone: AMBER

Status: closed-fixed

Owner: Stefan Majewski

Labels: None

Priority: 5(default)

Updated: 2015-05-30

Created: 2015-02-18

Creator: Jens Østergaard Petersen

Private: No

CH-LanguagesCharacterSets.xml
/div/div[2]/div[7]/div[2]/p[1]

Together with essentially analogous issues arising from the encoding of certain East Asian ideographs, this
=>
This

I think Christian must have thinking of compatibility characters among East Asian logographs. Chinese characters do use base character/combining marks, but many compatibility characters occur among CJK characters.

As a consequence, leave out:

It should be noted however, that normalization as discussed in the documents above does not cover the problems mentioned above with East-Asian characters, except for issues connected with composed characters in Hangul.

Discussion

Hugh A. Cayless - 2015-03-12

assigned_to: James Cummings
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Hugh A. Cayless - 2015-03-12

Assigning to James for review.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

James Cummings - 2015-05-28

I believe the suggested edits are reasonable and if no one from council objects will implement.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jens Østergaard Petersen - 2015-05-28

I will ask Wittern what he meant.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jens Østergaard Petersen - 2015-05-28

I now see a fatal typo in my posting: "Chinese characters do use base character/combining marks" should be "Chinese characters do not use base character/combining marks".

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jens Østergaard Petersen - 2015-05-28

I have asked Christian Wittern and he writes to me that "At this point in the document, this is still essentially a general statement without reference to a specific Unicode recommendation, which is coming later. This might not technically be covered by the current iteration of the Unicode Normalization, but this does not mean that one should not think of this issue, which is why I put it in." So here he is (as I read him) talking about normalization in a broader sense, something that ideally should also (or could eventually) concern compatibility graphs.

About the passage I argue should be deleted, he writes: "the whole point of this and the previous thing is to remind you that you can't completely rely on Unicode rules (which might change while you are not looking), but should set your one [scil. own] rules for the East-Asian characters. And when I wrote this more than 10 years ago, nobody would have thought what a mess would come of all of this... " While this is true, I don't think there have been any changes in the Unicode rules in this area during the last 10 years.

Wittern ends up with: "But like secular law, the book is changed by those in power at any given moment, so in consultation with the council, do what you think needs to be done..." …

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Syd Bauman - 2015-05-30

Council sub-group saw no reason not to implement this change, and did so. SM fixed in r13233.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Syd Bauman - 2015-05-30

status: open --> closed-fixed

assigned_to: James Cummings --> Stefan Majewski
If you would like to refer to this comment somewhere else in this project, copy and paste the following link: