This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: [PATCH] BZ #19575: Clarify status of entries in GB 18030-2005.


On 02/10/2016 09:14 AM, Andreas Schwab wrote:
> "Carlos O'Donell" <carlos@redhat.com> writes:
> 
>> On 02/10/2016 04:15 AM, Andreas Schwab wrote:
>>> "Carlos O'Donell" <carlos@redhat.com> writes:
>>>
>>>> This statement is only partly correct. Some of the mappings were updated
>                                            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
>>>> but 24 mappings for PUA code points still remained.
>>>
>>> What are the updated mappings apart from the 24 being left?
>>
>> Sorry, I don't quite understand the question.
>>
>> Could you please clarify exactly what you would like to know?
> 
> Which are those updated mappings?

So you would like to know which mappings changed between GB 18030-2000
and GB 18030-2005? I don't have such a list. In "CJKV Information Processing"
it is noted that there are 2 major areas of revision for 2000 -> 2005:

* Acknowledgment of CJK Unified Ideographs Extension B --- 42,711 hanzi

* Acknowledgment of the six regional scripts: Korean, Mongolian, Tai Le, Tibetan, Uyghur, and Yi.

So it supports all 42,711 hanzi characters, and the six scripts (all 4-byte
regions). There are also 4 pictoral glyph corrections.

May I ask why such a list of updated mappings is relevant here?

The only important thing here is that with those 24 PUA mappings made
into non-PUA equivalents the *entire* GB 18030-2005 can be represented
in Unicode without the use of PUA code points. Which is great because
it means normal unmodified programs can process and represent those
characters correctly.

In summary:
- glibc support GB 18030-2005.
- glibc modifies GB 18030-2005 to use 24 non-PUA code points and make
  the implementation fully use Unicode only.
- My comments are there to indicate the modifications for non-PUA code
  points (which deviate from the standard).

Cheers,
Carlos.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]