This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


[Bug locale/22898] Some Chinese characters cannot be sorted by adding sorting rules to LC_COLLATE


https://sourceware.org/bugzilla/show_bug.cgi?id=22898

Carlos O'Donell <carlos at redhat dot com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |carlos at redhat dot com

--- Comment #2 from Carlos O'Donell <carlos at redhat dot com> ---
(In reply to Mike FABIAN from comment #0)
> Created attachment 10854 [details]
> 0001-Test-patch-to-show-that-some-Chinese-characters-cann.patch
> 
> Some Chinese characters cannot be sorted by adding collation rules to
> LC_COLLATE.
> 
> For example:
> 
> 𫡅 U+2B845
> 
> cannot be sorted but
> 
> 𠮞 U+20B9E
> 
> can be sorted.
> 
> The attached patch demonstrates this problem.

While working on C.UTF-8 I've found at least three more instances like this.
Something is wrong either with the parser or with the input the parser
expects. I will have to debug this together with the other CJK symbol failures
I've seen when expanding C.UTF-8 to the full code point set.
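
For anyone who wants to check how the two code points from comment #0 collate
on their own system, a minimal sketch follows. It only wraps the standard
locale.strcoll call; the names UNSORTABLE, SORTABLE and collation_sign are
hypothetical helpers made up for this illustration, and it is not the test from
the attached patch. The locale to test is left to the caller, since the report
does not say which one was modified.

```python
import locale

# The two code points named in the report, written as escapes so the
# source stays ASCII-only.
UNSORTABLE = "\U0002B845"  # reported as not sortable
SORTABLE = "\U00020B9E"    # reported as sortable


def collation_sign(a, b, locale_name=None):
    """Return -1, 0, or 1 for how `a` collates relative to `b`.

    If `locale_name` is given, LC_COLLATE is switched to it first;
    locale.setlocale raises locale.Error when that locale is not installed.
    """
    if locale_name is not None:
        locale.setlocale(locale.LC_COLLATE, locale_name)
    result = locale.strcoll(a, b)
    return (result > 0) - (result < 0)


if __name__ == "__main__":
    # Compare each reported character against a plain ASCII letter
    # under whatever LC_COLLATE is currently active.
    for ch in (UNSORTABLE, SORTABLE):
        print(f"U+{ord(ch):04X}", collation_sign(ch, "a"))
```

Passing the name of a locale whose LC_COLLATE carries the added rules as the
third argument should show whether the two characters are ordered as those
rules intend.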

