This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug localedata/15616] New: Characters outside the Pinyin collation are incorrectly sorted


http://sourceware.org/bugzilla/show_bug.cgi?id=15616

            Bug ID: 15616
           Summary: Characters outside the Pinyin collation are
                    incorrectly sorted
           Product: glibc
           Version: 2.15
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: mcaropreso@yammer-inc.com
                CC: libc-locales at sourceware dot org

Created attachment 7077
  --> http://sourceware.org/bugzilla/attachment.cgi?id=7077&action=edit
Test file

If you set LC_COLLATE to zh_CN and sort the attached file using:

sort testdata

you can see that characters not included into the pinyin collation
(/usr/share/i18n/locales/iso14651_t1_pinyin) are not sorted (they are left in
place)
For example, in the testfile we have ã","ãä" and "ãä" and even if they start
with the same character they are not sorted together.

It would be useful to sort those characters according to their codepoint in
Unicode. I tried to put an UNDEFINED rule at the end of the iso14651_t1_pinyin
collation but it didn't fix the problem.

Mario

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]