This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug localedata/19919] New: [PATCH] Correct the Malayalam sorting order of 0D36 and 0D37


https://sourceware.org/bugzilla/show_bug.cgi?id=19919

            Bug ID: 19919
           Summary: [PATCH] Correct the Malayalam sorting order of 0D36
                    and 0D37
           Product: glibc
           Version: 2.25
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: santhosh.thottingal at gmail dot com
                CC: libc-locales at sourceware dot org
  Target Milestone: ---

Created attachment 9161
  --> https://sourceware.org/bugzilla/attachment.cgi?id=9161&action=edit
Patch to correct the sorting order of 0D36 and 0D37

The Malayalam characters à(U+0D36) and à(U+0D37) should be sorted just like the
order of they unicode code points. In the master version of glibc, the order is
swapped.

It sorts à(U+0D36) after à(U+0D37) which is a bug in the patch I submitted long
time back(Bug 12541)

$ LANG=ml_IN.UTF-8 sort ~/sort.txt
à
à
à
à
à

This is wrong.

With the attached patch, I will get and it is correct.

$ LANG=ml_IN.UTF-8 sort ~/sort.txt
à
à
à
à
à

The mistake was probably happened because of confusing naming of Unicode
characters that have very similar pronunciation.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]