This is the mail archive of the libc-locales@sourceware.org mailing list for the GNU libc locales project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug localedata/17750] New: wrong collation order of diacritics in most locales


https://sourceware.org/bugzilla/show_bug.cgi?id=17750

            Bug ID: 17750
           Summary: wrong collation order of diacritics in most locales
           Product: glibc
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: aoliva at sourceware dot org
                CC: libc-locales at sourceware dot org

http://www.unicode.org/reports/tr10/tr10-30.html states:

<quote>
Normally, all differences in sorting are assessed from the start to the end of
the string. If all of the base letters are the same, the first accent
difference determines the final order. In row 1 of Table 5, the first accent
difference is on the o, so that is what determines the order. In some French
dictionary ordering traditions, however, it is the last accent difference that
determines the order, as shown in row 2.
</quote>

Table 5 says:

<pre>
Normal Accent Ordering      cote < cotà < cÃte < cÃtÃ
Backward Accent Ordering     cote < cÃte < cotà < cÃtÃ
</pre>

However, glibc implements backward accent ordering for all locales except de_DE
and lb_LU.  

Unicode CLDR 26 confirms this is wrong: the only file in
http://unicode.org/cldr/trac/browser/tags/release-26/common/collation/ that has
settings backwards="on" is fr_CA.xml.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]