This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug localedata/21547] Tibetan script collation broken (Dzongkha and Tibetan)
- From: "maiku.fabian at gmail dot com" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sourceware dot org
- Date: Mon, 15 Jan 2018 10:25:39 +0000
- Subject: [Bug localedata/21547] Tibetan script collation broken (Dzongkha and Tibetan)
- Auto-submitted: auto-generated
- References: <bug-21547-131@http.sourceware.org/bugzilla/>
https://sourceware.org/bugzilla/show_bug.cgi?id=21547
--- Comment #4 from Mike FABIAN <maiku.fabian at gmail dot com> ---
I wonder whether there isn’t a contradiction in your rules.
https://github.com/eroux/tibetan-collation/blob/master/implementations/Unicode/rules.txt#L7
contains:
&གཉ<གཉྫ
so གཉ comes *before* གཉྫ as a primary difference.
But then
https://github.com/eroux/tibetan-collation/blob/master/implementations/Unicode/rules.txt#L30
contains:
&ཉ<<ྋྙ<གཉ<མཉ<རྙ=ཪྙ<སྙ<བརྙ=བཪྙ<བསྙ
And this causes གཉ to be sorted *after* གཉྫ.
(I tested this with ICU 57.1, using it via Python 3.)
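
To make the suspected conflict easier to see, here is a rough toy model in
Python. It is not ICU and does not use ICU's API: it only assumes that each
tailored string occupies a single position, and that a later rule which
mentions an already placed string moves it. The helper name apply_rule and
the starting order (གཉ before ཉ, as an untailored stand-in) are assumptions
made for illustration.

```python
import re

# The two rules quoted above, copied verbatim from rules.txt (lines 7 and 30).
RULES = [
    "&གཉ<གཉྫ",
    "&ཉ<<ྋྙ<གཉ<མཉ<རྙ=ཪྙ<སྙ<བརྙ=བཪྙ<བསྙ",
]

# Take the two strings of interest from the first rule itself,
# so that they match the rule text code point for code point.
GA_NYA, GA_NYA_DZA = RULES[0].lstrip("&").split("<")


def apply_rule(order, rule):
    """Toy model: place each item of `rule` right after the previous one.

    Assumption: an item that is already in `order` is moved, i.e. the
    later rule wins. Difference strengths (<, <<, =) are ignored here;
    only the resulting relative position is tracked.
    """
    anchor, *rest = re.split(r"(<<|<|=)", rule.lstrip("&"))
    if anchor not in order:
        order.append(anchor)
    prev = anchor
    for item in rest[1::2]:  # odd positions hold the strings, even ones the operators
        if item in order:
            order.remove(item)
        order.insert(order.index(prev) + 1, item)
        prev = item
    return order


if __name__ == "__main__":
    # Assumed untailored starting point: the sequence ག+ཉ sorts before ཉ.
    order = [GA_NYA, RULES[1].lstrip("&").split("<<")[0]]
    for rule in RULES:
        order = apply_rule(order, rule)
        print(rule, "->", order.index(GA_NYA) < order.index(GA_NYA_DZA))
```

In this model the first rule leaves གཉ before གཉྫ, and the second rule then
moves གཉ behind ཉ while གཉྫ stays where it was, which matches the behaviour
described above. Whether ICU arrives at that order for the same reason is
not confirmed by this sketch.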