This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

[Bug localedata/21547] Tibetan script collation broken (Dzongkha and Tibetan)

From: "maiku.fabian at gmail dot com" <sourceware-bugzilla at sourceware dot org>
To: glibc-bugs at sourceware dot org
Date: Mon, 15 Jan 2018 10:25:39 +0000
Subject: [Bug localedata/21547] Tibetan script collation broken (Dzongkha and Tibetan)
Auto-submitted: auto-generated
References: <bug-21547-131@http.sourceware.org/bugzilla/>

https://sourceware.org/bugzilla/show_bug.cgi?id=21547

--- Comment #4 from Mike FABIAN <maiku.fabian at gmail dot com> ---
I wonder whether there isn’t a contradicton in your rules.

https://github.com/eroux/tibetan-collation/blob/master/implementations/Unicode/rules.txt#L7

contains:

&གཉ<གཉྫ

so གཉ comes *before* གཉྫ as a primary difference.

But then 

https://github.com/eroux/tibetan-collation/blob/master/implementations/Unicode/rules.txt#L30

contains:

&ཉ<<ྋྙ<གཉ<མཉ<རྙ=ཪྙ<སྙ<བརྙ=བཪྙ<བསྙ

And this causes གཉ to be sorted *after* གཉྫ.

(I tested this with icu 57.1 using it via Python3.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]