This is the mail archive of the
libc-locales@sourceware.org
mailing list for the GNU libc locales project.
[Bug localedata/17750] wrong collation order of diacritics in most locales
- From: "keld at keldix dot com" <sourceware-bugzilla at sourceware dot org>
- To: libc-locales at sourceware dot org
- Date: Thu, 30 Nov 2017 07:40:18 +0000
- Subject: [Bug localedata/17750] wrong collation order of diacritics in most locales
- Auto-submitted: auto-generated
- References: <bug-17750-716@http.sourceware.org/bugzilla/>
https://sourceware.org/bugzilla/show_bug.cgi?id=17750
--- Comment #22 from keld at keldix dot com <keld at keldix dot com> ---
On Wed, Nov 29, 2017 at 07:49:36PM +0000, fweimer at redhat dot com wrote:
> https://sourceware.org/bugzilla/show_bug.cgi?id=17750
>
> --- Comment #19 from Florian Weimer <fweimer at redhat dot com> ---
> (In reply to Egmont Koblinger from comment #18)
>
> > By the way, don't these language have some "official" collation rules, or at
> > least some established common practice?
>
> I expect that many languages/scripts have multiple collation rules, depending
> on use, particularly when it comes to sorting foreign languages using the same
> base script.
That is not my experience. For Danish (my language) there is only one standard,
and that
takes care of many foreign characters. Then there is a spec from Danish
Standard
that is more elaborate, in the form of a POSIX/Linux locale, covering all of
ISO 10646/Unicode,
that builds on ISO 14651, with the backwards diacrit spec. For German, I know
there are 2 sorting
specs, one where ä, ö and ü etc are considered accented versions of a, o and u,
and one
where they are interpreted as ae oe and ue. There are sorting standards for all
of these
languages, that are well adhered to in the market place.
best regards
keld
--
You are receiving this mail because:
You are on the CC list for the bug.