This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [Patch v3 1/14] [BZ #14095] update collation data from Unicode / ISO 14651
- From: Carlos O'Donell <carlos at redhat dot com>
- To: Mike FABIAN <mfabian at redhat dot com>, libc-alpha at sourceware dot org
- Cc: "Dmitry V. Levin" <ldv at altlinux dot org>
- Date: Fri, 23 Feb 2018 21:54:22 -0800
- Subject: Re: [Patch v3 1/14] [BZ #14095] update collation data from Unicode / ISO 14651
- Authentication-results: sourceware.org; auth=none
- References: <s9dh8q8dnk3.fsf@taka.site>
On 02/23/2018 02:10 AM, Mike FABIAN wrote:
> From 8a78b549bd634e4e4a53b082ab2b90477cdb6290 Mon Sep 17 00:00:00 2001
> From: Mike FABIAN <mfabian@redhat.com>
> Date: Tue, 30 Jan 2018 17:59:00 +0100
> Subject: [PATCH 01/14] Update iso14651_t1_common file to
> ISO14651_2016_TABLE1_en.txt [BZ #14095]
> MIME-Version: 1.0
> Content-Type: text/plain; charset=UTF-8
> Content-Transfer-Encoding: 8bit
>
> [BZ #14095] - Review / update collation data from Unicode / ISO 14651
>
> File downloaded from:
> http://standards.iso.org/iso-iec/14651/ed-4/ISO14651_2016_TABLE1_en.txt
>
> Updating this file alone is not enough, there are problems in the new
> file which need to be fixed and the collation rules for many locales
> need to be adapted. This is done by the following patches.
>
> This update also fixes the problem that many characters are treated as
> identical when sorting because they were not yet in the old
> iso14651_t1_common file, see:
>
> https://bugzilla.redhat.com/show_bug.cgi?id=1336308
> - Infinite (∞) and empty set (∅) are treated as if they were the same character by sort and uniq
>
> [BZ #14095]
> * localedata/locales/iso14651_t1_common: Update file to
> latest version from ISO (ISO14651_2016_TABLE1_en.txt).
> ---
> localedata/locales/iso14651_t1_common | 62065 +++++++++++++++++++++++++++-----
> 1 file changed, 52571 insertions(+), 9494 deletions(-)
LGTM.
This data looks good, generated from Unicode 9, using the latest
ISO14651_2016_TABLE1_en.txt file.
Nice to see this fix swbz#14095!
Signed-off-by: Carlos O'Donell <carlos@redhat.com>
--
Cheers,
Carlos.