This is the mail archive of the
libc-locales@sourceware.org
mailing list for the GNU libc locales project.
[Bug localedata/22070] charmaps/UTF-8: wcwidth for Prepended_Concatenation_Mark codepoints set to 0 (should be 1)
- From: "maiku.fabian at gmail dot com" <sourceware-bugzilla at sourceware dot org>
- To: libc-locales at sourceware dot org
- Date: Mon, 04 Sep 2017 14:44:09 +0000
- Subject: [Bug localedata/22070] charmaps/UTF-8: wcwidth for Prepended_Concatenation_Mark codepoints set to 0 (should be 1)
- Auto-submitted: auto-generated
- References: <bug-22070-716@http.sourceware.org/bugzilla/>
https://sourceware.org/bugzilla/show_bug.cgi?id=22070
--- Comment #1 from Mike FABIAN <maiku.fabian at gmail dot com> ---
(In reply to Mike Frysinger from comment #0)
> we currently mark all Cf (Format Character) as width 0, but this ignores
> Prepended_Concatenation_Mark codepoints. specifically these should all have
> a wcwidth of 1:
> 0600..0605 ; Prepended_Concatenation_Mark # Cf ARABIC NUMBER SIGN..ARABIC
> NUMBER MARK ABOVE
> 06DD ; Prepended_Concatenation_Mark # Cf ARABIC END OF AYAH
> 070F ; Prepended_Concatenation_Mark # Cf SYRIAC ABBREVIATION MARK
> 08E2 ; Prepended_Concatenation_Mark # Cf ARABIC DISPUTED END OF AYAH
> 110BD ; Prepended_Concatenation_Mark # Cf KAITHI NUMBER SIGN
This list is from
ftp://ftp.unicode.org/Public/10.0.0/ucd/PropList.txt
So maybe I should add that file as well to glibc/localedata/unicode-gen/
and parse it in glibc/localedata/unicode-gen/utf8_gen.py ?
(If this list never changes I could also hardcode it in utf8_gen.py).
--
You are receiving this mail because:
You are on the CC list for the bug.