This is the mail archive of the libc-locales@sourceware.org mailing list for the GNU libc locales project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug localedata/22070] charmaps/UTF-8: wcwidth for Prepended_Concatenation_Mark codepoints set to 0 (should be 1)


https://sourceware.org/bugzilla/show_bug.cgi?id=22070

--- Comment #1 from Mike FABIAN <maiku.fabian at gmail dot com> ---
(In reply to Mike Frysinger from comment #0)
> we currently mark all Cf (Format Character) as width 0, but this ignores
> Prepended_Concatenation_Mark codepoints.  specifically these should all have
> a wcwidth of 1:
> 0600..0605 ; Prepended_Concatenation_Mark # Cf  ARABIC NUMBER SIGN..ARABIC
> NUMBER MARK ABOVE
> 06DD       ; Prepended_Concatenation_Mark # Cf  ARABIC END OF AYAH
> 070F       ; Prepended_Concatenation_Mark # Cf  SYRIAC ABBREVIATION MARK
> 08E2       ; Prepended_Concatenation_Mark # Cf  ARABIC DISPUTED END OF AYAH
> 110BD      ; Prepended_Concatenation_Mark # Cf  KAITHI NUMBER SIGN

This list is from

ftp://ftp.unicode.org/Public/10.0.0/ucd/PropList.txt

So maybe I should add that file as well to glibc/localedata/unicode-gen/
and parse it in glibc/localedata/unicode-gen/utf8_gen.py ?

(If this list never changes I could also hardcode it in utf8_gen.py).

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]