This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug localedata/22073] New: charmaps/UTF-8: wcwidth of U+00AD: 0 or 1 ?
- From: "vapier at gentoo dot org" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sourceware dot org
- Date: Sun, 03 Sep 2017 20:42:22 +0000
- Subject: [Bug localedata/22073] New: charmaps/UTF-8: wcwidth of U+00AD: 0 or 1 ?
- Auto-submitted: auto-generated
https://sourceware.org/bugzilla/show_bug.cgi?id=22073
Bug ID: 22073
Summary: charmaps/UTF-8: wcwidth of U+00AD: 0 or 1 ?
Product: glibc
Version: 2.26
Status: NEW
Severity: normal
Priority: P2
Component: localedata
Assignee: unassigned at sourceware dot org
Reporter: vapier at gentoo dot org
CC: egmont at gmail dot com, libc-locales at sourceware dot org,
maiku.fabian at gmail dot com, tg at mirbsd dot de
Depends on: 21750
Target Milestone: ---
+++ This bug was initially created as a clone of Bug #21750 +++
I’ve compared the new autogenerated column width from
localedata/unicode-gen/utf8_gen.py with the results of the classical wcwidth()
implementation from xterm (adjusted to Unicode 10.0.0) and found a few
divergences (and bugs on my (MirBSD, which uses something based on xterm’s data
system-wide) side, which I fixed).
U+00AD is forced to width 1 in xterm, autodetected as combining in glibc
Rationale for forcing it to 1 is likely that U+0000‥U+00FF are latin1, which,
when displayed as 8bit on terminals, had no combining characters at all.
Change Request to glibc: force U+00AD to width 1.
more background discussion with different standards can be found here:
https://www.cs.tut.fi/~jkorpela/shy.html
Referenced Bugs:
https://sourceware.org/bugzilla/show_bug.cgi?id=21750
[Bug 21750] column width of characters incompatible with classical wcwidth
--
You are receiving this mail because:
You are on the CC list for the bug.