This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug libc/18384] New: iswalnum(0x345) incorrectly return 1


https://sourceware.org/bugzilla/show_bug.cgi?id=18384

            Bug ID: 18384
           Summary: iswalnum(0x345) incorrectly return 1
           Product: glibc
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: libc
          Assignee: unassigned at sourceware dot org
          Reporter: Michael1972 at mail dot ru
                CC: drepper.fsp at gmail dot com
  Target Milestone: ---

Created attachment 8303
  --> https://sourceware.org/bugzilla/attachment.cgi?id=8303&action=edit
Screenshot

In Greek traditional text analysis, the iota subscript (U+0345) is often talked
about as a "diacritic", see:

"http://en.wikipedia.org/wiki/Iota_subscript";

In the recourse: http://www.unicode.org/ucd/ 
I found two documents:
1. http://www.unicode.org/Public/UCD/latest/charts/CodeCharts.pdf
2. http://www.unicode.org/Public/UCD/latest/ucd/PropList.txt
In the first document in the section: "Combining Diacritical Marks" I found
this character in the table. See at the attached screenshot.
In the second document in line 752 we have:
0300..034E    ; Diacritic # Mn  [79] COMBINING GRAVE ACCENT..COMBINING UPWARDS
ARROW BELOW

As a result, I think, it is pretty clear that iswalnum(0x345) should not return
1, but rather 0.

-- 
You are receiving this mail because:
You are on the CC list for the bug.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]