This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug libc/12830] New: ISO-2022-JP-2 maps C1 control characters incorrectly


http://sourceware.org/bugzilla/show_bug.cgi?id=12830

           Summary: ISO-2022-JP-2 maps C1 control characters incorrectly
           Product: glibc
           Version: 2.13
            Status: NEW
          Severity: normal
          Priority: P2
         Component: libc
        AssignedTo: drepper.fsp@gmail.com
        ReportedBy: glibcbugz@ghalkes.nl


In the ISO-2022-JP-2 converter, the C1 control codes (U0080-U009F) are encoded
as 1B 2E 41 1B 4E [00 - 1F] (i.e., load ISO-8859-1 in the G2 graphics set, use
single shift to set G2 and encode the byte [00 - 1F]). However, if I understand
the standard correctly, switching to the G2 set _only_ changes the mapping of
the 96 characters in the range 20-7F (or the 94 charaacters in the range 21-7E
if a smaller set is used). The control characters are unaffected. To access the
C1 control set, one should use 1B [40 - 5F]. This is actually done for the
encoding of the "single shift 2" control (U+008E) in the sequence above, which
is encoded as 1B 4E.

-- 
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]