This is the mail archive of the
libc-alpha@sources.redhat.com
mailing list for the glibc project.
wrong character names in charmaps
- From: Bruno Haible <bruno at clisp dot org>
- To: libc-alpha at sources dot redhat dot com
- Date: Fri, 18 Oct 2002 13:52:42 +0200 (CEST)
- Subject: wrong character names in charmaps
Hi,
Martin v. Löwis noted that the KOI8-T charmap contains a wrong character name.
Here is a patch to align the character names in the charmaps to the official
Unicode 3.x names. Yes I know the character names are only comments in the
charmaps, but if they start confusing people, it's better to fix them.
2002-10-18 Bruno Haible <bruno@clisp.org>
* charmaps/KOI8-T: Correct a typo.
* charmaps/CP1256: Use official Unicode character names.
* charmaps/EUC-JP: Likewise.
* charmaps/GBK: Likewise.
* charmaps/ISO-8859-11: Likewise.
* charmaps/KOI8-U: Likewise.
* charmaps/MAC-SAMI: Likewise.
* charmaps/TIS-620: Likewise.
* charmaps/CP949: Likewise. Use Hangul syllable names according to
Unicode 3.0 book, section 3.11.
* charmaps/EUC-KR: Likewise.
* charmaps/JOHAB: Likewise.
Part 1 of the patch is appended, part 2 is so big that it is at
http://www.haible.de/bruno/gnu/libc-charmaps-patch.bz2
--- localedata/charmaps/CP1256.bak 2000-07-03 16:39:42.000000000 +0200
+++ localedata/charmaps/CP1256 2002-10-17 01:07:25.000000000 +0200
@@ -166,7 +166,7 @@
<U0153> /x9c LATIN SMALL LIGATURE OE
<U200C> /x9d ZERO WIDTH NON-JOINER
<U200D> /x9e ZERO WIDTH JOINER
-<U06BA> /x9f ARABIC LETTER NOON
+<U06BA> /x9f ARABIC LETTER NOON GHUNNA
<U00A0> /xa0 NO-BREAK SPACE
<U060C> /xa1 ARABIC COMMA
<U00A2> /xa2 CENT SIGN
--- localedata/charmaps/EUC-JP.bak 2000-12-04 19:53:45.000000000 +0100
+++ localedata/charmaps/EUC-JP 2002-10-18 01:19:54.000000000 +0200
@@ -192,7 +192,7 @@
<UFF6C> /x8e/xac HALFWIDTH KATAKANA LETTER SMALL YA
<UFF6D> /x8e/xad HALFWIDTH KATAKANA LETTER SMALL YU
<UFF6E> /x8e/xae HALFWIDTH KATAKANA LETTER SMALL YO
-<UFF6F> /x8e/xaf HALFWIDTH KATAKANA LETTER SMALL TSU
+<UFF6F> /x8e/xaf HALFWIDTH KATAKANA LETTER SMALL TU
<UFF70> /x8e/xb0 HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
<UFF71> /x8e/xb1 HALFWIDTH KATAKANA LETTER A
<UFF72> /x8e/xb2 HALFWIDTH KATAKANA LETTER I
@@ -205,13 +205,13 @@
<UFF79> /x8e/xb9 HALFWIDTH KATAKANA LETTER KE
<UFF7A> /x8e/xba HALFWIDTH KATAKANA LETTER KO
<UFF7B> /x8e/xbb HALFWIDTH KATAKANA LETTER SA
-<UFF7C> /x8e/xbc HALFWIDTH KATAKANA LETTER SHI
+<UFF7C> /x8e/xbc HALFWIDTH KATAKANA LETTER SI
<UFF7D> /x8e/xbd HALFWIDTH KATAKANA LETTER SU
<UFF7E> /x8e/xbe HALFWIDTH KATAKANA LETTER SE
<UFF7F> /x8e/xbf HALFWIDTH KATAKANA LETTER SO
<UFF80> /x8e/xc0 HALFWIDTH KATAKANA LETTER TA
-<UFF81> /x8e/xc1 HALFWIDTH KATAKANA LETTER CHI
-<UFF82> /x8e/xc2 HALFWIDTH KATAKANA LETTER TSU
+<UFF81> /x8e/xc1 HALFWIDTH KATAKANA LETTER TI
+<UFF82> /x8e/xc2 HALFWIDTH KATAKANA LETTER TU
<UFF83> /x8e/xc3 HALFWIDTH KATAKANA LETTER TE
<UFF84> /x8e/xc4 HALFWIDTH KATAKANA LETTER TO
<UFF85> /x8e/xc5 HALFWIDTH KATAKANA LETTER NA
@@ -221,7 +221,7 @@
<UFF89> /x8e/xc9 HALFWIDTH KATAKANA LETTER NO
<UFF8A> /x8e/xca HALFWIDTH KATAKANA LETTER HA
<UFF8B> /x8e/xcb HALFWIDTH KATAKANA LETTER HI
-<UFF8C> /x8e/xcc HALFWIDTH KATAKANA LETTER FU
+<UFF8C> /x8e/xcc HALFWIDTH KATAKANA LETTER HU
<UFF8D> /x8e/xcd HALFWIDTH KATAKANA LETTER HE
<UFF8E> /x8e/xce HALFWIDTH KATAKANA LETTER HO
<UFF8F> /x8e/xcf HALFWIDTH KATAKANA LETTER MA
@@ -7196,7 +7196,7 @@
<U045C> /x8f/xa7/xfc CYRILLIC SMALL LETTER KJE
<U045E> /x8f/xa7/xfd CYRILLIC SMALL LETTER SHORT U
<U045F> /x8f/xa7/xfe CYRILLIC SMALL LETTER DZHE
-<U00C6> /x8f/xa9/xa1 LATIN CAPITAL LIGATURE AE
+<U00C6> /x8f/xa9/xa1 LATIN CAPITAL LETTER AE
<U0110> /x8f/xa9/xa2 LATIN CAPITAL LETTER D WITH STROKE
<U0126> /x8f/xa9/xa4 LATIN CAPITAL LETTER H WITH STROKE
<U0132> /x8f/xa9/xa6 LATIN CAPITAL LIGATURE IJ
@@ -7207,7 +7207,7 @@
<U0152> /x8f/xa9/xad LATIN CAPITAL LIGATURE OE
<U0166> /x8f/xa9/xaf LATIN CAPITAL LETTER T WITH STROKE
<U00DE> /x8f/xa9/xb0 LATIN CAPITAL LETTER THORN
-<U00E6> /x8f/xa9/xc1 LATIN SMALL LIGATURE AE
+<U00E6> /x8f/xa9/xc1 LATIN SMALL LETTER AE
<U0111> /x8f/xa9/xc2 LATIN SMALL LETTER D WITH STROKE
<U00F0> /x8f/xa9/xc3 LATIN SMALL LETTER ETH
<U0127> /x8f/xa9/xc4 LATIN SMALL LETTER H WITH STROKE
--- localedata/charmaps/GBK.bak 2000-09-26 14:39:56.000000000 +0200
+++ localedata/charmaps/GBK 2002-10-18 01:12:04.000000000 +0200
@@ -6243,7 +6243,7 @@
<U300F> /xa1/xbb RIGHT WHITE CORNER BRACKET
<U3016> /xa1/xbc LEFT WHITE LENTICULAR BRACKET
<U3017> /xa1/xbd RIGHT WHITE LENTICULAR BRACKET
-<U3010> /xa1/xbe LEFT BLACK LENTICULAR
+<U3010> /xa1/xbe LEFT BLACK LENTICULAR BRACKET
<U3011> /xa1/xbf RIGHT BLACK LENTICULAR BRACKET
<U00B1> /xa1/xc0 PLUS-MINUS SIGN
<U00D7> /xa1/xc1 MULTIPLICATION SIGN
@@ -6263,8 +6263,8 @@
<U2220> /xa1/xcf ANGLE
<U2312> /xa1/xd0 ARC
<U2299> /xa1/xd1 CIRCLED DOT OPERATOR
-<U222B> /xa1/xd2 SQUARE IMAGE OF
-<U222E> /xa1/xd3 SQUARE ORIGINAL OF OR EQUAL TO
+<U222B> /xa1/xd2 INTEGRAL
+<U222E> /xa1/xd3 CONTOUR INTEGRAL
<U2261> /xa1/xd4 IDENTICAL TO
<U224C> /xa1/xd5 ALL EQUAL TO
<U2248> /xa1/xd6 ALMOST EQUAL TO
@@ -6393,7 +6393,7 @@
<UFF01> /xa3/xa1 FULLWIDTH EXCLAMATION MARK
<UFF02> /xa3/xa2 FULLWIDTH QUOTATION MARK
<UFF03> /xa3/xa3 FULLWIDTH NUMBER SIGN
-<UFFE5> /xa3/xa4 FULLWIDTH DOLLAR SIGN
+<UFFE5> /xa3/xa4 FULLWIDTH YEN SIGN
<UFF05> /xa3/xa5 FULLWIDTH PERCENT SIGN
<UFF06> /xa3/xa6 FULLWIDTH AMPERSAND
<UFF07> /xa3/xa7 FULLWIDTH APOSTROPHE
--- localedata/charmaps/ISO-8859-11.bak 2002-07-11 10:54:41.000000000 +0200
+++ localedata/charmaps/ISO-8859-11 2002-10-17 01:39:16.000000000 +0200
@@ -189,7 +189,7 @@
<U0E38> /xd8 THAI CHARACTER SARA U
<U0E39> /xd9 THAI CHARACTER SARA UU
<U0E3A> /xda THAI CHARACTER PHINTHU
-<U0E3F> /xdf THAI CHARACTER SYMBOL BAHT
+<U0E3F> /xdf THAI CURRENCY SYMBOL BAHT
<U0E40> /xe0 THAI CHARACTER SARA E
<U0E41> /xe1 THAI CHARACTER SARA AE
<U0E42> /xe2 THAI CHARACTER SARA O
--- localedata/charmaps/KOI8-T.bak 2001-08-03 20:42:01.000000000 +0200
+++ localedata/charmaps/KOI8-T 2002-10-17 02:03:38.000000000 +0200
@@ -172,7 +172,7 @@
<U00B0> /xb0 DEGREE SIGN
<U00B1> /xb1 PLUS-MINUS SIGN
<U00B2> /xb2 SUPERSCRIPT TWO
-<U0401> /xb3 CYRILLIC CAPITAL LETTER IE
+<U0401> /xb3 CYRILLIC CAPITAL LETTER IO
<U04E2> /xb5 CYRILLIC CAPITAL LETTER I WITH MACRON
<U00B6> /xb6 PILCROW SIGN
<U00B7> /xb7 MIDDLE DOT
--- localedata/charmaps/KOI8-U.bak 2000-07-03 16:45:24.000000000 +0200
+++ localedata/charmaps/KOI8-U 2002-10-17 02:00:02.000000000 +0200
@@ -158,12 +158,12 @@
<U2219> /x95 BULLET OPERATOR
<U221A> /x96 SQUARE ROOT
<U2248> /x97 ALMOST EQUAL TO
-<U2264> /x98 LESS THAN OR EQUAL TO
-<U2265> /x99 GREATER THAN OR EQUAL TO
+<U2264> /x98 LESS-THAN OR EQUAL TO
+<U2265> /x99 GREATER-THAN OR EQUAL TO
<U00A0> /x9a NO-BREAK SPACE
<U2321> /x9b BOTTOM HALF INTEGRAL
<U00B0> /x9c DEGREE SIGN
-<U00B2> /x9d SUPERSCRIPT DIGIT TWO
+<U00B2> /x9d SUPERSCRIPT TWO
<U00B7> /x9e MIDDLE DOT
<U00F7> /x9f DIVISION SIGN
<U2550> /xa0 BOX DRAWINGS DOUBLE HORIZONTAL
@@ -187,7 +187,7 @@
<U2561> /xb2 BOX DRAWINGS VERTICAL SINGLE AND LEFT DOUBLE
<U0401> /xb3 CYRILLIC CAPITAL LETTER IO
<U0404> /xb4 CYRILLIC CAPITAL LETTER UKRAINIAN IE
-<U2563> /xb5 DOUBLE VERTICAL AND LEFT
+<U2563> /xb5 BOX DRAWINGS DOUBLE VERTICAL AND LEFT
<U0406> /xb6 CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
<U0407> /xb7 CYRILLIC CAPITAL LETTER YI (Ukrainian)
<U2566> /xb8 BOX DRAWINGS DOUBLE DOWN AND HORIZONTAL
--- localedata/charmaps/MAC-SAMI.bak 2002-04-25 15:45:03.000000000 +0200
+++ localedata/charmaps/MAC-SAMI 2002-10-17 12:04:39.000000000 +0200
@@ -213,8 +213,8 @@
<U00C0> /xcb LATIN CAPITAL LETTER A WITH GRAVE
<U00C3> /xcc LATIN CAPITAL LETTER A WITH TILDE
<U00D5> /xcd LATIN CAPITAL LETTER O WITH TILDE
-<U0152> /xce LATIN CAPITAL LETTER LIGATURE OE
-<U0153> /xcf LATIN SMALL LETTER LIGATURE OE
+<U0152> /xce LATIN CAPITAL LIGATURE OE
+<U0153> /xcf LATIN SMALL LIGATURE OE
<U2013> /xd0 EN DASH
<U2014> /xd1 EM DASH
<U201C> /xd2 LEFT DOUBLE QUOTATION MARK
@@ -258,7 +258,7 @@
<U01EE> /xf8 LATIN CAPITAL LETTER EZH WITH CARON
<U01EF> /xf9 LATIN SMALL LETTER EZH WITH CARON
<U01E4> /xfa LATIN CAPITAL LETTER G WITH STROKE
-<U01E5> /xfb LATIN SMALLL LETTER G WITH STROKE
+<U01E5> /xfb LATIN SMALL LETTER G WITH STROKE
<U01E6> /xfc LATIN CAPITAL LETTER G WITH CARON
<U01E7> /xfd LATIN SMALL LETTER G WITH CARON
<U01E8> /xfe LATIN CAPITAL LETTER K WITH CARON
--- localedata/charmaps/TIS-620.bak 2000-10-02 16:09:49.000000000 +0200
+++ localedata/charmaps/TIS-620 2002-10-17 12:18:03.000000000 +0200
@@ -195,7 +195,7 @@
<U0E38> /xd8 THAI CHARACTER SARA U
<U0E39> /xd9 THAI CHARACTER SARA UU
<U0E3A> /xda THAI CHARACTER PHINTHU
-<U0E3F> /xdf THAI CHARACTER SYMBOL BAHT
+<U0E3F> /xdf THAI CURRENCY SYMBOL BAHT
<U0E40> /xe0 THAI CHARACTER SARA E
<U0E41> /xe1 THAI CHARACTER SARA AE
<U0E42> /xe2 THAI CHARACTER SARA O