This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug localedata/2872] Transliteration Cyrillic -> ASCII fails
- From: "ekobylkin at paypal dot com" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sourceware dot org
- Date: Mon, 07 Sep 2015 12:17:53 +0000
- Subject: [Bug localedata/2872] Transliteration Cyrillic -> ASCII fails
- Auto-submitted: auto-generated
- References: <bug-2872-131 at http dot sourceware dot org/bugzilla/>
https://sourceware.org/bugzilla/show_bug.cgi?id=2872
--- Comment #7 from Egor Kobylkin <ekobylkin at paypal dot com> ---
Andreas, in your example the Cyrillic transliteration does not work either. My
understanding is that the tool is lacking a translation table for Cyrillic to
TRANSLIT for example in the ru_RU locale. This is what Ulrich Drepper asks for
in his comment2 here: https://sourceware.org/bugzilla/show_bug.cgi?id=2872#c2
I would like to know in which form this data should be provided?
I am only concerned with the Cyrillic for now. German serves as an example that
the functionality works at all in at least one case.
My first submission is from Cygwin on Windows 7. While it may be indeed some
effect of the Cygwin (name similarity to Cyrillic is coincidental) I have just
tried the same on Ubuntu 12 for essentially same effect.
$ echo ÐÐÐÐÐÑÐ |LANG=ru_RU.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo ÐÐÐÐÐÑÐ |LANG=sr_CS.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo ÐÐÐÐÐÑÐ |LANG=de_DE.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo MÃÃte |LANG=de_DE.UTF-8 iconv -t ASCII//TRANSLIT
Miconv: illegal input sequence at position 1
$ echo ÐÐÐÐÐÑÐ |LANG=en_US.UTF-8 iconv -t ASCII//TRANSLIT
???????
$ echo MÃÃte |LANG=en_US.UTF-8 iconv -t ASCII//TRANSLIT
Musste
--
You are receiving this mail because:
You are on the CC list for the bug.