This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug localedata/2872] Transliteration Cyrillic -> ASCII fails


https://sourceware.org/bugzilla/show_bug.cgi?id=2872

--- Comment #7 from Egor Kobylkin <ekobylkin at paypal dot com> ---
Andreas, in your example the Cyrillic transliteration does not work either.  My
understanding is that the tool is lacking a translation table for Cyrillic to
TRANSLIT for example in the ru_RU locale. This is what Ulrich Drepper asks for
in his comment2 here: https://sourceware.org/bugzilla/show_bug.cgi?id=2872#c2
I would like to know in which form this data should be provided?

I am only concerned with the Cyrillic for now. German serves as an example that
the functionality works at all in at least one case.

My first submission is from Cygwin on Windows 7. While it may be indeed some
effect of the Cygwin (name similarity to Cyrillic is coincidental) I have just
tried the same on Ubuntu 12 for essentially same effect.

$ echo ÐÐÐÐÐÑÐ |LANG=ru_RU.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo ÐÐÐÐÐÑÐ |LANG=sr_CS.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo ÐÐÐÐÐÑÐ |LANG=de_DE.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo MÃÃte |LANG=de_DE.UTF-8 iconv -t ASCII//TRANSLIT
Miconv: illegal input sequence at position 1
$ echo ÐÐÐÐÐÑÐ |LANG=en_US.UTF-8 iconv -t ASCII//TRANSLIT
???????
$ echo MÃÃte |LANG=en_US.UTF-8 iconv -t ASCII//TRANSLIT
Musste

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]