This is the mail archive of the libc-locales@sourceware.org mailing list for the GNU libc locales project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug localedata/2872] Transliteration Cyrillic -> ASCII fails


https://sourceware.org/bugzilla/show_bug.cgi?id=2872

--- Comment #7 from Egor Kobylkin <ekobylkin at paypal dot com> ---
Andreas, in your example the Cyrillic transliteration does not work either.  My
understanding is that the tool is lacking a translation table for Cyrillic to
TRANSLIT for example in the ru_RU locale. This is what Ulrich Drepper asks for
in his comment2 here: https://sourceware.org/bugzilla/show_bug.cgi?id=2872#c2
I would like to know in which form this data should be provided?

I am only concerned with the Cyrillic for now. German serves as an example that
the functionality works at all in at least one case.

My first submission is from Cygwin on Windows 7. While it may be indeed some
effect of the Cygwin (name similarity to Cyrillic is coincidental) I have just
tried the same on Ubuntu 12 for essentially same effect.

$ echo ÐÐÐÐÐÑÐ |LANG=ru_RU.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo ÐÐÐÐÐÑÐ |LANG=sr_CS.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo ÐÐÐÐÐÑÐ |LANG=de_DE.UTF-8 iconv -t ASCII//TRANSLIT
iconv: illegal input sequence at position 0
$ echo MÃÃte |LANG=de_DE.UTF-8 iconv -t ASCII//TRANSLIT
Miconv: illegal input sequence at position 1
$ echo ÐÐÐÐÐÑÐ |LANG=en_US.UTF-8 iconv -t ASCII//TRANSLIT
???????
$ echo MÃÃte |LANG=en_US.UTF-8 iconv -t ASCII//TRANSLIT
Musste

-- 
You are receiving this mail because:
You are the assignee for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]