This is the mail archive of the libc-alpha@sources.redhat.com mailing list for the glibc project.
Index Nav: | [Date Index] [Subject Index] [Author Index] [Thread Index] | |
---|---|---|
Message Nav: | [Date Prev] [Date Next] | [Thread Prev] [Thread Next] |
Other format: | [Raw text] |
Hello, Most of the GNU (and also non-GNU free) programs which depend on case conversion are problematic in tr_TR and tr_TR.UTF-8 locales. The problem is simple but a big pain for us. In Turkish upper-case version of i is "I with dot above" (0130;LATIN CAPITAL LETTER I WITH DOT ABOVE). A single byte character is converted to a multi-byte character. lowercasing I. Lower-case version of I in Turkish is "i without the dot above" (0131;LATIN SMALL LETTER DOTLESS I). Most programs (assuming a constant byte-count) apply the conversion on the original string and the string is ruined or at least the resulting string does not represent a correct result. Nowadays we try to find the problematic programs and patch them. Gawk, grep, coreutils, vim, emacs and some others are effected from the problem. What I want to ask is. Is this the only solution for us? If so is it possible to add a caution in the glibc documentation for the developer to be aware? best regards, -- Baris Metin http://www.metin.org
Attachment:
signature.asc
Description: Digital signature
Index Nav: | [Date Index] [Subject Index] [Author Index] [Thread Index] | |
---|---|---|
Message Nav: | [Date Prev] [Date Next] | [Thread Prev] [Thread Next] |