This is the mail archive of the
libc-alpha@sources.redhat.com
mailing list for the glibc project.
Re: ISIRI-3342 converter broken
- To: Pablo Saratxaga <pablo at mandrakesoft dot com>
- Subject: Re: ISIRI-3342 converter broken
- From: Roozbeh Pournader <roozbeh at sina dot sharif dot ac dot ir>
- Date: Fri, 4 Aug 2000 00:32:05 +0430 (IRDT)
- cc: Bruno Haible <haible at ilog dot fr>, libc-alpha at sourceware dot cygnus dot com
On Thu, 3 Aug 2000, Pablo Saratxaga wrote:
> That point isn't really very important (I doubt it would even be used).
I have seen not even a single usage of 0x80--0x9F in ISIRI-3342, except
the higher TAB.
> On the other hand an almost blind conversion (using the values in the charset
> description file) would be good enough in most of the cases.
I agree.
> Another thing; there are some chars that are close in shape, and sometimes
> are interchanged; for example I've seen web pages in Farsi language written
> in utf-8 that uses the unicode char ALEF MAKSURA in place of FARSI YEH (the
> shapes are almost identical).
That's because of some bugs in Microsoft software, of course. Regardless
of this, your word is completely true.
> Anyway, it may be desirable, from a user perspective, to convert both sets
> of digits to the digits 0xb0-0xb9 in ISIRI-3342
It is. Even from a programmers perpective.
--roozbeh