This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: Improved check-localedef script


4.08.2017 11:50 Mike FABIAN <mfabian@redhat.com> wrote:
>
> Rafal Luzynski <digitalfreak@lingonborough.com> wrote:
> [...]
> > They may describe something more complex like <U00E0>. But even this
> > is not UTF-8 because UTF-8 would be <C3> <A0> (UTF-8 is 8-bit). The
> > closest charset would be UCS-2 or simply a generic Unicode.
>
> My understanding at the moment is that the “% Charset: ...” comment
> indicates the encoding used to write the source file.

I agree.

> So something like
> “<U20AC>” is definitely ASCII. Non-ASCII stuff in locale source files
> seems to exist only in comments at the moment.

So probably UTF-8 is a good choice.  And converting to UTF-8 if
there is something non-UTF-8.

Rafal


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]