This is the mail archive of the xsl-list@mulberrytech.com mailing list.



RE: Capitalizing content of a variable


Joerg Pietschmann wrote at  9 Aug 2001 13:59:54 +0200:
...
 > But as you brought up the point: AFAIK both "MASSE" and "MASZE" are
 > valid uppercase versions for "Maße", because the former is

Interesting.  I don't recall seeing the "SZ" form before.

...
 > I have to say that it is surprisingly hard to get comprehensive,
 > precisely worded standards for localised, language-dependent string
 > processing. The Unicode Standard goes a long way to provide definitions
 > of categories to think in, but such stuff as what is applicable
 > for a certain language such as character set, concepts of character
 > case, case transformations, collation rules and the like appears to

The Unicode Standard provides case mappings in the UnicodeData.txt
file for every version of Unicode and, for non 1:1 mappings, in
SpecialCasing.txt.
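
As a rough illustration of a non 1:1 mapping, here is a minimal Python sketch. It assumes Python 3, whose built-in str.upper() applies the Unicode full case mappings, including the one-to-many entries from SpecialCasing.txt. Python is used only for illustration and is not part of the original discussion.

```python
# Sketch: uppercase a string containing U+00DF (LATIN SMALL LETTER SHARP S).
# UnicodeData.txt gives this character no single-character uppercase
# mapping; SpecialCasing.txt maps it to the two-character sequence "SS".
word = "Maße"
upper = word.upper()

print(upper)                  # MASSE
print(len(word), len(upper))  # 4 5 (the string grows by one character)
```

Because the result can be longer than the input, code that assumes a character-for-character case mapping will not handle this case correctly.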

Character case is identified in UnicodeData.txt by each character's
General Category value (Lu, Ll or Lt).
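
A small sketch of reading that information, assuming Python 3 and its standard unicodedata module, which exposes the General Category field of UnicodeData.txt. The helper name case_of is hypothetical, and the sketch looks only at the Lu, Ll and Lt categories; the derived Uppercase and Lowercase properties cover some additional characters that it ignores.

```python
import unicodedata

def case_of(ch):
    """Classify one character by its General Category (hypothetical helper)."""
    category = unicodedata.category(ch)
    # Lu = uppercase letter, Ll = lowercase letter, Lt = titlecase letter.
    return {"Lu": "upper", "Ll": "lower", "Lt": "title"}.get(category, "uncased")

print([case_of(ch) for ch in "Maße"])  # ['upper', 'lower', 'lower', 'lower']
```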

The Unicode Collation Algorithm is specified in UTS #10.

There's certainly no shortage of desire on the part of the Unicode
Consortium to nail down every aspect of plain text processing with
Unicode.

...
 > If you have some good sources, I'd be quite interested in hearing about
 > them.

I have numerous books on writing systems and the like, but for the
rules for Unicode text, I look to the Unicode Standard.

Regards,


Tony Graham
------------------------------------------------------------------------
Tony Graham                           mailto:tony.graham@ireland.sun.com
Sun Microsystems Ireland Ltd                       Phone: +353 1 8199708
Hamilton House, East Point Business Park, Dublin 3            x(70)19708

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list

