This is the mail archive of the xsl-list@mulberrytech.com mailing list .


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]

Re: Escaping/converting special characters


At 02:45 29-10-2001, Ross Kendall wrote:
>On some files I get special characters (eg '£') which cause parse errors
>when trying to transform my (not quite valid) XML.  What I need to know is
>how can easily convert these special characters to entities or escape them
>with CDATA tags in such a manner that I can transform them to valid WML.  Is
>it a stupid idea to put all the data in CDATA tags?  Otherwise, how will I
>know which characters are going to need escaping.

It sounds like the incoming files are in ISO 8859-1 encoding (Latin-1).  If 
so, then tell your Perl script to output an encoding parameter in the XML 
declaration, or follow "cutlass"'s suggestion and turn everything above 
character 127 into a numeric character reference.  Either should be pretty 
easy to do in Perl.

-Chris
-- 
Christopher R. Maden, Principal Consultant, HMM Consulting Int'l, Inc.
DTDs/schemas - conversion - ebooks - publishing - Web - B2B - training
<URL: http://www.hmmci.com/ > <URL: http://crism.maden.org/consulting/ >
PGP Fingerprint: BBA6 4085 DED0 E176 D6D4  5DFC AC52 F825 AFEC 58DA

PGP signature


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]