This is the mail archive of the
docbook-apps@lists.oasis-open.org
mailing list .
Re: Choosing a characterset for DocBook
- From: Jirka Kosek <jirka at kosek dot cz>
- To: Jens Stavnstrup <js at ddre dot dk>
- Cc: docbook-apps at lists dot oasis-open dot org
- Date: Fri, 15 Mar 2002 12:49:40 +0100
- Subject: Re: DOCBOOK-APPS: Choosing a characterset for DocBook
- References: <Pine.LNX.4.44.0203151159260.2207-100000@ares.ddre.dk>
Jens Stavnstrup wrote:
> > If your documents will contain a lot of character outside of ISO Latin 1
> > or ASCII using UTF-8 is best choice, assuming that all editors used can
> > deal with UTF-8.
> >
>
> Not really, the problem is basically, that Word, which might be used to
> to edit the XML sources, kindly add invinsible characters to my document.
> And this might cause problems for my colleagues.
Even if you save it as plain text? If you will use ISO-8859-1 there
shouldn't be added any additional characters. Problem may be with UTF-8
usage. MS applications adds byte-order mark to the beginning of UTF-8
files. This is not supported some older XML parsers, as it was not
required by XML spec. I think that Second edition of XML 1.0 solved this
issue.
Jirka
--
-----------------------------------------------------------------
Jirka Kosek
e-mail: jirka@kosek.cz
http://www.kosek.cz