This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
Re: Q.) Encode URL inside HTML Anchor Tag.
- To: xsl-list at mulberrytech dot com
- Subject: Re: Q.) Encode URL inside HTML Anchor Tag.
- From: Mike Brown <mike at skew dot org>
- Date: Mon, 27 Nov 2000 15:47:26 -0700 (MST)
- Reply-To: xsl-list at mulberrytech dot com
On Nov 17, I wrote:
> (The URL-encoding algorithm is, roughly, replace certain reserved
> characters with their UTF-8 sequences, expressed as '%xx' for
> each octet, where xx is the hexadecimal representation of the
> octet; with the option of using '+' instead of '%20' for spaces.
Somehow I came to this conclusion after researching some related issues,
but I guess it's not quite accurate. The algorithm doesn't specify what
character set to use for non-ASCII characters; it just says it depends on
the scheme. The http scheme is described in the RFC for HTTP/1.1, and it
does not address the issue at all.
There are situations where HTML user agents are recommended to use UTF-8
in URIs, but this is not anything one can count on for URIs in general. I
posted more about this on xml-dev today, with references. See
http://lists.xml.org/archives/xml-dev/200011/msg00549.html
- Mike
____________________________________________________________________
Mike J. Brown, software engineer at My XML/XSL resources:
webb.net in Denver, Colorado, USA http://skew.org/xml/
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list