This is the mail archive of the xsl-list@mulberrytech.com mailing list.
Re: html to xml
David Carlisle writes:
> Surely grabbing floating PCDATA and sticking it in a paragraph element
> is something easily done in the post tidy XSL transformation to TEI.
easily? take this:
<h1>Intro</h1>
I am the walrus
<pre>Yes I am </pre>
<p>A new paragraph
and insert a <p> around
I am the walrus
<pre>Yes I am </pre>
....
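To make the case concrete, here is a rough sketch of the wrapping step, written in Python with the standard library rather than in XSL. It assumes the input has already been tidied into well-formed XML, and the function name wrap_floating_text is made up for illustration. It only handles bare text sitting directly between block elements. Text mixed with inline elements such as <b> or <a> would need the run of text and inline nodes grouped up to the next block element, which is the harder part of the problem.

```python
import xml.etree.ElementTree as ET

def wrap_floating_text(parent):
    """Wrap loose text found directly under `parent` in new <p> elements.

    Hypothetical helper: ElementTree stores text that follows an element
    in that element's `tail`, so floating text shows up as a tail.
    """
    result = []
    # Text before the first child lives in parent.text.
    if parent.text and parent.text.strip():
        p = ET.Element("p")
        p.text = parent.text.strip()
        result.append(p)
    parent.text = None
    for child in list(parent):
        tail = child.tail
        child.tail = None
        result.append(child)
        # Non-whitespace text after a child becomes its own paragraph.
        if tail and tail.strip():
            p = ET.Element("p")
            p.text = tail.strip()
            result.append(p)
    # Replace the old children with the rebuilt sequence.
    for child in list(parent):
        parent.remove(child)
    parent.extend(result)
    return parent

body = ET.fromstring(
    "<body><h1>Intro</h1>\nI am the walrus\n"
    "<pre>Yes I am </pre>\n<p>A new paragraph</p></body>"
)
wrap_floating_text(body)
print([child.tag for child in body])  # ['h1', 'p', 'pre', 'p']
```

The new paragraph stops before <pre> only because <pre> is a separate element. Nothing here decides which elements count as block level.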
> Grabbing html section heads into TEI/docbook style section containers is
> always a pain but you can do it in XSL with the usual "grouping"
> techniques.
hmm, I hadn't thought of it as a grouping problem. you are right, though,
thanks
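The grouping idea can be sketched outside XSL as well. The following Python illustration (standard library only) turns a flat run of h1..h6 headings and their following siblings into nested containers. The function name group_sections and the TEI-like output names div and head are assumptions chosen for the example, not anything prescribed in this thread. It is a stack-based pass, not the XSL key-based grouping idiom, and it does not invent missing levels: an h3 directly after an h1 simply nests one level down.

```python
import re
import xml.etree.ElementTree as ET

HEADING = re.compile(r"h([1-6])$")

def group_sections(body):
    """Return a new <body> with flat headings turned into nested <div>s.

    Hypothetical helper: each heading opens a <div> holding a <head>,
    and everything up to the next heading of the same or a higher
    level goes inside that <div>.
    """
    root = ET.Element("body")
    stack = [(0, root)]  # (heading level, open container)
    for child in list(body):
        match = HEADING.match(child.tag)
        if match:
            level = int(match.group(1))
            # Close every open section at this level or deeper.
            while stack[-1][0] >= level:
                stack.pop()
            div = ET.SubElement(stack[-1][1], "div")
            head = ET.SubElement(div, "head")
            head.text = child.text
            head.extend(list(child))  # keep inline markup in the heading
            stack.append((level, div))
        else:
            # Ordinary content belongs to the innermost open section.
            stack[-1][1].append(child)
    return root

flat = ET.fromstring(
    "<body><h1>A</h1><p>a</p><h2>B</h2><p>b</p><h1>C</h1><p>c</p></body>"
)
nested = group_sections(flat)
print([d.find("head").text for d in nested.findall("div")])  # ['A', 'C']
```

Section B ends up as a div inside section A, and the second h1 closes both before opening section C.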
> It's made a bit easier if you know that the H? elements all
> appear in "correct" sequence, not jumping from h1 to h3. If you use
> ISO-HTML DTD then the SGML parser (e.g. sx) will add any missing section
> levels automagically if you set the appropriate parameter entity.
we live and learn. I didn't know that either. as if I had valid HTML
that sx would accept, of course!
sebastian
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list