This is the mail archive of the
docbook-apps@lists.oasis-open.org
mailing list .
Re: How to translate HTML to DocBook
- From: "Dave Brooks, BCS Systems" <dave at bcs dot co dot nz>
- To: Andrew Westcombe <asw at getsystems dot com>,Patrick Hartling <patrick at vrac dot iastate dot edu>, docbook-apps at lists dot oasis-open dot org
- Date: Tue, 19 Mar 2002 16:04:21 +1200
- Subject: Re: DOCBOOK-APPS: How to translate HTML to DocBook
- References: <3C8E8878.20603@vrac.iastate.edu><20020312210028Z684242-9966+600@mail.centrum.cz>
At 12:53 19/03/2002 +1100, Andrew Westcombe wrote:
>At 05:00 PM 12/03/2002 -0600, Patrick Hartling wrote:
>
>> It also helps if the source is "good" HTML. Having closing tags such
>> as </li>, </p>, and </br> helps immensely.
>
>
>I've used DocParse myself, it's not bad, and very good value. As for
>having "good" HTML, Dreamweaver has a very nice command for stripping out
>junk, esp. from former MSWord files.
HTML Tidy (see http://www.w3.org/People/Raggett/tidy/) is very good for
cleaning up HTML.
Dave