Date:      Thu, 4 Nov 1999 18:28:18 +0100
From:      Wolfram Schneider <wosch@cs.tu-berlin.de>
To:        doc@freebsd.org
Cc:        wosch@freebsd.org
Subject:   HTML to XML converter.
Message-ID:  <19991104182818.A9400@freno.cs.tu-berlin.de>

I'm looking for an HTML to XML converter.  Is this possible with the
FreeBSD SGML tools (jade, tidy, etc.)?

For example:

An HTML file:
<HTML>
        <HEAD>
                <META NAME="description" CONTENT="hello world">
                <META NAME="author" CONTENT="olaf">
                <TITLE>my title</TITLE>
        </HEAD>
        <BODY>
                <I>blab</I>
                <!-- START TEXT AREA -->
                <H1>Olaf is</H1>
                Hi <B><I>[HTML with errors]</B>World.
                <!-- STOP TEXT AREA -->
                [HTML banner]
                <!-- STOP TEXT AREA -->
                [Navigation elements]
                <B>blub</B>
        </BODY>
</HTML>


And the XML output:

<Document>
        <URL>...</URL>
        <ORIG_FORMAT>HTML</ORIG_FORMAT>
        <TITLE>my title</TITLE>
        <META>
                <DESCRIPTION>hello world</DESCRIPTION>
                <AUTHOR>olaf</AUTHOR>
        </META>
        <TEXT>
                <H1>Olaf is</H1>
                Hi World.
        </TEXT>
</Document>
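
For illustration only, the mapping above could be sketched in Python with
the standard library's html.parser module. This is not a jade or tidy
solution, just a rough outline of the transformation. The names
TextAreaExtractor, html_to_xml and KEEP_TAGS are hypothetical, and the
sketch assumes that META names are usable as XML element names, that only
the first START/STOP TEXT AREA pair matters, and that text inside stripped
inline tags is kept (so a bracketed placeholder inside <B> or <I> would
survive rather than vanish as in the sample output).

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape

# Assumption: block-level tags copied into <TEXT>; other tags are stripped.
KEEP_TAGS = {"h1", "h2", "h3", "p"}


class TextAreaExtractor(HTMLParser):
    """Collects the title, META name/content pairs and the marked text area."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.meta = {}
        self.text_parts = []
        self._open = []        # kept tags currently open inside the text area
        self._in_title = False
        self._in_text = False
        self._done = False     # True once the first text area has ended

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and attrs.get("name"):
            self.meta[attrs["name"].upper()] = attrs.get("content") or ""
        elif self._in_text and tag in KEEP_TAGS:
            self._open.append(tag)
            self.text_parts.append("<%s>" % tag.upper())

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif self._in_text and tag in self._open:
            # Close kept tags nested inside this one so the output stays balanced.
            while self._open:
                top = self._open.pop()
                self.text_parts.append("</%s>" % top.upper())
                if top == tag:
                    break

    def handle_comment(self, data):
        marker = data.strip()
        if marker == "START TEXT AREA" and not self._done:
            self._in_text = True
        elif marker == "STOP TEXT AREA" and self._in_text:
            # Close anything the broken HTML left open, then stop collecting.
            while self._open:
                self.text_parts.append("</%s>" % self._open.pop().upper())
            self._in_text = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._in_text:
            self.text_parts.append(escape(data))


def html_to_xml(html, url="..."):
    """Return an XML string in the <Document> layout shown above."""
    parser = TextAreaExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join("".join(parser.text_parts).split())  # collapse whitespace
    lines = [
        "<Document>",
        "<URL>%s</URL>" % escape(url),
        "<ORIG_FORMAT>HTML</ORIG_FORMAT>",
        "<TITLE>%s</TITLE>" % escape(parser.title.strip()),
        "<META>",
    ]
    for name, content in parser.meta.items():
        lines.append("<%s>%s</%s>" % (name, escape(content), name))
    lines += ["</META>", "<TEXT>%s</TEXT>" % text, "</Document>"]
    return "\n".join(lines)
```

Calling html_to_xml(open("page.html").read()) on a file like the one above
would return the XML as a string; page.html is a placeholder file name.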
-- 
Wolfram Schneider <wosch@freebsd.org> http://wolfram.schneider.org

