From owner-freebsd-vuxml@FreeBSD.ORG Sun Aug 29 21:44:07 2004 Return-Path: Delivered-To: freebsd-vuxml@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id F025316A4CE for ; Sun, 29 Aug 2004 21:44:07 +0000 (GMT) Received: from bast.unixathome.org (bast.unixathome.org [66.11.174.150]) by mx1.FreeBSD.org (Postfix) with ESMTP id C565243D3F for ; Sun, 29 Aug 2004 21:44:07 +0000 (GMT) (envelope-from dan@langille.org) Received: from xeon (xeon.unixathome.org [192.168.0.18]) by bast.unixathome.org (Postfix) with ESMTP id C287E3D40 for ; Sun, 29 Aug 2004 17:44:06 -0400 (EDT) Date: Sun, 29 Aug 2004 17:44:06 -0400 (EDT) From: Dan Langille X-X-Sender: dan@xeon.unixathome.org To: freebsd-vuxml@freebsd.org Message-ID: <20040829173317.U9281@xeon.unixathome.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Subject: parsing vuln.xml with XML::Node X-BeenThere: freebsd-vuxml@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Documenting security issues in VuXML List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 29 Aug 2004 21:44:08 -0000 Hi folks, I've run into a problem parsing the vuln.xlm file. I'm using the perl XML::Node module. The issue is the body field. This field contains XHTML tags (for example,

, ,

). I have been unable to extract the contents of .... The only solution I've found is to explicitly specify tags such as

, , and

. That is by no means an ideal solution. There must be something I'm missing. It has been suggested I use XML::Parser instead (XML::Node is based upon XML::Parser). See a working example at http://beta.freshports.org/tmp/testing.tgz it runs like this: perl load_vuxml_into_db.pl vuln.xml If someone can figure out how I can do this, it will be appreciated. Thanks. -- Dan Langille - http://www.langille.org/