Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 13 Nov 2007 12:43:15 +0000
From:      Thomas Hurst <tom.hurst@clara.net>
To:        Anton Yuzhaninov <citrin@citrin.ru>
Cc:        freebsd-geom@freebsd.org
Subject:   Re: gstat don't work on 7.0-BETA2
Message-ID:  <20071113124315.GB80959@voi.aagh.net>
In-Reply-To: <4735EA87.7040306@citrin.ru>
References:  <4735AAE3.7070209@citrin.ru> <4735EA87.7040306@citrin.ru>

next in thread | previous in thread | raw e-mail | index | archive | help
* Anton Yuzhaninov (citrin@citrin.ru) wrote:

> It fail to parse xml because it contain non-ASCII symbols (in my case 
> cyrillic in cp1251) from CD label:

Not non-ASCII, non-UTF-8, which is the default for XML:

 geom-tree.xml:268: parser error : Input is not proper UTF-8, indicate
 encoding !
 Bytes: 0xEE 0xEA 0xF2 0x20
       <name>iso9660/21 окт 2005</name>

These should be encoded as &#xEE;&#xEA;&#xF2;.

The file can also be made to parse by giving it an encoding by
prepending an XML prolog ala:

 <?xml version="1.0" encoding="iso-8859-1" ?>

Though this will just move the problem elsewhere (e.g. chars 1-31).
geom_dump.c doesn't even seem to cope with & or <; are these disallowed
at a higher level?

-- 
Thomas 'Freaky' Hurst
    http://hur.st/



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20071113124315.GB80959>