Date: Tue, 13 Nov 2007 12:43:15 +0000
From: Thomas Hurst
To: Anton Yuzhaninov
Cc: freebsd-geom@freebsd.org
Subject: Re: gstat don't work on 7.0-BETA2
Message-ID: <20071113124315.GB80959@voi.aagh.net>
In-Reply-To: <4735EA87.7040306@citrin.ru>
References: <4735AAE3.7070209@citrin.ru> <4735EA87.7040306@citrin.ru>

* Anton Yuzhaninov (citrin@citrin.ru) wrote:

> It fail to parse xml because it contain non-ASCII symbols (in my case
> cyrillic in cp1251) from CD label:

Not non-ASCII, but non-UTF-8. UTF-8 is the default encoding for XML:

  geom-tree.xml:268: parser error : Input is not proper UTF-8, indicate encoding !
  Bytes: 0xEE 0xEA 0xF2 0x20
  iso9660/21 окт 2005

These should be encoded as îêò.

The file can also be made to parse by prepending an XML prolog that
declares an encoding, though this will just move the problem elsewhere
(e.g. chars 1-31).

geom_dump.c doesn't even seem to cope with & or <; are these disallowed
at a higher level?

-- 
Thomas 'Freaky' Hurst    http://hur.st/
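
The points above can be reproduced outside gstat. What follows is only a rough sketch in Python, using the standard library's expat-based parser rather than whatever gstat links against. The <geom>/<name> fragment is a made-up stand-in for the real geom-tree.xml, and the windows-1251 prolog is an assumption based on the cp1251 label mentioned in the quoted report, not necessarily the prolog originally suggested.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Hypothetical stand-in for the offending element: a CD label in cp1251.
label = "iso9660/21 окт 2005".encode("cp1251")
doc = b"<geom><name>" + label + b"</name></geom>"


def parses(data: bytes) -> bool:
    """Return True if expat accepts data as well-formed XML."""
    try:
        ET.fromstring(data)
        return True
    except ET.ParseError:
        return False


# 1. No declared encoding: the parser assumes UTF-8 and rejects
#    the byte sequence 0xEE 0xEA 0xF2.
no_prolog_ok = parses(doc)

# 2. A prolog naming the actual encoding lets the same bytes parse.
#    (The encoding name here is an assumption, see the note above.)
prolog = b'<?xml version="1.0" encoding="windows-1251"?>'
name = ET.fromstring(prolog + doc).find("name").text

# 3. The prolog does not help with control characters: XML 1.0 forbids
#    most of them (tab, LF and CR are the exceptions) in any encoding.
control_ok = parses(prolog + b"<geom><name>\x01</name></geom>")

# 4. '&' and '<' in a label have to be escaped before being written out.
escaped = escape("R&D <disc 1>")
```

If this behaves as expected, step 1 fails to parse, step 2 recovers the original label, step 3 fails again despite the prolog, and step 4 shows the kind of escaping a label would need before being emitted as XML.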