Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 11 Apr 2002 11:54:18 +0000 (UTC)
From:      naddy@mips.inka.de (Christian Weisgerber)
To:        freebsd-chat@freebsd.org
Subject:   Re: what are these characters please?
Message-ID:  <a93tha$14ng$1@kemoauc.mips.inka.de>
References:  <20020411021840.42AC93F30@bast.unixathome.org>

next in thread | previous in thread | raw e-mail | index | archive | help
Dan Langille <dan@langille.org> wrote:

> When viewed under vi, I get:
> 
> Ville Skytt^[,Ad^[(B <ville.skytta@iki.fi>

I suspect that's an ISO 2022 character set switch sequence. Possibly
generated by (X)Emacs.

> The original email is at: 
> http://docs.freebsd.org/cgi/getmsg.cgi?fetch=1425317+0+current/cvs-all+raw 
> but it appears as "Ville Skyttd <ville.skytta@iki.fi>" which is presumably 
> what I also want.

Ville Skyttä.  The final character is a-umlaut.  In ISO Latin 1
it's code 0xE4.  This gives 0x64 ('d') if the top bit is reset.  If
vowel-umlaut can't be represented, Finns usually just substitute
the base vowel character, so you get Ville Skytta (cf. the email
address).

> FWIW: the xml is <?xml version="1.0" encoding="ISO-8859-1"?> and I thought 
> that encoding would handle those characters.

Yes, but for some reason it was written in a different encoding.

-- 
Christian "naddy" Weisgerber                          naddy@mips.inka.de


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-chat" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?a93tha$14ng$1>