From owner-freebsd-questions@FreeBSD.ORG Wed Sep 10 06:22:02 2008 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6F2E1106566C for ; Wed, 10 Sep 2008 06:22:02 +0000 (UTC) (envelope-from rsmith@xs4all.nl) Received: from smtp-vbr9.xs4all.nl (smtp-vbr9.xs4all.nl [194.109.24.29]) by mx1.freebsd.org (Postfix) with ESMTP id 05FBF8FC14 for ; Wed, 10 Sep 2008 06:22:01 +0000 (UTC) (envelope-from rsmith@xs4all.nl) Received: from slackbox.xs4all.nl (slackbox.xs4all.nl [213.84.242.160]) by smtp-vbr9.xs4all.nl (8.13.8/8.13.8) with ESMTP id m8A6LwWs017275; Wed, 10 Sep 2008 08:21:59 +0200 (CEST) (envelope-from rsmith@xs4all.nl) Received: by slackbox.xs4all.nl (Postfix, from userid 1001) id 2ACF3B85F; Wed, 10 Sep 2008 08:21:49 +0200 (CEST) Date: Wed, 10 Sep 2008 08:21:49 +0200 From: Roland Smith To: Gary Kline Message-ID: <20080910062149.GA78805@slackbox.xs4all.nl> References: <20080909043503.GA21663@thought.org> <20080909165456.GA56556@slackbox.xs4all.nl> <20080909121430.M12798@qroenaqrq.6qbyyneqvnyhc.pbz> <20080909221608.GB51272@thought.org> <20080909223941.GC65291@slackbox.xs4all.nl> <20080909230840.GB54455@thought.org> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="PNTmBPCT7hxwcZjr" Content-Disposition: inline In-Reply-To: <20080909230840.GB54455@thought.org> X-GPG-Fingerprint: 1A2B 477F 9970 BA3C 2914 B7CE 1277 EFB0 C321 A725 X-GPG-Key: http://www.xs4all.nl/~rsmith/pubkey.txt X-GPG-Notice: If this message is not signed, don't assume I sent it! User-Agent: Mutt/1.5.18 (2008-05-17) X-Virus-Scanned: by XS4ALL Virus Scanner Cc: FreeBSD Mailing List Subject: Re: how can i use ISO-8859-1?? X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 10 Sep 2008 06:22:02 -0000 --PNTmBPCT7hxwcZjr Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Sep 09, 2008 at 04:08:40PM -0700, Gary Kline wrote: > On Wed, Sep 10, 2008 at 12:39:41AM +0200, Roland Smith wrote: > > On Tue, Sep 09, 2008 at 03:16:08PM -0700, Gary Kline wrote: > > > > Because it is a hiddeous waste for most readers and writers of > > > > English and other European languages. > >=20 > > > I also argured that utf-8 was a waste of a whole byte per char > > > for most of us. > >=20 > > That's not true. UTF-8 is a variable-length encoding. It is backwards > > compatible with ASCII, i.e. ascii characters are one byte in UTF-8 as > > well. Are you thinking about UTF-16? >=20 >=20 > I don't know. (Mark Twain.) Back in the late 1990's I was > assigned the project of converting all the utilities I had ported > to three European languages. Until now I had no idea there was > anything *but* utf-16, i.e. 2-bytes/char. =20 Both UTF-8 and UTF-16 are variable-width encodings.=20 =20 > With memory seriously getting to be dirt-cheap, "wasting 8-bits > doesn't seem that big a deal.=20 Indeed. > Maybe some future wizard will > invent a UTF-32 that will hold all ~90 000 Chinese characters and > these will be downsized automatically to UTF-8 when you're mixing > Mandarin with, say, Cesk [Czeck]. =20 UTF-32 already exists, but it's a fixed-width (4 bytes) encoding. > Hmm, somebody just told me that "aigu" is not English but French > and means "acute". ...all these years i thought ... oh well. > Anyway, do you know if '\0351' is a 16-bit character? is is 0xE9 > and decimal 233 and certaing should fit into a byte. just > wondering. Obviously it is a 8-bit character; anything in the range 0-255 is. In ISO 8859-1(5) it is "=E9" (e with accent aigu). Please look up UTF-8,16,32 and ISO-8859-15 on Wikipedia for further enlightenment. Roland --=20 R.F.Smith http://www.xs4all.nl/~rsmith/ [plain text _non-HTML_ PGP/GnuPG encrypted/signed email much appreciated] pgp: 1A2B 477F 9970 BA3C 2914 B7CE 1277 EFB0 C321 A725 (KeyID: C321A725) --PNTmBPCT7hxwcZjr Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.9 (FreeBSD) iEYEARECAAYFAkjHZ30ACgkQEnfvsMMhpyWlFgCcCLx749G8KqudatQcw1Fwg9Ys W74AnAtpNu5M+zEr+JfCHpVPZmBJKnZT =nbc4 -----END PGP SIGNATURE----- --PNTmBPCT7hxwcZjr--