From owner-freebsd-bugs Mon Sep 22 20:50:03 1997 Return-Path: Received: (from root@localhost) by hub.freebsd.org (8.8.7/8.8.7) id UAA12858 for bugs-outgoing; Mon, 22 Sep 1997 20:50:03 -0700 (PDT) Received: (from gnats@localhost) by hub.freebsd.org (8.8.7/8.8.7) id UAA12850; Mon, 22 Sep 1997 20:50:01 -0700 (PDT) Date: Mon, 22 Sep 1997 20:50:01 -0700 (PDT) Message-Id: <199709230350.UAA12850@hub.freebsd.org> To: freebsd-bugs Cc: From: "Daniel O'Callaghan" Subject: Re: docs/4555: Typo in utf2(4) man page Reply-To: "Daniel O'Callaghan" Sender: owner-freebsd-bugs@FreeBSD.ORG X-Loop: FreeBSD.org Precedence: bulk The following reply was made to PR docs/4555; it has been noted by GNATS. From: "Daniel O'Callaghan" To: freebsd-gnats-submit@freebsd.org, tejblum@arc.hq.cti.ru Cc: Subject: Re: docs/4555: Typo in utf2(4) man page Date: Tue, 23 Sep 1997 13:43:58 +1100 Hello, you wrote: > utf2(4) claims that runes 0x0080-0x03ff (00000bbb.bbbbbbbb) encoded with 2 > ^ ^^^ > bytes. Of course, it really means 0x0080-0x07ff. > > How-To-Repeat > Fix > > --- /usr/src/lib/libc/locale/utf2.4 Tue Sep 16 18:20:11 1997 > +++ utf2.4 Tue Sep 16 21:45:31 1997 > @@ -60,7 +60,7 @@ > encoding is represented by the following table: > .Bd -literal > [0x0000 - 0x007f] [00000000.0bbbbbbb] -> 0bbbbbbb > -[0x0080 - 0x03ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb > +[0x0080 - 0x07ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb > [0x0400 - 0xffff] [bbbbbbbb.bbbbbbbb] -> 1110bbbb, 10bbbbbb, 10bbbbbb Look closely and you'll see that the ranges are 0x0000-0x007f, 0x0080-0x03ff and 0x0400 -0xffff, making three contiguous ranges. Your patch would suggest an overlap as the middle range would end at 0x07ff, after the third range starts at 0x0400. Danny