Date: Mon, 22 Sep 1997 20:50:01 -0700 (PDT) From: "Daniel O'Callaghan" <danny@FreeBSD.ORG> To: freebsd-bugs Subject: Re: docs/4555: Typo in utf2(4) man page Message-ID: <199709230350.UAA12850@hub.freebsd.org>
next in thread | raw e-mail | index | archive | help
The following reply was made to PR docs/4555; it has been noted by GNATS. From: "Daniel O'Callaghan" <danny@FreeBSD.ORG> To: freebsd-gnats-submit@freebsd.org, tejblum@arc.hq.cti.ru Cc: Subject: Re: docs/4555: Typo in utf2(4) man page Date: Tue, 23 Sep 1997 13:43:58 +1100 Hello, you wrote: > utf2(4) claims that runes 0x0080-0x03ff (00000bbb.bbbbbbbb) encoded with 2 > ^ ^^^ > bytes. Of course, it really means 0x0080-0x07ff. > > How-To-Repeat > Fix > > --- /usr/src/lib/libc/locale/utf2.4 Tue Sep 16 18:20:11 1997 > +++ utf2.4 Tue Sep 16 21:45:31 1997 > @@ -60,7 +60,7 @@ > encoding is represented by the following table: > .Bd -literal > [0x0000 - 0x007f] [00000000.0bbbbbbb] -> 0bbbbbbb > -[0x0080 - 0x03ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb > +[0x0080 - 0x07ff] [00000bbb.bbbbbbbb] -> 110bbbbb, 10bbbbbb > [0x0400 - 0xffff] [bbbbbbbb.bbbbbbbb] -> 1110bbbb, 10bbbbbb, 10bbbbbb Look closely and you'll see that the ranges are 0x0000-0x007f, 0x0080-0x03ff and 0x0400 -0xffff, making three contiguous ranges. Your patch would suggest an overlap as the middle range would end at 0x07ff, after the third range starts at 0x0400. Danny
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?199709230350.UAA12850>