Date: Wed, 19 Sep 2007 11:12:07 +0900
From: Taku YAMAMOTO
To: Andrey Chernov
Cc: i18n@FreeBSD.ORG, Petr Hroudný, perky@FreeBSD.ORG, current@FreeBSD.ORG
Subject: Re: Ctype patch for review
Message-Id: <20070919111207.f37653fc.taku@tackymt.homeip.net>
In-Reply-To: <20070917171633.GA31179@nagual.pp.ru>
References: <20070916192924.GA12678@nagual.pp.ru> <20070917092130.GA24424@nagual.pp.ru> <20070918020100.d43beb0b.taku@tackymt.homeip.net> <20070917171633.GA31179@nagual.pp.ru>

On Mon, 17 Sep 2007 21:16:33 +0400
Andrey Chernov wrote:

> On Tue, Sep 18, 2007 at 02:01:00AM +0900, YAMAMOTO, Taku wrote:
> > Checking for __mb_cur_max is not enough for certain locales.
> > For example, SJIS has following range for JIS X0201 (a.k.a. HALFWIDTH KANA).
> >
> > /*
> >  * JIS X201
> >  */
> > PUNCT           0xa1-0xa5
> > SPACE           0xa0
> > BLANK           0xa0
> > SPECIAL         0xa1-0xdf
> > PHONOGRAM       0xa6-0xdf
> > SWIDTH1         0xa0-0xdf
>
> I don't understand your remark. MSKanji have __mb_cur_max = 2 and so those
> ranges are wchar_t ranges. My patch restrict unsigned char ranges only.

These characters ARE single-byte.
The problem is that a byte >= 0x80 is not always part of
a multi-byte character in that locale.

-- 
-|-__   YAMAMOTO, Taku
 | __ <     - A chicken is an egg's way of producing more eggs. -
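
To make the point about byte values concrete, here is a minimal sketch of how a Shift_JIS byte can be classified. It is not taken from the patch under review or from FreeBSD's libc; the names sjis_class, sjis_classify and sjis_count_chars are hypothetical, and the ranges are the standard Shift_JIS ones (0xa1-0xdf single-byte half-width kana, 0x81-0x9f and 0xe0-0xfc lead bytes of two-byte characters).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical byte classes for Shift_JIS; illustration only. */
enum sjis_class {
	SJIS_ASCII,	/* 0x00-0x7f: single byte */
	SJIS_KANA,	/* 0xa1-0xdf: single byte, JIS X 0201 half-width kana */
	SJIS_LEAD,	/* 0x81-0x9f, 0xe0-0xfc: first byte of a two-byte character */
	SJIS_OTHER	/* 0x80, 0xa0, 0xfd-0xff: outside standard Shift_JIS
			 * (the quoted locale table still maps 0xa0 as SPACE/BLANK) */
};

static enum sjis_class
sjis_classify(unsigned char c)
{
	if (c < 0x80)
		return SJIS_ASCII;
	if (c >= 0xa1 && c <= 0xdf)
		return SJIS_KANA;	/* >= 0x80, yet a complete character */
	if ((c >= 0x81 && c <= 0x9f) || (c >= 0xe0 && c <= 0xfc))
		return SJIS_LEAD;
	return SJIS_OTHER;
}

/*
 * Count characters in the first n bytes of s.  Only a lead byte consumes
 * the byte that follows it; a truncated lead byte at the end counts as one.
 */
static size_t
sjis_count_chars(const unsigned char *s, size_t n)
{
	size_t i = 0, count = 0;

	assert(s != NULL);
	while (i < n) {
		i += (sjis_classify(s[i]) == SJIS_LEAD && i + 1 < n) ? 2 : 1;
		count++;
	}
	return count;
}
```

With this classification, a test such as "c >= 0x80" would wrongly treat 0xb1 (a half-width kana) as part of a multi-byte sequence, which is the case the reply describes.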