From owner-freebsd-stable@freebsd.org Sun Nov 6 21:30:18 2016 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 8A954C3344E for ; Sun, 6 Nov 2016 21:30:18 +0000 (UTC) (envelope-from stb@lassitu.de) Received: from gilb.zs64.net (gilb.zs64.net [IPv6:2a00:14b0:4200:32e0::1ea]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "gilb.zs64.net", Issuer "Let's Encrypt Authority X3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 57B0B1A6; Sun, 6 Nov 2016 21:30:18 +0000 (UTC) (envelope-from stb@lassitu.de) Received: by gilb.zs64.net (Postfix, from stb@lassitu.de) id D3ACA1E2420; Sun, 6 Nov 2016 21:30:16 +0000 (UTC) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 10.1 \(3251\)) Subject: Re: Uppercase RE matching problems in FreeBSD 11 From: Stefan Bethke In-Reply-To: Date: Sun, 6 Nov 2016 22:30:16 +0100 Cc: Baptiste Daroussin , Greg Rivers , freebsd-stable@freebsd.org Content-Transfer-Encoding: quoted-printable Message-Id: References: <20161106110729.z2px7mzlhcwxvrvu@ivaldir.etoilebsd.net> <29451103-E8DB-4656-A5BB-AEB924A728D6@lassitu.de> To: Stefan Ehmann X-Mailer: Apple Mail (2.3251) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 06 Nov 2016 21:30:18 -0000 > Am 06.11.2016 um 22:14 schrieb Stefan Ehmann : >=20 >> That is rather surprising. Is there a normative reference for the >> treatment of bracket expressions and character classes when using >> locales other than C and/or encodings like UTF-8? >=20 > I found an interesting article about this issue in gawk: > = https://www.gnu.org/software/gawk/manual/html_node/Ranges-and-Locales.html= OK, I give up. Back to jwz: "now you have two problems.=E2=80=9C Although with en_US.UTF-8 on other systems, I have not had that = experience. A quick check on stuff I have immediate access to: macOS 10.12: $ echo 'abcdABCD' | sed 's/[A-Z]/X/g=E2=80=99 abcdXXXX Ubuntu 14.04.5 $ echo 'abcdABCD' | sed 's/[A-Z]/X/g=E2=80=99 abcdXXXX FreeBSD 10-stable $ echo 'abcdABCD' | sed 's/[A-Z]/X/g' abcdXXXX Stefan --=20 Stefan Bethke Fon +49 151 14070811