From owner-freebsd-stable@freebsd.org Sun Nov 6 12:32:10 2016 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 19140C30C2F for ; Sun, 6 Nov 2016 12:32:10 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: from mail-wm0-x231.google.com (mail-wm0-x231.google.com [IPv6:2a00:1450:400c:c09::231]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 9E6BE644 for ; Sun, 6 Nov 2016 12:32:09 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: by mail-wm0-x231.google.com with SMTP id p190so133260997wmp.1 for ; Sun, 06 Nov 2016 04:32:09 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=sender:date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=xacQOZ6zpYw9iHQW28qmxL8OSYynrBVPIAkkKouEYxI=; b=MqpAyZr58SRoNJXqbSlWPcOoGfvXiWFQq9BzsBVQeqZP38LQ/7UX9CsGdpkPTGPz5h TOfhGg23QjVsZuy40qJ0pk7mGQg5kald6xPBkKEX39zF6JLY9v9M9NxH0w238P1KTp2H pIypujuuXMgQqVjXfcoHz0Fr8DXUt5elTW4FEBuMkBAR/d7u1gctu762+ds4+SfBW5p5 w8i1kY1OCnkF2vpWIP+VYvy+OaDMd0Gqn2yE4x243mMq9YNcDDn5V0vTMvWHrq73Mqoe Gd/8WsSluwCPcIKeepF/IN3WwYGGK8TmpT2iO2ttW6Hj6uD6+O5+A4O/AtMuMCTwwDuv 2nUw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:sender:date:from:to:cc:subject:message-id :references:mime-version:content-disposition:in-reply-to:user-agent; bh=xacQOZ6zpYw9iHQW28qmxL8OSYynrBVPIAkkKouEYxI=; b=BjbUCVB8no9nz/1vVJOHZSLGjXwyeZ4rxjnwZ2yPQrz+FiDgg1g9F6VGYvi6BUN7To TfL2vHxixnaLLayHJP/XWGCiKkyAlFh1ZJhIHQ7bMiupOI4D3w4kbuLeyyTxa2qKFJ3x UIjQw21sAzKTXN++m3l+ABDy0X7ALwBH3ATpqWpG6oDC/SVfXxxjANz2Lk7DHkT24p51 tfXZnfvb8sAwm5ZVoQVCxeJtWraOvKn+4IUwDDGgx2H+Ii7YZwURV9PRY6+J9lLjsboc XxQIGKElS3vpjbUYO1aX/SeBLhBE8MaFXIkPfEoMYzbEEXzZ+YvjFEGqv4ObQyJvoz/3 PpXg== X-Gm-Message-State: ABUngvcjynN+mxGU3B1D6SEBbElDxC59aYsSsKtsFUvvOYPE7xBpmvDa2yOh+ealy3uYpQ== X-Received: by 10.194.179.41 with SMTP id dd9mr2073432wjc.122.1478435527671; Sun, 06 Nov 2016 04:32:07 -0800 (PST) Received: from ivaldir.etoilebsd.net ([2001:41d0:8:db4c::1]) by smtp.gmail.com with ESMTPSA id hy10sm24926300wjb.10.2016.11.06.04.32.06 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 06 Nov 2016 04:32:06 -0800 (PST) Sender: Baptiste Daroussin Date: Sun, 6 Nov 2016 13:32:06 +0100 From: Baptiste Daroussin To: Mark Martinec Cc: freebsd-stable@freebsd.org Subject: Re: Uppercase RE matching problems in FreeBSD 11 Message-ID: <20161106123206.yr2dhtd2ce7ln27a@ivaldir.etoilebsd.net> References: <20161106110729.z2px7mzlhcwxvrvu@ivaldir.etoilebsd.net> <71a45ece6ec63bf696edab5b31abdaf5@ijs.si> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="zgfhybgeirpmmsh5" Content-Disposition: inline In-Reply-To: <71a45ece6ec63bf696edab5b31abdaf5@ijs.si> User-Agent: NeoMutt/20161104 (1.7.1) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 06 Nov 2016 12:32:10 -0000 --zgfhybgeirpmmsh5 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Sun, Nov 06, 2016 at 01:26:51PM +0100, Mark Martinec wrote: > 2016-11-06 12:07, Baptiste Daroussin wrote: > > Yes A-Z only means uppercase in an ASCII only world in a unicode world > > it means > > AaBb... Z because there are way more characters that simple A-Z. In > > FreeBSD 11 > > we have a unicode collation instead of falling back in on LC_COLLATE=3DC > > which > > means ascii only > >=20 > > For regrexp for example one should use the classes: :upper: or :lower:. >=20 > It is a good idea to keep LC_COLLATE and LC_NUMERIC (and LC_MONETARY?) at > "C" > when LANG or LC_CTYPE is set to something else, otherwise unexpected > things may happen. >=20 In scripts clearly, the collation rules, numeric rules and monetary rules m= ay vary depending on the locale. Best regards, Bapt --zgfhybgeirpmmsh5 Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQIcBAABCAAGBQJYHyLGAAoJEGOJi9zxtz5ataQP/1T9MGo61fPefw9irgpooc2C xwi+NVMhDVflW5mdm623Ax+jIvEtfTg3Lr8pc47MTfWKYq799YVB3qgghb0CrUtb L+fpEo6QQYT/2vj/wduSdAGGBWbvwh0PybS7gFZZB8PJVOjUwylIFmaAk4pFAERj 2SCb7gW/GpgWED8MVBoOtZ+UVjnmdDZWNSeifn1CHu2fdV+s9tBokWB0eAjC4L5B pKLoX/yUTmPDRzw37u1PkZVwkdGUJFhCdTnvgxG3RoA8bv2M0GWRL8e3rqKDh5xA OrN9PJubL0HDBZyh1EXCGkX178CEV5P6poytA3OcyCvC0uSi3PIoFeEIz1ha3YYV Mko3bFbkbpKihRpAflO3gbSmVNiyeCNZ5E65CrI3kEoCvsOrOYwaEqItBbjr4OaP e9ZaHp/3pmyr8AB3nP6s/7uthakEdphLonfS05QuCmd9HxMwFi8rUi2cDKWeIrvY WzL74g0eAjkc9Hug5PEA9QLDzqf0rI2lABaxLaZXPvWh0vowkXscrTJPL1sdIFte briWLAWUvmoTKmdXn6Lrwt8NXx/pROnLCWWVW9eB884UnQYCBBlfwro2WBn25qiq nUBEOAjqPBLoPZIqHI7GXZNM6ldjn0dX8wMtLTyrPKw5lOoLVrdo0fvMNvQPXVI0 wBOIco+QBaeolBeqBIr1 =Wau7 -----END PGP SIGNATURE----- --zgfhybgeirpmmsh5--