From owner-freebsd-stable@freebsd.org Mon Nov 7 21:13:48 2016 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 47149C341D9 for ; Mon, 7 Nov 2016 21:13:48 +0000 (UTC) (envelope-from cswiger@mac.com) Received: from mail-in2.apple.com (mail-out2.apple.com [17.151.62.25]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 27DFEED1 for ; Mon, 7 Nov 2016 21:13:47 +0000 (UTC) (envelope-from cswiger@mac.com) Received: from relay5.apple.com (relay5.apple.com [17.128.113.88]) by mail-in2.apple.com (Apple Secure Mail Relay) with SMTP id 35.B9.09085.58EE0285; Mon, 7 Nov 2016 13:13:41 -0800 (PST) X-AuditID: 11973e11-0d5e19a00000237d-24-5820ee8533ca Received: from [17.149.234.89] (Unknown_Domain [17.149.234.89]) (using TLS with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by relay5.apple.com (Apple SCV relay) with SMTP id BD.72.27929.58EE0285; Mon, 7 Nov 2016 13:13:41 -0800 (PST) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 10.1 \(3251\)) Subject: Re: Uppercase RE matching problems in FreeBSD 11 From: Charles Swiger In-Reply-To: Date: Mon, 7 Nov 2016 13:13:40 -0800 Cc: freebsd-stable Content-Transfer-Encoding: quoted-printable Message-Id: <99E209EA-75B0-430D-8F0C-E51D614143BA@mac.com> References: <20161106110729.z2px7mzlhcwxvrvu@ivaldir.etoilebsd.net> <29451103-E8DB-4656-A5BB-AEB924A728D6@lassitu.de> <20161106210628.hg3dcpozfjtuo3nt@ivaldir.etoilebsd.net> <20161106212729.z2edg44kg7hc4r2z@ivaldir.etoilebsd.net> To: Stefan Bethke X-Mailer: Apple Mail (2.3251) X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFlrALMWRmVeSWpSXmKPExsUi2FAYodv6TiHC4PwlbovDzUIWpw8tYHNg 8pjxaT6Lx6K5/5kCmKK4bFJSczLLUov07RK4Mja92cBY8Jmt4sGvFuYGxr2sXYycHBICJhLH ty8Fsrk4hAT2MkrM3DMVLvH/7xYWiMRUJolzE2ezgySYBdQl/sy7xAxi8wroS1z59oyxi5GD Q1jAUmLz63oQk01ATWLCRB6QCk4BO4nepm8sIGEWARWJ1w1eEEP0Jb4sfcEMYWtLLFv4Gmqg lcTVUweYQGwhgR5miatnPUBsEQElibNTbjNCXCYr8en5T3aQyyQEPrJKfN28km0Co+AsJMfN QnLcLCQ7FjAyr2IUyk3MzNHNzDPSSywoyEnVS87P3cQICtLpdoI7GI+vsjrEKMDBqMTD+6Jf IUKINbGsuDL3EKM0B4uSOO8mftkIIYH0xJLU7NTUgtSi+KLSnNTiQ4xMHJxSDYxOso9PzTq+ 9FNWP093RrDRj8SdxoY+FhcrN8qUFt8KOOv0OMPSokmSJ0jW+YvhY78FOxmu7FBbrKXuMbFC MP1qb19JwNr/5pwyzwzeaQaXNa+xfq2xMmiXlk/9tik7GJPeMUVsy/fNPuPLlXV680Tujdaq r2Wilu5XfJgVFNRylG+jUkp/kxJLcUaioRZzUXEiAJAUmdkzAgAA X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFprPLMWRmVeSWpSXmKPExsUiOPVVpG7rO4UIg1OrOCwONwtZnD60gM2B yWPGp/ksHovm/mcKYIrisklJzcksSy3St0vgytj0ZgNjwWe2ige/WpgbGPeydjFyckgImEj8 /7uFBcIWk7hwbz1bFyMXh5DAVCaJcxNns4MkmAXUJf7Mu8QMYvMK6Etc+faMsYuRg0NYwFJi 8+t6EJNNQE1iwkQekApOATuJ3qZvLCBhFgEVidcNXhBD9CW+LH3BDGFrSyxb+BpqoJXE1VMH mEBsIYEeZomrZz1AbBEBJYmzU24zQlwmK/Hp+U/2CYz8s5DcMwvJPbOQjF3AyLyKUaAoNSex 0lQvsaAgJ1UvOT93EyMo2BoKI3Yw/l9mdYhRgINRiYf3Rb9ChBBrYllxZe4hRgkOZiUR3sVP gEK8KYmVValF+fFFpTmpxYcYpTlYlMR5r3XIRwgJpCeWpGanphakFsFkmTg4pRoYZ7VJmm57 47EsN/pwUGpC0LF0Rv5nW89ZSH37M/eK/7m8ysUuM3gXXNDU0xGfMT/24Ocv1fO1yvSPH+0z FRL5kZ9zP0pc78mknxqes7NFyx6ur7uza3tRFPurJQ8uNZol1sSvLBV47HDrkMev666H6tNP 33RjL94hlf3W84b7zx3+rilPeKs9lFiKMxINtZiLihMBtgmhzDICAAA= X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 07 Nov 2016 21:13:48 -0000 On Nov 6, 2016, at 1:49 PM, Stefan Bethke wrote: > Am 06.11.2016 um 22:27 schrieb Baptiste Daroussin : >> That works for POSIX locale aka C aka ASCII only world >=20 > So what do I set my LANG and LC variables to? I do want UTF-8, but I = do also want my scripts to continue to work. Clearly, en_US.UTF-8 is = not what I want. Is it C.UTF-8? Or do I set LANG=3Den_US.UTF-8 and = LC_COLLATE=3DC? If you want to use a UTF8 locale, then you must start using character = classes like '[:upper:]' and '[:lower:]' because those will-- or at = least "should", modulo bugs-- properly handle the collation issues = including for languages which do not possess a 1-1 mapping between upper = and lower case letters. Someone with a German email address is presumably familiar with =C3=9F / = Eszett...? :-) Regards, --=20 -Chuck