From owner-freebsd-current@freebsd.org Wed Jul 20 19:31:06 2016 Return-Path: Delivered-To: freebsd-current@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 4FE2EB9E3AD for ; Wed, 20 Jul 2016 19:31:06 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: from mail-wm0-x233.google.com (mail-wm0-x233.google.com [IPv6:2a00:1450:400c:c09::233]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id D4C641A67 for ; Wed, 20 Jul 2016 19:31:05 +0000 (UTC) (envelope-from baptiste.daroussin@gmail.com) Received: by mail-wm0-x233.google.com with SMTP id o80so83121924wme.1 for ; Wed, 20 Jul 2016 12:31:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=sender:date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=zwGtq6OhB50WW1DFRfGdSbw09epXPkLCCuaT48IhMp0=; b=f8gre4YpS64YaNTXpFYmRZhPSFboe8oaDOc707anaXt5EZSLCy966EZ8FUgnZ+DFFe hNYkZAL01R8p1kk9jxsfo7uR6NkM31gIAM74tPk2f4UcM04dOSjGmfbywOxrzqz+kaOq taOEmeZ8WEPdtknEgpr4waBchNOzJDy94LuQj7UrQEBCVEX7XtH0Q8m1iivIjvOq1DQ4 VBdipk6g3gFENidJKXfun7iSN9OhICbk3wQbtTbDpwOWawBAof+FU6KFZSo291Ovqxfl l0Of92gHPg8/yT6jiPQSEAevengKyf0yeIrcgIiBzOMtStwDhJVSDTJC7XbhWGkrIBRl 8B6Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:sender:date:from:to:cc:subject:message-id :references:mime-version:content-disposition:in-reply-to:user-agent; bh=zwGtq6OhB50WW1DFRfGdSbw09epXPkLCCuaT48IhMp0=; b=Px3DMP7VvD4zU+Ufv7x10TuZqvsy0Q35QDchIZSSFgalPJQJHM25Xlhh+sLObBTB4p S/pUxUuu0Sv6lyMCWl5O6bMgxSXzrjY1PdhPV7zUFFa8cfemBqUX2q3l4ox8DMSu9JBk ZdTS1ukzGyID6Gy6yesPVn407hG0Kn14yGwwW7fnlXGQt5mf/njkqpf3bjGQ8i3N9Erw F1eoT57Tbj/MtUO9KITU4svpXuR0hyiq8zTXqRkAwjpWY3Kujg2GEzyQBdN1ROQ+5+b5 WMcYIVNXYI/Fix4prs/I8D6v6Fd2lU8Fk6Nu91+3s/jLDM/qqXrOEZs3U2HKgpVLUg1S yiKw== X-Gm-Message-State: ALyK8tJnxgvPkG38N5L7Au14S/MvVdR5CP1jBfTztRgFPMmH35m7fMh70i09z/aTCOyMxQ== X-Received: by 10.28.203.6 with SMTP id b6mr1149770wmg.78.1469043063314; Wed, 20 Jul 2016 12:31:03 -0700 (PDT) Received: from ivaldir.etoilebsd.net ([2001:41d0:8:db4c::1]) by smtp.gmail.com with ESMTPSA id e4sm2688422wjy.20.2016.07.20.12.31.01 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 20 Jul 2016 12:31:02 -0700 (PDT) Sender: Baptiste Daroussin Date: Wed, 20 Jul 2016 21:31:01 +0200 From: Baptiste Daroussin To: Tim =?utf-8?Q?=C4=8Cas?= Cc: freebsd-current@freebsd.org Subject: Re: UTF-8 by default? Message-ID: <20160720193101.jggjyak2vqdnkivr@ivaldir.etoilebsd.net> References: <20160720140741.yi7vfgmmqtg6eprx@ivaldir.etoilebsd.net> <201607201833.u6KIXEpB054887@gw.catspoiler.org> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="6qrw4drxbthtfbnj" Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.6.1-neo (2016-06-11) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.22 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 20 Jul 2016 19:31:06 -0000 --6qrw4drxbthtfbnj Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Wed, Jul 20, 2016 at 09:22:23PM +0200, Tim =C4=8Cas wrote: > On 20 July 2016 at 20:33, Don Lewis wrote: > > wc(1) has problems with its multibyte support pointed out by Coverity > > as I recall. >=20 > Not sure how critical that issue is (e.g. byte counts [`-c`], line > counts [`-l`], and such should still work as intended; whether word > counts work or not depends on whether we should count Unicode > whitespace as, well, whitespace). I do wonder if everyone agrees that > an effort should be made towards UTF-8 default, though? >=20 > I'm willing to contribute some of my time to fixing these bugs, but I > don't think I can fix *all* of this by myself. I guess wc(1) is as > good a start as any, but I'd first like to talk to whoever is the > maintainer for that bit of code, as I've never done any work in base > before (only in ports). good I would recommand to have a look at work done in OpenBSD in that regar= ds, since about a year Ingo Schwarze is atting UTF-8 support to all the tool in their base system, including wc(1) not sure how theirs differs from our. If working on that do not hesitate to push the changes you do propose in phabricator: https://reviews.freebsd.org and add me as a reviewer https://wiki.freebsd.org/CodeReview might be useful to determine how to sim= ply use phabricator. Do not hesitate to mail me if you need any help in that area. Best regards, Bapt --6qrw4drxbthtfbnj Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iQIcBAEBCAAGBQJXj9F1AAoJEGOJi9zxtz5aZnYP/37bszn+qCCB0VQCY5E10Cgc VTDC6BMZm2LRFSIW5qzULsQ9DWV6DcXe9qFL3WATVqxileELvoeOBqD805tz2vLP e5DhNYsuuTTzsBose+ozcHPvNbOHavI7rzCyOk+irkfaiBoI3s/z3d8Grn3HHAW1 4zMWyK8Nc2s6UXSwInvJMKk362Kqq5K3Y0n7Q0vLSp0ujitSEK192fujYz/D6Y23 FXs8LJTCUMhB7PWeCcbJ2Osr/Gn78OTgNtJtNsIMl/PVx+te+pEQ+7JElIYztDbi VXZI1+Ac074TijnTm5V9Kh4yQ6UyqfWSDIFIWjTu5AYwtvzL195/Cr4ArPi9pcF+ sj32Hx3Q2Qj4PJC+odUuJpt5WMYQXkMxGdVz8f9GQDJq/k6my4SpQO1YzEPbPbbZ UbP+sf1acFoaS6thAY1g4sbE/JPH3g9YNfT4MxFGCW0l7unW8RDul0jpYNfwfkrf Ubv/iBHMLHnvyKmY/hhOvy0llmpt/OyB9bsoThe0zEWvXarzcLgXwjIFI1lkET20 X4T5R31seqJtXGCNKmixpHbXEYbZKlcLE6BqWqcq+CPGAUurYn8E74fnxvGsisVw fMxGCUda/WSNmzpltGUlx3GM2+roZA+YyyJn/JLY99i11EJScAPMza6gqF2SY2sx L77P/NcG4OrdRDv9Fyf+ =ilV8 -----END PGP SIGNATURE----- --6qrw4drxbthtfbnj--