From owner-svn-src-head@freebsd.org Thu Mar 2 06:56:17 2017 Return-Path: Delivered-To: svn-src-head@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id B5348CF5A20; Thu, 2 Mar 2017 06:56:17 +0000 (UTC) (envelope-from cse.cem@gmail.com) Received: from mail-wm0-f46.google.com (mail-wm0-f46.google.com [74.125.82.46]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "Google Internet Authority G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 5290B86C; Thu, 2 Mar 2017 06:56:17 +0000 (UTC) (envelope-from cse.cem@gmail.com) Received: by mail-wm0-f46.google.com with SMTP id v186so127266106wmd.0; Wed, 01 Mar 2017 22:56:16 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:reply-to:in-reply-to:references :from:date:message-id:subject:to:cc:content-transfer-encoding; bh=tNlKxbahByIWpo7qner9kFKWcQLc5WRU/6WgMV55RTU=; b=TP0Igy/ZmKdO7CA0D/XooroXhgHbd2JYi+xclQrFCrdnhbK51Ec+N2rCj5qu9d1DQc 1ctR1Tn+0tumEke5GBDiijSr370E1Lh7Z5/W+uVcEXQcEibEORTcs/JiTLQEwLT782Ss eJv0g0vvIxwwI0P/1HT7zFwA5vWJMSaKUTCdxTasg7D2d53BD1Y69QjXnjKNe855PM0V y5iqRvtSCkZBhlRRTk3qs6mHfRb+iw/+MGwlpRkF7cbP5slDZwWgQv1xEwWZs51DZT+5 itPvQcB4gosOZwi8wDrWQvDmawbBZzd73S+7EVLFdVkGp0BNDOdo1qUGajU1JZfL6FQN T5Gg== X-Gm-Message-State: AMke39nztfVUFr4RZH4l+0SwriSzSOT07/ADKo5AjseXUgxpYNWP3fXawJs+IDB1hBC4/A== X-Received: by 10.28.170.4 with SMTP id t4mr6575362wme.89.1488434073716; Wed, 01 Mar 2017 21:54:33 -0800 (PST) Received: from mail-wm0-f49.google.com (mail-wm0-f49.google.com. [74.125.82.49]) by smtp.gmail.com with ESMTPSA id 136sm9704732wms.32.2017.03.01.21.54.32 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 01 Mar 2017 21:54:33 -0800 (PST) Received: by mail-wm0-f49.google.com with SMTP id v186so126480222wmd.0; Wed, 01 Mar 2017 21:54:32 -0800 (PST) X-Received: by 10.28.221.11 with SMTP id u11mr6269176wmg.75.1488434072834; Wed, 01 Mar 2017 21:54:32 -0800 (PST) MIME-Version: 1.0 Reply-To: cem@freebsd.org Received: by 10.80.152.82 with HTTP; Wed, 1 Mar 2017 21:54:32 -0800 (PST) In-Reply-To: <20170302162120.C8136@besplex.bde.org> References: <201701310326.v0V3QW30024375@repo.freebsd.org> <20170202184819.GP2092@kib.kiev.ua> <20170203062806.A2690@besplex.bde.org> <20170228121335.Q2733@besplex.bde.org> <20170302162120.C8136@besplex.bde.org> From: Conrad Meyer Date: Wed, 1 Mar 2017 21:54:32 -0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: svn commit: r313006 - in head: sys/conf sys/libkern sys/libkern/x86 sys/sys tests/sys/kern To: Bruce Evans Cc: src-committers , svn-src-all@freebsd.org, svn-src-head@freebsd.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-BeenThere: svn-src-head@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: SVN commit messages for the src tree for head/-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 02 Mar 2017 06:56:17 -0000 On Wed, Mar 1, 2017 at 9:27 PM, Bruce Evans wrote: > On Wed, 1 Mar 2017, Conrad Meyer wrote: > >> On my laptop (Intel(R) Core(TM) i5-3320M CPU =E2=80=94 Ivy Bridge) I sti= ll see >> a little worse performance with this patch. Please excuse the ugly >> graphs, I don't have a better graphing tool set up at this time: >> >> https://people.freebsd.org/~cem/crc32/sse42_bde.png >> https://people.freebsd.org/~cem/crc32/sse42_bde_log.png > > > Try doubling the loop sizes. There shouldn't be any significant differen= ce > above size 3*LONG unless LONG is too small. Apparently it is too small f= or > older CPUs. > > I now have a Sandybridge i5-2xxx laptop to test on, but don't have it set > up for much yet. > > Bruce Hi Bruce, Doubling the loop sizes seems to make it slightly worse, actually: https://people.freebsd.org/~cem/crc32/sse42_bde2.png https://people.freebsd.org/~cem/crc32/sse42_bde_log2.png I haven't made any attempt to inspect the generated assembly. This is Clang 3.9.1 with -O2. Best, Conrad