From owner-freebsd-current Sun Jan 4 21:40:42 1998 Return-Path: Received: (from root@localhost) by hub.freebsd.org (8.8.7/8.8.7) id VAA04756 for current-outgoing; Sun, 4 Jan 1998 21:40:42 -0800 (PST) (envelope-from owner-freebsd-current) Received: from ganymede.or.intel.com (root@ganymede.or.intel.com [134.134.248.3]) by hub.freebsd.org (8.8.7/8.8.7) with ESMTP id VAA04600; Sun, 4 Jan 1998 21:39:25 -0800 (PST) (envelope-from wscott@ichips.intel.com) Received: from ichips-jf.jf.intel.com (ichips-jf.jf.intel.com [134.134.50.200]) by ganymede.or.intel.com (8.8.6/8.8.5) with ESMTP id VAA09293; Sun, 4 Jan 1998 21:56:53 -0800 (PST) Received: from pdxlx008.pdx.intel.com (pdxlx008.pdx.intel.com [137.102.206.194]) by ichips-jf.jf.intel.com (8.8.8/8.8.7) with ESMTP id VAA17220; Sun, 4 Jan 1998 21:46:17 -0800 (PST) Received: from pdxlx008.pdx.intel.com (localhost [127.0.0.1]) by pdxlx008.pdx.intel.com (8.8.7/8.8.7) with ESMTP id VAA02628; Sun, 4 Jan 1998 21:39:09 -0800 (PST) (envelope-from wscott@pdxlx008.pdx.intel.com) Message-Id: <199801050539.VAA02628@pdxlx008.pdx.intel.com> To: "Russell L. Carter" cc: dyson@FreeBSD.ORG, garbanzo@hooked.net (Alex), current@FreeBSD.ORG Subject: Re: Pentium optimizations In-reply-to: Your message of "Wed, 17 Dec 1997 08:42:33 MST." <199712171542.IAA09962@dnstoo.consys.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Date: Sun, 04 Jan 1998 21:39:09 -0800 From: Wayne Scott Sender: owner-freebsd-current@FreeBSD.ORG X-Loop: FreeBSD.org Precedence: bulk > The out-of-order execution seems to help a lot. Oh, and the P5 > specific asm actually makes the PPro slow down over the C source; > not good tidings for ye merry old tuners. > > }areas of reasonable payoffs, and lots of "obvious" optimizations that > }end up being neutral. > > Yep. I wouldn't worry too much about other people's claims about code > optimized for Pentium Pro. > > Russell When optimizing for the P6, the main things you should worry about are: branch prediction hitting the caches No unaligned data, and no partials. For example, for branch prediction, the compile should try to make all branches you don't take go forward so they don't occupy the BTB. Code selection does matter, but only is special cases. =Wayne Wayne Scott MD6 Architecture - Intel Corp. wscott@ichips.intel.com Work #: (503) 264-4165 Disclaimer: All views expressed are my own opinions, and not necessarily those of Intel Corporation.