From owner-freebsd-hackers Tue Mar 6 4:46:43 2001
Message-ID: <3AA4DC2D.F72208C7@wgate.com>
Date: Tue, 06 Mar 2001 07:46:37 -0500
From: Michael Sinz
Organization: WorldGate Communications Inc.
To: hackers@FreeBSD.ORG
Subject: Re: Machines are getting too damn fast

Matt Dillon wrote:
> Subject: Re: Machines are getting too damn fast
>
> :throughput.  For example, on the PIII-850 (116MHz FSB and SDRAM, it's
> :overclocked) here on my desk with 256KB L2 cache:
> :
> :dd if=/dev/zero of=/dev/null bs=512k count=4000
> :4000+0 records in
> :4000+0 records out
> :2097152000 bytes transferred in 8.229456 secs (254834825 bytes/sec)
> :
> :dd if=/dev/zero of=/dev/null bs=128k count=16000
> :16000+0 records in
> :16000+0 records out
> :2097152000 bytes transferred in 1.204001 secs (1741819224 bytes/sec)
> :
> :Now THAT is a significant difference. :-)
>
> Interesting.  I get very different results with the 1.3 GHz P4.  The
> best I seem to get is 1.4 GBytes/sec.  I'm not sure what the L2 cache
> is on the box, but it's definitely a consumer model.
>
> dd if=/dev/zero of=/dev/null bs=512k count=4000
> 2097152000 bytes transferred in 2.363903 secs (887156520 bytes/sec)
>
> dd if=/dev/zero of=/dev/null bs=128k count=16000
> 2097152000 bytes transferred in 1.471046 secs (1425619621 bytes/sec)
>
> If I use lower block sizes the syscall overhead blows up the
> performance (it gets lower rather than higher).  So I figure I don't
> have as much L2 as on your system.

The P4 has other issues when you don't run straight-line code.  Any
branch misprediction costs a minimum of 20 cycles because of the
pipeline, plus whatever cache/fetch/decode hits you take on the actual
target.  This may be why you get lower values than on a PIII or Athlon
(both have a significantly lower penalty for branch misprediction).

--
Michael Sinz ---- Worldgate Communications ---- msinz@wgate.com
A master's secrets are only as good as
    the master's ability to explain them to others.
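
For anyone who wants to check the arithmetic behind the numbers in this
thread, here is a rough sketch in Python.  Nobody in the thread posted it,
and the names (RUNS, throughput, speedup, mispredict_ns) are made up for
illustration.  It recomputes the bytes/sec that dd reports, compares the
128k and 512k runs on each machine, and converts the 20-cycle misprediction
minimum into time, assuming the P4 runs at its nominal 1.3 GHz.

```python
# Figures copied from the dd runs quoted in this thread.
RUNS = {
    # (machine, block size): (bytes transferred, elapsed seconds)
    ("PIII-850", "512k"): (2097152000, 8.229456),
    ("PIII-850", "128k"): (2097152000, 1.204001),
    ("P4-1.3GHz", "512k"): (2097152000, 2.363903),
    ("P4-1.3GHz", "128k"): (2097152000, 1.471046),
}


def throughput(nbytes, secs):
    """Bytes per second, the same ratio dd prints in parentheses."""
    return nbytes / secs


def speedup(machine):
    """How many times faster the 128k run is than the 512k run."""
    small = throughput(*RUNS[(machine, "128k")])
    large = throughput(*RUNS[(machine, "512k")])
    return small / large


def mispredict_ns(cycles, clock_hz):
    """Time lost to a pipeline flush of `cycles` cycles, in nanoseconds."""
    return cycles / clock_hz * 1e9


for machine in ("PIII-850", "P4-1.3GHz"):
    print(f"{machine}: 128k is {speedup(machine):.2f}x faster than 512k")

# Assumption: the 20-cycle minimum is paid at the nominal 1.3 GHz clock.
print(f"20-cycle mispredict at 1.3 GHz: {mispredict_ns(20, 1.3e9):.1f} ns")
```

On these figures the PIII gains roughly 6.8x from the smaller block size
and the P4 only about 1.6x.  A 20-cycle flush at 1.3 GHz works out to
about 15 ns per mispredicted branch, before any cache or fetch penalty on
the target.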