Date: Wed, 28 Nov 2001 22:48:05 -0800 From: Peter Wemm <peter@wemm.org> To: Luigi Rizzo <luigi@FreeBSD.org> Cc: cvs-committers@FreeBSD.org, cvs-all@FreeBSD.org Subject: Re: cvs commit: src/sys/pci if_sis.c Message-ID: <20011129064805.C37793808@overcee.netplex.com.au> In-Reply-To: <20011128141510.A13586@iguana.aciri.org>
next in thread | previous in thread | raw e-mail | index | archive | help
Luigi Rizzo wrote: > > While this helps things like packet forwarding, it hurts things like > > and generic servers (web, proxies) where things are done in > userland and the content is opaque and the only unaligned > accesses are for the IP/TCP headers (but those are touched > already in the packet forwarding case). > > > NFS which now have to do lots and lots of unaligned accesses. > > I would actually like to see some numbers showing that this is the > case. Where else these unaligned accesses could be other than in > creating the NFS/RPC headers ? Do a bunch of unaligned accesses > really cost more than a memory-to-memory copy of 1500 bytes ? Even just the IP, TCP and UDP header processing is affected. > > Have you benchmarked anything else besides packet forwarding? > > no, how would you benchmark this (that is without hitting a > bottleneck elsewhere in the system) ? You dont need to hit the wall, supply a constant stream of requests and measure the cpu used in interrupt or system mode. To show that unaligned accesses do have a measurable effect: char buf[100000*4]; main() { int i; int j; int n; int *p; j = 0; for (n = 0; n < 10000; n++) { p = (int *)&buf[OFF]; for (i = 0; i < 99999; i++) j += *p++; } exit(j); } On an AthlonMP (smp kernel, smp is running, my X11 desktop) peter@daintree[10:19pm]~-192> cc -O2 -DOFF=0 -o b0 b.c peter@daintree[10:19pm]~-193> cc -O2 -DOFF=1 -o b1 b.c peter@daintree[10:19pm]~-194> cc -O2 -DOFF=2 -o b2 b.c peter@daintree[10:19pm]~-195> cc -O2 -DOFF=3 -o b3 b.c peter@daintree[10:19pm]~-196> set time peter@daintree[10:20pm]~-198> ./b0 ; ./b1 ; ./b0 ; ./b2 ; ./b0 ; ./b3 8.876u 0.023s 0:08.97 99.1% 5+671k 0+0io 0pf+0w 9.154u 0.007s 0:09.23 99.1% 5+671k 0+0io 0pf+0w 8.901u 0.000s 0:08.97 99.2% 5+671k 0+0io 0pf+0w 9.157u 0.007s 0:09.23 99.1% 5+671k 0+0io 0pf+0w 8.883u 0.015s 0:08.96 99.2% 5+670k 0+0io 0pf+0w 9.147u 0.015s 0:09.22 99.2% 5+671k 0+0io 0pf+0w On a Pentium4: peter@pentium4[10:25pm]/home/tmp-11> ./b0 ; ./b1 ; ./b0 ; ./b2 ; ./b0 ; ./b3 3.229u 0.000s 0:03.23 100.0% 5+673k 0+0io 0pf+0w 4.464u 0.000s 0:04.46 100.0% 5+672k 0+0io 0pf+0w 3.236u 0.000s 0:03.23 100.0% 5+671k 0+0io 0pf+0w 4.464u 0.000s 0:04.46 100.0% 5+672k 0+0io 0pf+0w 3.235u 0.000s 0:03.23 100.0% 5+671k 0+0io 0pf+0w 4.464u 0.000s 0:04.46 100.0% 5+670k 0+0io 0pf+0w On a Pentium3 (coppermine): > ./b0 ; ./b1 ; ./b0 ; ./b2 ; ./b0 ; ./b3 14.710u 0.000s 0:14.71 100.0% 5+671k 0+0io 0pf+0w 14.728u 0.000s 0:14.73 99.9% 5+671k 0+0io 0pf+0w 14.718u 0.000s 0:14.71 100.0% 5+671k 0+0io 0pf+0w 14.720u 0.007s 0:14.73 99.9% 5+671k 0+0io 0pf+0w 14.718u 0.000s 0:14.71 100.0% 5+671k 0+0io 0pf+0w 14.735u 0.000s 0:14.73 100.0% 5+670k 0+0io 0pf+0w On a Pentuim Pro (200MHz, I reduced the outer loop from 10000 to 1000): > ./b0 ; ./b1 ; ./b0 ; ./b2 ; ./b0 ; ./b3 3.624u 0.007s 0:03.65 99.1% 5+677k 0+0io 0pf+0w 3.673u 0.007s 0:03.68 99.7% 5+673k 0+0io 0pf+0w 3.623u 0.015s 0:03.65 99.4% 5+674k 0+0io 0pf+0w 3.663u 0.007s 0:03.68 99.4% 5+671k 0+0io 0pf+0w 3.639u 0.007s 0:03.65 99.4% 5+674k 0+0io 0pf+0w 3.684u 0.000s 0:03.69 99.7% 5+673k 0+0io 0pf+0w The most spectacular sufferer of unaligned accesses is the Pentium-4 which takes ~38% longer to do unaligned accesses... I suspect writes are going to be more prounced, especially on systems with ECC that have to do read/merge/write for every unaligned write. > > > Right now the new behaviour is controlled by a sysctl variable, > > > hw.sis_quick which defaults to 1 (on), you can set it to 0 to > ... > > > > Please do not remove this yet. > > no problem. It will actually be useful to tell people who have > a reasonable testbed to toggle this and see if it makes a difference. Cheers, -Peter -- Peter Wemm - peter@FreeBSD.org; peter@yahoo-inc.com; peter@netplex.com.au "All of this is for nothing if we don't go to the stars" - JMS/B5 To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe cvs-all" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20011129064805.C37793808>