From: Andrew Gallatin
Date: Fri, 10 Feb 2006 10:22:24 -0500 (EST)
To: Poul-Henning Kamp
Cc: current@freebsd.org
Subject: Re: [TEST/REVIEW] cpu time accounting patch, step 2
Message-ID: <17388.44976.250463.383429@grasshopper.cs.duke.edu>
In-Reply-To: <1835.1139524338@critter.freebsd.dk>

Poul-Henning Kamp writes:
 > Tests, reviews etc please.

Here are some benchmark results from my "AMD Athlon(tm) 64 X2 Dual Core
Processor 3800+" running an SMP kernel, for both loopback and 10GbE TCP
networks.

The executive summary is that there's a 41% improvement in loopback
pingpong, and a 12.5% improvement in a pingpong test with a remote
linux machine over 10GbE. Since the linux machine isn't infinitely
fast, the improvement is probably a bit more than 12.5% ;)

The machine seemed stable, and I haven't seen anything negative except
for a tiny slowdown in sending using TCP_SENDFILE.

Thank you so much for doing this!!

Drew


Test data:

machdep.cpu_idle_hlt=0
net.isr.direct=1
dev.myri10ge.0.skip_pio_read=1

Everything else default except where noted for 10GbE TCP_RR test.


netperf -tTCP_STREAM to localhost

x loopback_stream.before
+ loopback_stream.after
    N           Min           Max        Median           Avg        Stddev
x   5       4100.11       4221.45       4187.95      4178.016     46.300782
+   5       4472.37        4567.2          4510      4519.416     38.960833
Difference at 95.0% confidence
        341.4 +/- 62.4045
        8.17134% +/- 1.49364%
        (Student's t, pooled s = 42.7885)


netperf -tTCP_RR to localhost

x loopback_rr.before
+ loopback_rr.after
    N           Min           Max        Median           Avg        Stddev
x   5      27807.53      28195.85      27873.98     27964.662     163.75044
+   5      38590.71      40167.44      39301.22      39463.23      645.1375
Difference at 95.0% confidence
        11498.6 +/- 686.411
        41.1182% +/- 2.45457%
        (Student's t, pooled s = 470.647)


netperf -tTCP_STREAM to linux machine via 10GbE (128KB socket buffers)

x 10GbE_stream_rx.before
+ 10GbE_stream_rx.after
    N           Min           Max        Median           Avg        Stddev
x   5       7655.65       7723.43       7702.66      7693.382     27.071167
+   5       7768.29       7883.34       7863.13      7830.482     57.216535
Difference at 95.0% confidence
        137.1 +/- 65.2772
        1.78205% +/- 0.848484%
        (Student's t, pooled s = 44.7581)

x 10GbE_stream_tx.before
+ 10GbE_stream_tx.after
    N           Min           Max        Median           Avg        Stddev
x   5       4640.64       4722.05       4670.05      4679.212     36.651045
+   5       4636.54       4709.22       4657.93      4671.276     31.004687
No difference proven at 95.0% confidence


netperf -tTCP_SENDFILE to linux machine via 10GbE (128KB socket buffers)

x 10GbE_sendfile_tx.before
+ 10GbE_sendfile_tx.after
    N           Min           Max        Median           Avg        Stddev
x   5       7453.74       7523.76       7492.15      7490.258     24.844414
+   5       7328.59       7430.97       7355.29      7369.944     42.886543
Difference at 95.0% confidence
        -120.314 +/- 51.1132
        -1.60627% +/- 0.682395%
        (Student's t, pooled s = 35.0464)


netperf -tTCP_RR to linux machine via 10GbE, after setting nic
interrupt coalescing parameters to their minimum setting
(dev.myri10ge.0.intr_coal_delay=1):

x 10GbE_rr.before
+ 10GbE_rr.after
    N           Min           Max        Median           Avg        Stddev
x   5      29721.79      29929.62      29863.69      29827.82      89.80339
+   6      33323.05      33703.95      33614.14     33556.268     137.33838
Difference at 95.0% confidence
        3728.45 +/- 162.431
        12.4999% +/- 0.544562%
        (Student's t, pooled s = 118.588)
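
For anyone who wants to sanity-check the "Difference at 95.0% confidence" lines, here is a minimal sketch of the pooled-variance Student's t calculation, working only from the N / Avg / Stddev columns reported above. The function name `pooled_difference` and the small `T_CRIT_95` lookup table are illustrative assumptions, not part of any tool used in this message; the critical values are standard two-sided 95% table entries for 8 and 9 degrees of freedom, which covers the sample sizes here.

```python
import math

# Two-sided 95% critical values of Student's t, keyed by degrees of freedom.
# Assumption: only the entries needed for the samples in this message are
# listed (5+5 samples -> 8 df, 5+6 samples -> 9 df). These are standard
# table values, not numbers taken from the mail itself.
T_CRIT_95 = {8: 2.306, 9: 2.262}


def pooled_difference(n1, avg1, sd1, n2, avg2, sd2):
    """Return (diff, half_width, pooled_s) for sample 2 minus sample 1.

    diff       -- difference of the two averages
    half_width -- the "+/-" part of the 95% confidence interval
    pooled_s   -- pooled standard deviation of the two samples
    """
    df = n1 + n2 - 2
    # Pooled variance: each sample variance weighted by its degrees of freedom.
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    pooled_s = math.sqrt(pooled_var)
    half_width = T_CRIT_95[df] * pooled_s * math.sqrt(1.0 / n1 + 1.0 / n2)
    return avg2 - avg1, half_width, pooled_s


if __name__ == "__main__":
    # loopback_stream: "before" row first, "after" row second.
    before_avg = 4178.016
    diff, err, s = pooled_difference(5, before_avg, 46.300782,
                                     5, 4519.416, 38.960833)
    print(f"{diff:.4g} +/- {err:.4g} (pooled s = {s:.4g})")
    # The percentage lines are the same numbers relative to the "before" average.
    print(f"{100 * diff / before_avg:.4g}% +/- {100 * err / before_avg:.4g}%")
```

Feeding in the 10GbE_rr rows (N=5 and N=6) exercises the unequal-sample-size case, and the results should agree with the reported figures to within rounding of the table value used for t.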