Skip site navigation (1)Skip section navigation (2)
Date:      Sat, 31 Dec 2005 07:12:23 +0000 (GMT)
From:      Robert Watson <rwatson@FreeBSD.org>
To:        Avleen Vig <lists-freebsd@silverwraith.com>
Cc:        freebsd-hackers@freebsd.org
Subject:   Re: My wish list for 6.1
Message-ID:  <20051231070750.T5785@fledge.watson.org>
In-Reply-To: <20051217063409.GB19094@silverwraith.com>
References:  <43A26FFB.9080405@samsco.org> <20051216104022.A20877@cons.org> <20051217063409.GB19094@silverwraith.com>

next in thread | previous in thread | raw e-mail | index | archive | help

On Fri, 16 Dec 2005, Avleen Vig wrote:

> On Fri, Dec 16, 2005 at 10:40:22AM -0500, Martin Cracauer wrote:
>>> 2.  SMP kernels for install.  Right now we only install a UP kernel, for
>>> performance reasons.  We should be able to package both a UP and SMP
>>> kernel into the release bits, and have sysinstall install both.  It
>>> should also select the correct one for the target system and make that
>>> the default on boot.
>>
>> If people are concerned about performance, I benchmarked a 6-beta kernel 
>> SMP versus UP on a socket 939 Opteron.
>
> If those results are accurate, there's no real reason not to just use an SMP 
> kernel on default install?

This is an old thread that I'm just catching up on, but I figured I'd chime in 
anyway: you have to be really careful benchmarking across CPU types and 
configurations, as the performance characteristics of important insturctions 
differ a lot across hardware variations.  For example, the performance of 
atomic operations, used to synchronize between CPUs, varies significantly by 
CP, bus configuration, etc.  On modern opteron hardware, the performance of 
inter-CPU synchronization instructions is blindingly fast.  On modern Xeon P4 
hardware, it is incredibly slow.  Software optimized for the Opteron will 
often perform much slower on Xeon P4 hardware as a result.  P3 hardware tends 
to behave a lot more like Opteron in terms of speed of insturctions relating 
to disabling interrupts, where on P4 Xeon they are proprtionally much slower. 
The critical section optimizations made by John Baldwin, and the movement to 
critical sections in UMA and kernel malloc that I made, made a big performance 
difference on Xeon P4 hardware, but relatively little difference on Opteron. 
All this seems to suggest that when comparing UP and SMP, it's useful to do it 
on nice opteron hardware, but also on P4 Xeon to prevent making decisions 
based on platform-specific properties.

Robert N M Watson



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20051231070750.T5785>