From owner-freebsd-current@FreeBSD.ORG  Wed Sep 17 21:24:34 2008
Return-Path: <owner-freebsd-current@FreeBSD.ORG>
Delivered-To: freebsd-current@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34])
	by hub.freebsd.org (Postfix) with ESMTP id 56DA61065672
	for <freebsd-current@freebsd.org>; Wed, 17 Sep 2008 21:24:34 +0000 (UTC)
	(envelope-from stephen@math.missouri.edu)
Received: from cauchy.math.missouri.edu (cauchy.math.missouri.edu
	[128.206.184.213])
	by mx1.freebsd.org (Postfix) with ESMTP id 2030E8FC13
	for <freebsd-current@freebsd.org>; Wed, 17 Sep 2008 21:24:34 +0000 (UTC)
	(envelope-from stephen@math.missouri.edu)
Received: from laptop3.gateway.2wire.net (cauchy.math.missouri.edu
	[128.206.184.213])
	by cauchy.math.missouri.edu (8.14.2/8.14.2) with ESMTP id
	m8HLNddS004404; Wed, 17 Sep 2008 16:23:40 -0500 (CDT)
	(envelope-from stephen@math.missouri.edu)
Message-ID: <48D17584.9020306@math.missouri.edu>
Date: Wed, 17 Sep 2008 16:24:20 -0500
From: Stephen Montgomery-Smith <stephen@math.missouri.edu>
User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; en-US;
	rv:1.8.1.16) Gecko/20080909 SeaMonkey/1.1.11
MIME-Version: 1.0
To: Dan Nelson <dnelson@allantgroup.com>
References: <48CDBC78.4010409@math.missouri.edu>
	<20080915195021.GA69528@cons.org>
	<48CEFF74.8020602@math.missouri.edu>
	<20080916033459.GA31220@troutmask.apl.washington.edu>
	<48CF2AEF.9070208@math.missouri.edu>
	<48CF2CA4.1000802@math.missouri.edu>
	<20080916041142.GH3188@dan.emsphone.com>
In-Reply-To: <20080916041142.GH3188@dan.emsphone.com>
Content-Type: text/plain; charset=us-ascii; format=flowed
Content-Transfer-Encoding: 7bit
Cc: Martin Cracauer <cracauer@cons.org>, freebsd-current@freebsd.org,
	Steve Kargl <sgk@troutmask.apl.washington.edu>
Subject: Re: Improved multiprocessor usage on amd64
X-BeenThere: freebsd-current@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Discussions about the use of FreeBSD-current
	<freebsd-current.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>, 
	<mailto:freebsd-current-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-current>
List-Post: <mailto:freebsd-current@freebsd.org>
List-Help: <mailto:freebsd-current-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-current>,
	<mailto:freebsd-current-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Wed, 17 Sep 2008 21:24:34 -0000

Dan Nelson wrote:
> In the last episode (Sep 15), Stephen Montgomery-Smith said:
>> Stephen Montgomery-Smith wrote:
>>> Steve Kargl wrote:
>>>> On Mon, Sep 15, 2008 at 07:36:04PM -0500, Stephen Montgomery-Smith wrote:
>>>>> ... and each thread is a loop of the form
>>>>>
>>>>> while (1) {
>>>>>   wait until told to start;
>>>>>   do massive amounts of floating point arithmetic (only additions and
>>>>> multiplications) on large arrays;
>>>>>   tell the master process that you are done;
>>>>> }
>>>>>
>>>>>> Do you have about as many threads as processor or more?
>>>>> Both ways.  The time difference between the two approaches is 
>>>>> negligible.
>>>>>
>>>> Are you using ULE?  With my MPI applications, if the number of
>>>> launched processes exceeds the number of cpus by 1, ULE falls
>>>> through the floor.  I have a nagging feeling that there is a problem 
>>>> with cpu affinity.
>>>>
>>>> http://lists.freebsd.org/pipermail/freebsd-current/2008-July/086917.html
>> Let me say a little bit more.
>>
>> I have this gut feeling that the problem has a lot to do with cache 
>> management.  My program has each thread doing, in effect, huge matrix 
>> multiplications, each one working on their own little bit.  If a CPU 
>> core changes from one thread to another, it then has to flush out the 
>> cache to RAM, and read in a whole bunch of other RAM into cache.
> 
> You can try playing with the new cpuset functions in HEAD and 7-STABLE
> to lock particular threads on certain CPUs.
> 

It was an excellent suggestion.  But it didn't make any difference.