Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 29 May 2007 14:18:32 -0700 (PDT)
From:      Jeff Roberson <jroberson@chesapeake.net>
To:        Bruce Evans <brde@optusnet.com.au>
Cc:        freebsd-arch@freebsd.org
Subject:   Re: rusage breakdown and cpu limits.
Message-ID:  <20070529141342.D661@10.0.0.1>
In-Reply-To: <20070530065423.H93410@delplex.bde.org>
References:  <20070529105856.L661@10.0.0.1> <200705291456.38515.jhb@freebsd.org> <20070529121653.P661@10.0.0.1> <20070530065423.H93410@delplex.bde.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, 30 May 2007, Bruce Evans wrote:

> On Tue, 29 May 2007, Jeff Roberson wrote:
>
>> The problem with using a pool or per-process spinlock is that it keeps the 
>> contention in the process domain, rather than thread domain.  For 
>> multithreaded processes this will give the same contention as a global 
>> scheduler lock, only slightly reduced in scope.  I'd like to solve this in 
>> such a way that we don't have to revisit it again.
>> 
>> I think I'm going to make the rusage struct per-thread and aggregate it on 
>> demand.  There will be a lot of code churn, but it will be simple. There 
>> are
>
> Ugh.
>
>> a few cases where which will be complicated, and cpulimit is one of them.
>
> No, cpulimit is simple because it can be fuzzy, unlike calcru() which require
> the rusage to be up to date.

cpulimit is complicated because it requires aggregate statistics from all 
threads like rusage.  It may be queried infrequently however.  It's just 
one of the few cases where we actually examine the values as if we still 
only have one thread per process.

>
> I see how rusage accumulation can help for everything _except_ the
> runtime and tick counts (i.e., for stuff updated by statclock()).  For
> the runtime and tick counts, the possible savings seem to be small and
> negative.  calcru() would have to run the accumulation code and the
> accumulation code would have to acquire something like sched_lock to
> transfer the per-thread data (since the lock for updating that data
> is something like sched_lock).  This is has the same locking overheads
> and larger non-locking overheads than accumulating the runtime directly
> into the rusage at context switch time -- calcru() needs to acquire
> something like sched_lock either way.

Yes, it will make calcru() more expensive.  However, this should be 
infrequent relative to context switches.  It's only used for calls to 
getrusage(), fill_kinfo_proc(), and certain clock_gettime() calls.

The thing that will protect mi_switch() is not process global.  I want to 
keep process global locks out of mi_switch() or we reduce concurrency for 
multi-threaded applications.

Jeff

>
> Bruce
>



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20070529141342.D661>