Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 17 Apr 2008 11:43:49 +0100 (BST)
From:      Robert Watson <rwatson@FreeBSD.org>
To:        current@FreeBSD.org
Cc:        net@FreeBSD.org
Subject:   Moving pcbinfo and inpcb locks from mutexes to rwlocks
Message-ID:  <20080417112757.M1046@fledge.watson.org>

next in thread | raw e-mail | index | archive | help

Dear all,

In the next couple of days (exact schedule depends on how testing goes), I'll 
merge a portion of the rwlock lock patch that I've developed and that Kris has 
been testing.  This opens the door to increased parallelism in the network 
stack by facilitating UDP and TCP moves to read-locking of certain data 
structures under certain conditions.  These patches were originally developed 
to address known high lock contention when running the BIND9 and nsd name 
servers, which employ a single UDP socket from many threads simultaneously.

In the first pass, the changes simply substitute an rwlock for a mutex, and 
convert the accessor macros to explicit write-locking for inpcbs; for pcbinfo, 
we allow read locking to be used in certain restricted situations, and 
generalize certain lock assertions to allow read locks to be held.  However, 
in practice, this pass should make little functional difference, as all key 
paths will remain protected by exclusive locking.

This will then settle for a bit to attempt to show up any problems that didn't 
turn up in testing so far, and then...

In the second pass, write locking is replaced with read locking of the pcbinfo 
lock in input paths for UDP, and write locking of the inpcb is replaced with 
read locking in many cases in the output paths, elimating one source of high 
lock contention for BIND/nsd.  TCP continues to use exclusive locking in all 
cases with this set of changes.

It is my understanding, and I need to confirm this, that struct rwlock is the 
same size as struct mutex, meaning that (a) they don't require monitoring 
tools to be rebuilt, and (b) these are potential MFC candidates.  If you run 
into ABI problems with monitoring tools after the merge, please let me know 
(including architecture information, etc).

Thanks,

Robert N M Watson
Computer Laboratory
University of Cambridge



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080417112757.M1046>