Date: Fri, 01 Feb 2008 06:20:55 -0600 From: Eric Anderson <anderson@freebsd.org> To: Steven Hartland <killing@multiplay.co.uk> Cc: Dieter <freebsd@sopwith.solgatos.com>, freebsd-performance@freebsd.org Subject: Re: newfs locks entire machine for 20seconds Message-ID: <47A30EA7.7050506@freebsd.org> In-Reply-To: <002201c86499$7861ac20$b6db87d4@multiplay.co.uk> References: <200801310147.BAA04522@sopwith.solgatos.com> <47A2A606.9080702@freebsd.org> <002201c86499$7861ac20$b6db87d4@multiplay.co.uk>
next in thread | previous in thread | raw e-mail | index | archive | help
Steven Hartland wrote:
>
> ----- Original Message ----- From: "Eric Anderson" <anderson@freebsd.org>
>
>> I saw this once before, a long time back, and every time I went
>> through a debugging session, it came to some kind of lock on the
>> sysctl tree with regards to the geom info (maybe the XML kind of tree
>> dump or something). I don't recall all the details, but it was
>> something like that.
>
> Yep thats where I've traced it to its requesting: kern.geom.confxml
>
> Which does:-
> static int
> sysctl_kern_geom_confxml(SYSCTL_HANDLER_ARGS)
> {
> int error;
> struct sbuf *sb;
>
> sb = sbuf_new(NULL, NULL, 0, SBUF_AUTOEXTEND);
> g_waitfor_event(g_confxml, sb, M_WAITOK, NULL);
> error = SYSCTL_OUT(req, sbuf_data(sb), sbuf_len(sb) + 1);
> sbuf_delete(sb);
> return error;
> }
>
> What I dont understand is why this would lock the entire machine.
>
> I've enabled LOCK_PROFILING and reran and I get the following which
> seems to indicate the culpret is: SYSCTL_LOCK()
>
> From what I can tell g_waitfor_event is returning EAGAIN for a large
> amount of time which means we get stuck in:-
> userland_sysctl
> ...
> SYSCTL_LOCK();
>
> do {
> req.oldidx = 0;
> req.newidx = 0;
> error = sysctl_root(0, name, namelen, &req);
> } while (error == EAGAIN);
>
> if (req.lock == REQ_WIRED && req.validlen > 0)
> vsunlock(req.oldptr, req.validlen);
>
> SYSCTL_UNLOCK();
> ...
>
> The only reason I can see for returning EAGAIN is g_destroy_geom
> calling g_cancel_event
Wait - if it returns EAGAIN for a while, then look at that code above.
It will hold the sysctl lock for some indefinite amount of time. Maybe
it should look like this instead:
do {
SYSCTL_LOCK();
req.oldidx = 0;
req.newidx = 0;
error = sysctl_root(0, name, namelen, &req);
SYSCTL_UNLOCK();
} while (error == EAGAIN);
if (req.lock == REQ_WIRED && req.validlen > 0)
vsunlock(req.oldptr, req.validlen);
Can you try that?
Eric
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?47A30EA7.7050506>
