From: Danny Braniss
To: freebsd-hackers@freebsd.org
cc: Matt Freitag
cc: Tim Robbins
Subject: Re: waiting on sbwait
In-reply-to: Your message of Wed, 23 Jun 2004 19:32:49 +0300
Date: Thu, 24 Jun 2004 15:41:10 +0300
Message-Id: <20040624124311.22B0643D62@mx1.FreeBSD.org>

> > Danny Braniss wrote:
> >
> > >Hi,
> > >    We have a host running samba under -stable 4.10, and quite frequently
> > >it becomes unresponsive, hitting ^T gives
> > >
> > >load: 0.00 cmd: ls 12807 [sbwait] 0.00u 0.00s 0% 160k
> > >
> > >so my guess is that we are running out of some resource (socket buffer).
> > >
> > >so, if this is true, is there some sysctl to increase?
> > >
> > >danny
> > >
> > I'm not sure if it's what you're hitting, but perhaps the sysctl
> > "kern.ipc.maxsockets" needs to be raised, though it seems like you'd
> > need a decent amount of concurrent active sessions to reach this
> > ceiling. Also it's read-only, so you'll want to tune it in loader.conf.
> >
> > -mpf
> >
>
> sometimes we get
>     load: 0.04 cmd: dmesg 13453 [nfsrcvlk] 0.00u 0.00s 0% 148k
>
> and looking through the code, there might be some connection between sbwait
> and nfsrcvlk, but I doubt that it's sockets that I'm running out of, or
> mbufs, since:
>
> foundation> netstat -m
> 326/1216/26624 mbufs in use (current/peak/max):
>     326 mbufs allocated to data
> 321/428/6656 mbuf clusters in use (current/peak/max)
> 1160 Kbytes allocated to network (5% of mb_map in use)
> 0 requests for memory denied
> 0 requests for memory delayed
> 0 calls to protocol drain routines
>
> also, the process enters sbwait either in sosend or soreceive, which makes me
> believe that it's some resource, rather than data, that is missing.
>
> the fact that this 'unresponsiveness' happens only sometimes is making this
> rather challenging, but try to tell this to the users :-)
>
> danny

found the cause: NFS/amd
a user had several symlinks to /net/host/xyz, and host was down.
doing ls -F /net/host/xyz does the trick: the machine becomes unresponsive.

now have to look for the reason :-(

danny
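
Since the trigger turned out to be symlinks pointing into /net, one way to look for other such links is to read each link's target without resolving it. The following is only a rough sketch under a few assumptions: the function name find_links_under is made up for illustration, the "/net/" prefix is taken from the path mentioned above, only absolute targets are matched, and the directory being scanned is itself on a filesystem that responds. It reads the link text only, so it should not ask amd to mount anything.

```python
import os


def find_links_under(directory, prefix="/net/"):
    """Return sorted (link_path, target) pairs for symlinks in `directory`
    whose target text starts with `prefix`.

    Hypothetical helper: only the link itself is read, the target is never
    resolved, so a down host behind the link is not contacted.
    """
    hits = []
    with os.scandir(directory) as entries:
        for entry in entries:
            # is_symlink() looks at the directory entry, not at the target
            if entry.is_symlink():
                target = os.readlink(entry.path)  # link text only
                if target.startswith(prefix):
                    hits.append((entry.path, target))
    return sorted(hits)
```

Calling find_links_under on a user's home directory would list the candidate links, which could then be checked one at a time against hosts known to be down. Relative targets and links in subdirectories are not covered by this sketch.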