Date: Wed, 8 Jul 2009 15:51:51 -0600 From: John Hein <jhein@timing.com> To: net@freebsd.org Subject: Re: network lock manager (lockd) deadlocked in 'rpcrecv' Message-ID: <19029.5367.534192.928426@gromit.timing.com> In-Reply-To: <19029.4145.296260.915327@gromit.timing.com> References: <19029.4145.296260.915327@gromit.timing.com>
next in thread | previous in thread | raw e-mail | index | archive | help
John Hein wrote at 15:31 -0600 on Jul 8, 2009: > I have a home directory on FreeBSD 7.2-stable (20090705), amd64. > It is serving up the directory over nfs (v3, tcp), and now > I'm seeing lots of 'lockd not responding' on Fedora 10 & 11 systems. > > USER PID PPID SID NI %CPU %MEM VSZ RSS TT WCHAN STAT STARTED TIME COMMAND > root 791 1 791 0 0.0 0.0 6748 1500 ?? rpcrec Ds 2:45PM 0:05.80 /usr/sbin/rpc.lockd > > Once lockd gets in this state, doing a test lock on a file > from a FreeBSD box locks with 'lockd not responding', too > (and ctrl-c and kill -9 does nothing). > > USER PID PPID SID NI %CPU %MEM VSZ RSS TT WCHAN STAT STARTED TIME COMMAND > jhein 6297 3491 3491 0 0.0 0.0 1412 604 p5 nlmrcv T+ 3:18PM 0:00.00 /h/jhein/nfslocktest /nfs/locktest > > > I see this on an i386 6.4-stable, too. Also in dmesg: NLM: failed to contact remote rpcbind, stat = 5, port = 28416 And from ddb... Tracing command rpc.lockd pid 791 tid 100176 td 0xffffff00069dd720 sched_switch() at 0xffffffff8037df95 = sched_switch+0x1d5 mi_switch() at 0xffffffff803656fb = mi_switch+0x18b sleepq_timedwait() at 0xffffffff80390aeb = sleepq_timedwait+0x3b _sleep() at 0xffffffff80365cd4 = _sleep+0x324 clnt_dg_call() at 0xffffffff80504a0b = clnt_dg_call+0x4fb nlm_get_rpc() at 0xffffffff804f3ef7 = nlm_get_rpc+0x147 nlm_host_get_rpc() at 0xffffffff804f430e = nlm_host_get_rpc+0x10e nlm_do_lock() at 0xffffffff804f58be = nlm_do_lock+0x1ce nlm4_lock_4_svc() at 0xffffffff804f6c91 = nlm4_lock_4_svc+0x11 nlm_prog_4() at 0xffffffff804f8098 = nlm_prog_4+0x308 svc_run() at 0xffffffff8050c1f3 = svc_run+0x293 nlm_syscall() at 0xffffffff804f675c = nlm_syscall+0x79c syscall() at 0xffffffff805818f4 = syscall+0x1b4 Xfast_syscall() at 0xffffffff8056d35b = Xfast_syscall+0xab --- syscall (154, FreeBSD ELF64, nlm_syscall), rip = 0x8008a91ec, rsp = 0x7fffffffed08, rbp = 0x7fffffffe e20 ---
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?19029.5367.534192.928426>