Date: Fri, 18 Feb 2000 00:39:13 -0800 (PST)
From: Matthew Dillon <dillon@apollo.backplane.com>
To: "David E. Cross" <crossd@cs.rpi.edu>
Cc: freebsd-hackers@FreeBSD.ORG
Subject: Re: stuck NFS procs (LONG)
Message-ID: <200002180839.AAA75699@apollo.backplane.com>
References: <200002180730.CAA14119@cs.rpi.edu>

    Ah!... ok, it is an NFS bug.  I've been trying to track this down for
    a while ever since you reported the 3.4 lockup bug.  This is probably
    related to a similar problem.

    There is a bug somewhere related to NFS locking up while doing a pagein
    from the executable image.  It can occur when the binary is ripped out
    from under the client but it also can apparently occur if the program
    takes a signal during a pagein on a valid binary that hasn't been
    ripped out.

    If you still have this machine up, can you idle it and do a tcpdump
    looking for NFS packets for a few minutes?  I'd like to know if it is
    doing an infinite retry of the page it got stuck on.  Knowing what it
    is trying to do and why it isn't aborting on error with a segfault is
    the key.

    After that, is there any chance you can panic this machine and get a
    kernel dump?

                                        -Matt
                                        Matthew Dillon
                                        <dillon@backplane.com>

:(kgdb) back
:#0  mi_switch () at ../../kern/kern_synch.c:825
:#1  0xc0131781 in tsleep (ident=0xc054b220, priority=4,
:    wmesg=0xc01ff83c "pgtblk", timo=0) at ../../kern/kern_synch.c:443
:#2  0xc014ea7c in allocbuf (bp=0xc532ec08, size=8192)
:    at ../../kern/vfs_bio.c:1805
:#3  0xc014e5b0 in getblk (vp=0xca69b240, blkno=185, size=8192, slpflag=0,
:    slptimeo=0) at ../../kern/vfs_bio.c:1566
:#4  0xc0180cde in nfs_getcacheblk (vp=0xca69b240, bn=185, size=8192,
:    p=0xcac45520) at ../../nfs/nfs_bio.c:914
:#5  0xc017f9db in nfs_bioread (vp=0xca69b240, uio=0xcacbdf10, ioflag=8323072,
:    cred=0xc16d4180, getpages=0) at ../../nfs/nfs_bio.c:409
:#6  0xc01a5204 in nfs_read (ap=0xcacbdec8) at ../../nfs/nfs_vnops.c:963
:#7  0xc0159f4f in vn_read (fp=0xc16268c0, uio=0xcacbdf10, cred=0xc16d4180,
:    flags=0) at vnode_if.h:303
:#8  0xc013a06d in dofileread (p=0xcac45520, fp=0xc16268c0, fd=3,
:    buf=0x805a000, nbyte=65536, offset=-1, flags=0)
:    at ../../kern/sys_generic.c:179
:#9  0xc0139f77 in read (p=0xcac45520, uap=0xcacbdf94)
:    at ../../kern/sys_generic.c:111
:#10 0xc01ea777 in syscall (frame={tf_es = 39, tf_ds = 39, tf_edi = 1,
:      tf_esi = 65536, tf_ebp = -1077945672, tf_isp = -892608540, tf_ebx = 0,
:      tf_edx = 16, tf_ecx = 13, tf_eax = 3, tf_trapno = 0, tf_err = 2,
:      tf_eip = 134515596, tf_cs = 31, tf_eflags = 582, tf_esp = -1077945796,
:      tf_ss = 39}) at ../../i386/i386/trap.c:1100
:
:
:I'll stop here.
:
:--
:David Cross                               | email: crossd@cs.rpi.edu

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-hackers" in the body of the message
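
The "infinite retry" check requested above could be approximated roughly as follows. This is only a sketch, not anything from the thread: it assumes the tcpdump capture has already been reduced by hand (or by a script) to one (file handle, offset) pair per observed NFS READ request. The function name `find_retry_loops`, the `threshold` parameter, and the sample values are all hypothetical.

```python
from collections import Counter

def find_retry_loops(requests, threshold=5):
    """Return the READ requests that repeat suspiciously often.

    requests:  iterable of (file_handle, offset) pairs, one per NFS READ
               call seen in the capture (hypothetical, pre-parsed input).
    threshold: minimum number of identical requests to report (assumed
               cutoff; tune it to the length of the capture).
    """
    counts = Counter(requests)
    return {key: n for key, n in counts.items() if n >= threshold}

# Hypothetical capture: the same 8192-byte block requested six times,
# plus two unrelated reads that should not be reported.
sample = [("fh-a", 3 * 8192)] * 6 + [("fh-a", 0), ("fh-b", 8192)]
suspects = find_retry_loops(sample)
for (fh, offset), n in suspects.items():
    print(f"{fh} offset {offset}: requested {n} times")
```

On an otherwise idle client, a single block that keeps being requested over a few minutes would be consistent with the stuck pagein being retried rather than failing with an error.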