From owner-freebsd-hackers Wed Apr 11 21:23:26 2001
Delivered-To: freebsd-hackers@freebsd.org
Received: from mail.fureai.or.jp (mail.fureai.or.jp [203.179.2.251])
        by hub.freebsd.org (Postfix) with ESMTP
        id 7382237B50B; Wed, 11 Apr 2001 21:23:07 -0700 (PDT)
        (envelope-from natori@mad.scientist.com)
Received: from localhost (ppp108.td.fureai.or.jp [210.143.57.108])
        by mail.fureai.or.jp (8.9.3/3.7W) with ESMTP id NAA15537;
        Thu, 12 Apr 2001 13:23:05 +0900 (JST)
Message-Id: <200104120423.NAA15537@mail.fureai.or.jp>
To: FreeBSD-questions@freebsd.org, FreeBSD-hackers@freebsd.org
Subject: nfsd hangs in ``inode'' state
X-Mailer: Mew version 1.92.1 on Emacs 19.28 / Mule 2.3 (SUETSUMUHANA)
Mime-Version: 1.0
Content-Type: Text/Plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Date: Thu, 12 Apr 2001 13:23:01 +0900
From: "S. Natori"
Lines: 66
Sender: owner-freebsd-hackers@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.ORG

Hello All,

I am running a FreeBSD-4.2 NFS server with dozens of FreeBSD-4.2 NFS
clients on a 100BaseTX LAN. Recently I found that when the NFS server
receives a lot of requests in a short period (e.g., 2 clients start X
with the gnome desktop simultaneously), all nfsd server processes hang
in the ``inode'' state.

UID  PID PPID CPU PRI NI VSZ RSS WCHAN  STAT TT    TIME COMMAND
  0  440    1   0   2  0 360 132 accept Is   ?? 0:00.00 nfsd: master (nfsd)
  0  441  440   0 -14  0 352 124 inode  D    ?? 0:03.49 nfsd: server (nfsd)
  0  442  440   0 -14  0 352 124 inode  D    ?? 0:00.17 nfsd: server (nfsd)
  0  443  440   0 -14  0 352 124 inode  D    ?? 0:00.02 nfsd: server (nfsd)
  0  444  440   0 -14  0 352 124 inode  D    ?? 0:00.01 nfsd: server (nfsd)

I cannot kill or restart them. The consoles of the clients print
``NFS server not responding'' and I have to restart the server.
This occurs about once a week.

I tried
  (1) increasing the number of nfsd processes (4 -> 8, 20)
  (2) replacing the server HDD (SCSI) with another ATA33 HDD
  (3) changing mount_nfs options
      (tried removing tcp, adding soft,dumbtimer)
but none of these solved the problem.

Perhaps this problem is the one reported previously in
http://www.FreeBSD.org/cgi/getmsg.cgi?fetch=1913802+1916627+/usr/local/www/db/text/2000/freebsd-questions/20001008.freebsd-questions
The reporter avoided this problem by switching to NetBSD, but I prefer
FreeBSD.

Do you have any idea how to solve or work around this problem?
Any suggestions would be appreciated.

Many thanks in advance,
S. Natori

+-------------------------------------------------------------------------
|More detailed information about the NFS server/client.
+-------------------------------------------------------------------------
Server spec:
  CPU:  Pentium III 800MHz
  MEM:  256MB
  SCSI: Adaptec 29160 Ultra160 SCSI adapter
  HDD:  da0: Fixed Direct Access SCSI-3 device
        da0: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing Enabled
        da0: 17501MB (35843670 512 byte sectors: 255H 63S/T 2231C)
  NIC:  Intel Pro 100+

Client fstab:
  server:/home        /home        nfs  rw,tcp,intr  0  0
  server:/usr/local   /usr/local   nfs  rw,tcp,intr  0  0
  server:/usr/X11R6   /usr/X11R6   nfs  rw,tcp,intr  0  0
  server:/usr/compat  /usr/compat  nfs  rw,tcp,intr  0  0
  server:/usr/share   /usr/share   nfs  rw,tcp,intr  0  0

+-------------------------------------------------------------------------
|Much more detailed information about the hung nfsds
+-------------------------------------------------------------------------
USER PID %CPU %MEM ACFLG CPU F INBLK JOBC KTRACE KTRACEP LIM LOGIN STARTED MAJFLT MINFLT MSGRCV MSGSND NI NIVCSW NSIGS NSWAP NVCSW WCHAN OUBLK P_RU PADDR PAGEIN PGID PPID PRI RE RGID RLINK RSS RSZ RTPRIO RUID RUSER SESS PENDING CAUGHT IGNORED BLOCKED SL STARTED STAT SVGID SVUID TDEV TIME TPGID TSESS TSIZ TT TTY UCOMM UID UPR VSZ WCHAN XSTAT COMMAND

root 440 0.0 0.1 3 0 84 0 0 0 0 - root Sat 4/ 7 15:16:03 2001 0 20 4 4 0 58 0 0 58 cdae0f6 0 0 e855440 0 440 1 2 5662 0 3ab3e0 132 132 normal 0 root 1808880 0 80800 1840c007 0 525 3:16PM Is 0 0 ?? 0:00.00 0 0 156 ?? ?? nfsd 0 50 360 accept 0 nfsd: master (nfsd)

root 441 0.0 0.1 3 0 4 4322 0 0 0 - root Sat 4/ 7 15:16:03 2001 0 23 0 100282 0 0 0 0 104888 17fb700 1569 0 e855ac0 0 440 440 -14 5662 0 3ab298 124 124 normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D 0 0 ?? 0:03.49 0 0 156 ?? ?? nfsd 0 50 352 inode 0 nfsd: server (nfsd)

root 442 0.0 0.1 3 0 4 494 0 0 0 - root Sat 4/ 7 15:16:03 2001 0 7 0 4108 0 0 0 0 4634 17fb700 18 0 e855920 0 440 440 -14 5662 0 e855ac0 124 124 normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D 0 0 ?? 0:00.17 0 0 156 ?? ?? nfsd 0 50 352 inode 0 nfsd: server (nfsd)

root 443 0.0 0.1 3 0 4 58 0 0 0 - root Sat 4/ 7 15:16:03 2001 0 12 0 581 0 0 0 0 696 17fb700 22 0 e855780 0 440 440 -14 5662 0 e855920 124 124 normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D 0 0 ?? 0:00.02 0 0 156 ?? ?? nfsd 0 50 352 inode 0 nfsd: server (nfsd)

root 444 0.0 0.1 3 0 4 17 0 0 0 - root Sat 4/ 7 15:16:03 2001 0 7 0 198 0 0 0 0 271 188fe00 0 0 d4470c0 0 440 440 -14 5662 0 3ab4d0 124 124 normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D 0 0 ?? 0:00.01 0 0 156 ?? ?? nfsd 0 50 352 inode 0 nfsd: server (nfsd)

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-hackers" in the body of the message