Date: Sun, 11 Jan 2004 11:20:39 -0500 (EST) From: Ted Wisniewski <ted@ness.plymouth.edu> To: Dan Nelson <dnelson@allantgroup.com> Cc: freebsd-questions@freebsd.org Subject: Re: 5.2-RELEASE - Show stopper problem Message-ID: <200401111620.i0BGKdTL004427@ness.plymouth.edu> In-Reply-To: <20040111040555.GD3393@dan.emsphone.com> "from Dan Nelson at Jan 10, 2004 10:05:55 pm"
next in thread | previous in thread | raw e-mail | index | archive | help
Dan, Thanks for your response... As you can see in this output from the ps command you suggested, the processes are dfinitely waiting on the disk. BTW.. The syste in question was a fresh install from yesterday with no users other than myself (I did the cvsup to get it to 5.2-RELEASE). It did hang when I did that with a similar result. One of the "install -s etc.." processes went into the same state. Output from PS: 692 wait ?? Is 0:00.00 /bin/sh -c periodic daily 1694 wait ?? I 0:00.00 /bin/sh - /usr/sbin/periodic daily 1701 wait ?? I 0:00.02 /bin/sh - /usr/sbin/periodic daily 1702 wait ?? I 0:00.00 /bin/sh - /usr/sbin/periodic daily 1703 piperd ?? I 0:00.00 mail -s testhost.plymouth.edu daily run output 1779 wait ?? I 0:00.00 /bin/sh /etc/periodic/daily/450.status-securit 1780 wait ?? I 0:00.00 /bin/sh - /usr/sbin/periodic security 1786 wait ?? I 0:00.00 /bin/sh - /usr/sbin/periodic security 1787 wait ?? I 0:00.00 /bin/sh - /usr/sbin/periodic security 1788 piperd ?? I 0:00.00 mail -s testhost.plymouth.edu security run out 1789 wait ?? I 0:00.00 /bin/sh - /etc/periodic/security/100.chksetuid 1795 wait ?? I 0:00.00 /bin/sh - /etc/periodic/security/100.chksetuid 1796 piperd ?? I 0:00.00 xargs -0 -n 20 ls -liTd 1797 piperd ?? I 0:00.00 sed s/^ *// 1798 piperd ?? I 0:00.00 sort -k 11 1799 wait ?? I 0:00.00 /bin/sh - /etc/periodic/security/100.chksetuid 1802 piperd ?? I 0:00.00 cat 1805 ufs ?? D 0:02.10 find /usr -xdev -type f ( -perm -u+x -or -perm 2779 sbwait ?? Is 0:00.02 sshd: sysop [priv] (sshd) 2782 select ?? S 0:00.01 sshd: sysop@ttyp0 (sshd) 737 getblk p0- D 0:02.14 find /usr -xdev -type f ( -perm -u+x -or -perm 2783 pause p0 Ss 0:00.01 -csh (csh) 2786 - p0 R+ 0:00.00 ps -axO wchan These have been running for about 9 hours now, on a base system with nothing extra installed. Originally, I thought it had something to do with the raid controllers on the various machines I tried, however, I was able to duplicate the condition on an ATA based system (I had to work a lot harder at getting the condition to occur on it). It is almost like the I/O is too fast and something happens.. On my test system the machine will run for days with this happening, however, I have another system that is actually doing a lot of I/O.... eventually it crashes (well locks up completely)... If there is any particular info you might need, I am willing to do what I can. Ted (* In the last episode (Jan 10), Ted Wisniewski said: (* > In 5.2-RC and 5.2-RELEASE there appears to be some issue with (* > filesystem or I/O subsystem under 5.2-X. Now, You can install and do (* > the normal kind of things, however, when you create a lot of I/O on (* > the disk there seems to be a problem actually reading/writing it (* > to/from disk. For example, If I do a "make buildworld"... It (* > appears to go along ok. However, I have had a number of (repeatable) (* > situations where the "make installworld" will go so far then will not (* > be able to complete. In this case, there is an attempt to write data (* > to disk that cannot complete; the process goes into a disk wait state (* > (it cannot be killed, and will stay in this state ... forever). (* > (* > For example the standard daily security script: (* > (* > 727 p0 T 0:00.00 sh 100.chksetuid (* > 737 p0 T 0:02.14 find /usr -xdev -type f ( -perm -u+x -or -perm -g+x -or -perm -o+x ) ( -perm -u+s -or -perm -g+s (* (* The 'T' state usually means that someone sent the process a STOP (* signal. Try running "kill -CONT 727 737" to start them back up. (* (* Processes waiting on disk I/O will be in the 'D' state, and you can run (* "ps axO wchan" to print the specific part of the kernel it's waiting (* in. (* (* -- (* Dan Nelson (* dnelson@allantgroup.com (* -- | Ted Wisniewski E-Mail: ted@mail.plymouth.edu | | Manager, Systems Group WEB: http://oz.plymouth.edu/~ted/ | | Information Technology Services | | Plymouth State University Phone: (603) 535-2661 | | Plymouth NH, 03264 Fax: (603) 535-2263 |
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200401111620.i0BGKdTL004427>
