Date: Wed, 24 Dec 1997 03:18:06 -0600 (CST)
From: Joel Ray Holveck <joelh@gnu.org>
To: freebsd-current@freebsd.org
Subject: Random crash update
Message-ID: <199712240918.DAA17870@detlev.UUCP>
Well, as I said earlier, I was having random core dumps from Emacs,
bash, sendmail, and others when I rebuilt the OS. I rebuilt it again
(from the same sources, ctm 3176, w/o secure) and thought my woes were
over. It turns out that they were apparently just less frequent.
Instead of seeing one program crash every five minutes, it's more like
every 20 or so.

During the build process, everything worked fine. I had shut down all
non-essential processes (X, inetd, sendmail, etc.) for the duration.
No core dumps. I rebuilt the new sources one at a time, by first
rebuilding include, then lib, then executing in csh:

    foreach foo ( bin gnu include lib libexec lkm sbin usr.bin usr.sbin )
        cd $foo
        make all install clean || break
        cd ..
    end

(Yes, I realize this makes include and lib build again. No harm, eh?)

A couple of times the build wouldn't complete. slattach and startslip
had problems with missing #includes, but I had noticed a patch in the
pipe so I just didn't rebuild them; I'll submit a PR if that's not
fixed by the time this fiasco ends. Also, something else (I've got it
written down somewhere around here) wanted to install a file to the
previously nonexistent /usr/share/locale/sl_SI.ISO_8859-2 directory.
In those cases I used the same csh command with the SUBDIR variable,
and omitted the sections I had problems with. (This is what happens
when you have a low-paying job and can't afford enough drive space to
make world.)

Rebooted. No worries, it appeared. Ran Emacs's M-1 2 M-x hanoi command
(towers of hanoi, n=12). It naturally gc'd several times, but never
gave me a problem. (Before the second rebuild, it would crash before
completing an n=6 run.)

Shut down the system completely. Next boot, start to get my mail. In
about two hours, I've had the following errors:

-bash in free(): warning: pointer to wrong page.
-bash in free(): warning: modified (chunk-) pointer.
[bash exits.]
-bash in free(): warning: junk pointer, too high to make sense.
-bash in free(): warning: modified (chunk-) pointer.
[bash hangs at very high CPU usage.]
Dec 24 02:30:37 detlev /kernel: pid 1275 (sendmail), uid 0: exited on signal 11
[sendmail exits.]
Dec 24 03:00:38 detlev /kernel: pid 2211 (sendmail), uid 0: exited on signal 11
[sendmail exits.]

(Does anybody know how to convince sendmail to leave core dumps?)

Recall that bash and sendmail, among others, were each giving me
problems after my first rebuild. Additionally, bash is only dumping
when a child process dies. The first sendmail shouldn't have been
forking (IIRC, the mail queue was empty), but I could be wrong. The
second sendmail had an empty queue and no network connection, so it
shouldn't have forked at all. Nevertheless, I'll mention that
kern.fast_vfork=1; I haven't tried changing it yet.

Note also that although sendmail was recompiled, its sources have not
changed since I first installed FreeBSD on this box (28 Nov) save
hub.mc, which of course wasn't used. Note also that bash has not been
recompiled.

The following timeline may be of interest:

- The first time I connected, iijppp dropped my connection during the
  middle of the session. I had timeout set to 300 seconds. The
  fetchmail session was very much active.
- Upon reconnecting and setting timeout to 0, I ran fetchmail and ssh.
  Both uneventful.
- After retrieving some 200+ messages, and running them all through
  sendmail, I verified that mailq was empty, set the timeout back to
  300, and started a ping to test ppp's timeout. (ppp stayed up this
  time.)
- During this ping, I noticed inetd consuming some 93% of the system
  resources according to top. (Am486DX/4-100, 16MB RAM.) I killed
  inetd; I didn't need it at the moment. (Restarted inetd later and
  ping'd 127.0.0.1; no recognizable effect on inetd; besides, inetd
  shouldn't even see ICMPs if I understand right.)
- Sometime around here (after checking mailq, before sending message)
  was when sendmail died the first time.
- Killed ping, dropped ppp connection.
- Started writing this message. During the composition, sendmail died
  again.

If anybody has any suggestions, I'd love to hear them.

Thanks,
joelh

-- 
Joel Ray Holveck - joelh@gnu.org - http://www.wp.com/piquan
   Fourth law of programming:
   Anything that can go wrong wi
sendmail: segmentation violation - core dumped