Date:      Wed, 24 Dec 1997 03:18:06 -0600 (CST)
From:      Joel Ray Holveck <joelh@gnu.org>
To:        freebsd-current@freebsd.org
Subject:   Random crash update
Message-ID:  <199712240918.DAA17870@detlev.UUCP>

Well, as I said earlier, I was having random core dumps from Emacs,
bash, sendmail, and others when I rebuilt the OS.  I rebuilt it again
(from the same sources, ctm 3176, w/o secure) and thought my woes were
over.  It turns out that they were apparently just less frequent.
Instead of seeing one program crash every five minutes, it's more like
every 20 or so.

During the build process, everything worked fine.  I had shut down all
non-essential processes (X, inetd, sendmail, etc) for the duration.
No core dumps.  I rebuilt the new sources one at a time, by first
rebuilding include, then lib, then executing in csh:
  foreach foo ( bin gnu include lib libexec lkm sbin usr.bin usr.sbin )
    cd $foo
    make all install clean || break
    cd ..
  end
(Yes, I realize this makes include and lib build again.  No harm, eh?)

A couple of times the build wouldn't complete.  slattach and startslip
failed for want of certain #includes, but I had noticed a patch in the
pipe, so I just didn't rebuild them (I'll submit a PR if that's not
fixed by the time this fiasco ends).  Also, something else (I've got it
written down somewhere around here) wanted to install a file into the
previously nonexistent /usr/share/locale/sl_SI.ISO_8859-2 directory.
In each case I reran the same csh command with the SUBDIR variable,
omitting the sections I had problems with.  (This is what happens when
you have a low-paying job and can't afford enough drive space to make
world.)
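
For anyone wanting to do the same thing, here is a rough sketch of one
way to read "the same command with the SUBDIR variable, omitting the
sections I had problems with".  It is Bourne shell rather than csh, the
helper name filter_subdirs is made up for illustration, and the
directory list in the usage comment is a shortened example, not the
real contents of any Makefile.

```shell
# Hypothetical helper: print every word of $1 that does not appear in $2.
# Both arguments are space-separated lists of subdirectory names.
filter_subdirs() {
    all=$1
    skip=$2
    out=
    for d in $all; do
        case " $skip " in
            *" $d "*) ;;                      # listed in skip: drop it
            *) out="${out:+$out }$d" ;;       # otherwise keep it
        esac
    done
    printf '%s\n' "$out"
}

# Assumed usage, run from a directory such as usr.sbin whose Makefile
# defines SUBDIR (the list below is illustrative only):
#   make SUBDIR="$(filter_subdirs "ppp sendmail slattach startslip" \
#                                 "slattach startslip")" all install clean
```

Setting SUBDIR on the make command line overrides the Makefile's own
list, so the skipped programs simply keep whatever binaries were
installed before.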

Rebooted.  No worries, it appeared.  Ran Emacs's M-1 2 M-x hanoi
command (towers of hanoi, n=12).  It naturally gc'd several times, but
never gave me a problem.  (Before the second rebuild, it would crash
before completing an n=6 run.)  Shut down the system completely.
On the next boot, I started to get my mail.  In about two hours, I've
had the following errors:

-bash in free(): warning: pointer to wrong page.
-bash in free(): warning: modified (chunk-) pointer.
[bash exits.]
-bash in free(): warning: junk pointer, too high to make sense.
-bash in free(): warning: modified (chunk-) pointer.
[bash hangs at very high CPU usage.]
Dec 24 02:30:37 detlev /kernel: pid 1275 (sendmail), uid 0: exited on
  signal 11
[sendmail exits.]
Dec 24 03:00:38 detlev /kernel: pid 2211 (sendmail), uid 0: exited on
  signal 11
[sendmail exits.]

(Does anybody know how to convince sendmail to leave core dumps?)

Recall that bash and sendmail were both among the programs giving me
problems after my first rebuild.  Additionally, bash is only dumping when
a child process dies.  The first sendmail shouldn't have been forking
(IIRC, the mail queue was empty), but I could be wrong.  The second
sendmail had an empty queue and no network connection, so it shouldn't
have forked at all.  Nevertheless, I'll mention that
kern.fast_vfork=1; I haven't tried changing it yet.  Note also that
although sendmail was recompiled, its sources have not changed since I
first installed FreeBSD on this box (28 Nov) save hub.mc, which of
course wasn't used.  Note yet also that bash has not been recompiled.

The following timeline may be of interest:

- The first time I connected, iijppp dropped my connection during the
  middle of the session.  I had timeout set to 300 seconds.  The
  fetchmail session was very much active.
- Upon reconnecting and setting timeout to 0, I ran fetchmail and
  ssh.  Both uneventful.
- After retrieving some 200+ messages, and running them all through
  sendmail, I verified that mailq was empty, set the timeout back to
  300, and started a ping to test ppp's timeout.  (ppp stayed up this
  time.)
- During this ping, I noticed inetd consuming some 93% of the system
  resources according to top.  (Am486DX/4-100, 16MB RAM.)  I killed
  inetd; I didn't need it at the moment.  (Restarted inetd later and
  ping'd 127.0.0.1; no recognizable effect on inetd; besides, inetd
  shouldn't even see ICMPs if I understand right.)
- Sometime around here (after checking mailq, before sending message)
  was when sendmail died the first time.
- Killed ping, dropped ppp connection.
- Started writing this message.  During the composition, sendmail died
  again.

If anybody has any suggestions, I'd love to hear them.

Thanks,
joelh

-- 
Joel Ray Holveck - joelh@gnu.org - http://www.wp.com/piquan
   Fourth law of programming:
   Anything that can go wrong wi
sendmail: segmentation violation - core dumped


