From owner-freebsd-stable@FreeBSD.ORG Mon Oct 13 11:07:22 2003 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 3FF9116A4B3 for ; Mon, 13 Oct 2003 11:07:22 -0700 (PDT) Received: from sage-one.net (adsl-65-71-135-137.dsl.crchtx.swbell.net [65.71.135.137]) by mx1.FreeBSD.org (Postfix) with ESMTP id 112C643F93 for ; Mon, 13 Oct 2003 11:07:21 -0700 (PDT) (envelope-from jackstone@sage-one.net) Received: from sagea (sagea.sage-american [10.0.0.3]) by sage-one.net (8.12.8p2/8.12.8) with SMTP id h9DI7GVR066596; Mon, 13 Oct 2003 13:07:16 -0500 (CDT) (envelope-from jackstone@sage-one.net) Message-Id: <3.0.5.32.20031013130713.014336f8@10.0.0.10> X-Sender: jackstone@10.0.0.10 X-Mailer: QUALCOMM Windows Eudora Pro Version 3.0.5 (32) Date: Mon, 13 Oct 2003 13:07:13 -0500 To: Matthew Seaman , Hani Mouneimne From: "Jack L. Stone" In-Reply-To: <20031013164826.GB20434@happy-idiot-talk.infracaninophile.c o.uk> References: <40e792e94c57e8fc779e568f066edcfb@194.83.224.1> <40e792e94c57e8fc779e568f066edcfb@194.83.224.1> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" X-Spam-Status: No, hits=-4.8 required=4.5 tests=AWL,BAYES_10,EMAIL_ATTRIBUTION,IN_REP_TO,MY_OBFUY, QUOTED_EMAIL_TEXT,REFERENCES,REPLY_WITH_QUOTES autolearn=ham version=2.55-sageame.rules_v1 X-Spam-Checker-Version: SpamAssassin 2.55-sageame.rules_v1 (1.174.2.19-2003-05-19-exp) cc: "." Subject: Re: Crashing box X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 13 Oct 2003 18:07:22 -0000 At 05:48 PM 10.13.2003 +0100, Matthew Seaman wrote: >On Mon, Oct 13, 2003 at 04:19:58PM +0200, Hani Mouneimne wrote: >> Hey all, >> >> I was wondering if you could help with this issue. >> >> Eeverytime I run a make/compile on my freebsd 4.8 p10 systrem it has a >> complete spaz and reboots. Usually cores and someimes gives no messages at >> all in the logfiles. >> Here is the latest output of a makeworld I am doing >> ="sh /usr/src/tools/install.sh" >> PATH=/usr/obj/usr/src/i386/usr/sbin:/usr/obj/usr/src/i386/usr/bin:/usr/obj/u sr/src/i386/usr/games:/sbin:/bin:/usr/sbin:/usr/bin >> make -f Makefile.inc1 par-depend >> *** Signal 11 >> *** Signal 11 >> Killed > >I assume you've read the FAQ entry on Sig11: > > http://www.freebsd.org/doc/en_US.ISO8859-1/books/faq/troubleshoot.html#SIGNA L11 > >Signal 11, especially if it occurs in an unpredictable place during >compiles or other heavy weight operations, is a clear sign of hardware >problems, but I think you know that from what you say next. > >> This is just one of many crashes of similar scale, Sefaulting is also >> common. >> I have changed the entire server hardware including the hard drive and it is >> still doing this. It was fine with FreeBSD p0 so I am wondering it it could >> be some code issue. > >Tricky. Are you sure you've swapped out *all* of the hardware? SEGVs >are typically due to memory or CPUs going bad, but there are several >other considerations. > In addition to the good advice by Matthew, I had one particular server that was driving me nuts in a similar manner -- and, I had changed out ALL of the hardware and was getting ready to change out the Mobo as last resort. Then, I noticed something in the BIOS. In one section, there were some settings for "monitoring all of the IRQs" and "ON" was set for several IRQs, maybe 4 or 5 and that included things like "wake on ring", "wake on LAN" and others. VOILA! I disabled all of the "monitoring" and that *seems* to have finally eliminated this strange affliction. I gather that if any calls by the OS on those monitored IRQs happened to cause a conflict by instructing the machine to react in two different manners at the same time, a reboot and/or core dump would occur -- at least cause problems. Anyway, check your BIOS carefully. Even though I checked & rechecked the BIOS many times, I had never noticed this "monitored IRQs"..... BTW, I had reassigned the above box as a "build box" and it went really nuts on that effort and had all of the errors one could imagine -- and not a single successful build. This is when I discovered the BIOS glitch. Best regards, Jack L. Stone, Administrator SageOne Net http://www.sage-one.net jackstone@sage-one.net