Date: Tue, 17 Jan 2006 07:49:05 -0800 (PST)
From: Danial Thom
Reply-To: danial_thom@yahoo.com
To: Martin McCormick, freebsd-questions@freebsd.org
Subject: Re: Rescuing the Ethernet Interface after SCB Timeout
Message-ID: <20060117154905.48373.qmail@web33301.mail.mud.yahoo.com>
In-Reply-To: <200601170151.k0H1pYhn034216@dc.cis.okstate.edu>

--- Martin McCormick wrote:

> One of our FreeBSD systems has begun randomly shutting down
> its Ethernet interface.
> After doing so, the box continues to try to
> run but prints the following errors in syslog:
>
> Jan 16 03:01:23 xx /kernel: fxp0: SCB timeout: 0x70 0x0 0x50 0x400
> Jan 16 03:01:24 xx /kernel: fxp0: SCB timeout: 0xf0 0x0 0x50 0x400
> Jan 16 03:01:35 xx last message repeated 9 times
> Jan 16 03:01:40 xx /kernel: fxp0: device timeout
> Jan 16 03:01:40 xx /kernel: fxp0: DMA timeout
> Jan 16 03:01:40 xx /kernel: fxp0: DMA timeout
>
> If we reboot, the system may be okay for 4 months or 2 hours,
> who knows? I can write a shell script to look for any new messages
> like those shown above so determining that the failure has occurred is
> not difficult especially since the system is still running all be it
> without a network connection.
>
> Will bringing fxp0 down with ifconfig and then back up as in
>
> ifconfig fxp0 down
> sleep 5
> ifconfig fxp0 up
>
> restore it to operation again?
>
> I am asking because the system is unreachable when fxp0 is
> down. If one was logged in to that system, is there anything we could
> do to rescue it besides a full reboot?
>
> I want to make it rescue itself if it can since these things
> always happen on holidays or weekends or at 03:00 in the morning.
>
> This system doesn't die that often, but it is often enough to
> take measures to prevent it from needing our laying on of hands at odd
> hours.
>
> Thank you.

Such things are usually caused by a bus error; either the card or
the bus may be slightly out of spec, or some other card may be
interfering. You'll see such things, for example, when you put a
PCI-X card on a 133MHz bus; change the bus to 100MHz and it will
never happen again.

You might not be able to recover. If you can, replace the card or
try a different bus slot. If your bus is overloaded (i.e. multiple
cards, too much trace length or too many loads), then removing one
card may fix it.
If you are using a non-Intel card with an Intel part on it, I'd
suggest trying a different card.

DT
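
The self-rescue idea from the quoted question (watch syslog for new fxp0 timeout lines, then cycle the interface) could be sketched roughly as below. This is only an illustration under stated assumptions: the log path /var/log/messages, the state file /var/run/fxp_watchdog.count, and the function names are hypothetical and do not come from the thread. Nothing in the thread confirms that a down/up cycle restores the interface; as noted above, the fault may not be recoverable in software, in which case a script like this only records that a recovery was attempted.

```sh
#!/bin/sh
# Hypothetical watchdog sketch. All paths and names are assumptions.

IFACE="${IFACE:-fxp0}"                          # interface from the log excerpt
LOGFILE="${LOGFILE:-/var/log/messages}"         # assumed syslog destination
STATE="${STATE:-/var/run/fxp_watchdog.count}"   # hypothetical state file

# Count lines on stdin reporting an SCB, device, or DMA timeout for $IFACE.
# grep -c exits non-zero when it counts 0 matches, hence the "|| true".
count_timeouts() {
    grep -c -E "${IFACE}: (SCB|device|DMA) timeout" || true
}

# Cycle the interface exactly as proposed in the question.
bounce_iface() {
    ifconfig "$IFACE" down
    sleep 5
    ifconfig "$IFACE" up
}

# Compare the current count with the one saved by the previous run and
# cycle the interface only when new timeout lines have appeared. After a
# log rotation the count drops, so that run just resets the saved value.
check_once() {
    now=$(count_timeouts < "$LOGFILE")
    last=$(cat "$STATE" 2>/dev/null || echo 0)
    echo "$now" > "$STATE"
    if [ "$now" -gt "$last" ]; then
        logger "fxp watchdog: new timeouts on $IFACE, cycling interface"
        bounce_iface
    fi
}

# Act only when asked to, so the functions can be loaded without side effects.
if [ "${1:-}" = "check" ]; then
    check_once
fi
```

Run from root's crontab every few minutes with the argument "check", this would attempt one down/up cycle per batch of new timeout messages. "last message repeated N times" lines are deliberately not counted, since they always follow a line that already matched.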