From owner-freebsd-bugs@FreeBSD.ORG Wed Jul 25 14:40:07 2007 Return-Path: Delivered-To: freebsd-bugs@hub.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1041B16A52E for ; Wed, 25 Jul 2007 14:40:07 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id D850913C474 for ; Wed, 25 Jul 2007 14:40:06 +0000 (UTC) (envelope-from gnats@FreeBSD.org) Received: from freefall.freebsd.org (gnats@localhost [127.0.0.1]) by freefall.freebsd.org (8.14.1/8.14.1) with ESMTP id l6PEe6Vr098474 for ; Wed, 25 Jul 2007 14:40:06 GMT (envelope-from gnats@freefall.freebsd.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.1/8.14.1/Submit) id l6PEe69P098473; Wed, 25 Jul 2007 14:40:06 GMT (envelope-from gnats) Resent-Date: Wed, 25 Jul 2007 14:40:06 GMT Resent-Message-Id: <200707251440.l6PEe69P098473@freefall.freebsd.org> Resent-From: FreeBSD-gnats-submit@FreeBSD.org (GNATS Filer) Resent-To: freebsd-bugs@FreeBSD.org Resent-Reply-To: FreeBSD-gnats-submit@FreeBSD.org, Stefano Guidi Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B48AD16A419 for ; Wed, 25 Jul 2007 14:34:14 +0000 (UTC) (envelope-from nobody@FreeBSD.org) Received: from www.freebsd.org (unknown [IPv6:2001:4f8:fff6::21]) by mx1.freebsd.org (Postfix) with ESMTP id 99EDE13C46B for ; Wed, 25 Jul 2007 14:34:14 +0000 (UTC) (envelope-from nobody@FreeBSD.org) Received: from www.freebsd.org (localhost [127.0.0.1]) by www.freebsd.org (8.14.1/8.14.1) with ESMTP id l6PEYEmE087216 for ; Wed, 25 Jul 2007 14:34:14 GMT (envelope-from nobody@www.freebsd.org) Received: (from nobody@localhost) by www.freebsd.org (8.14.1/8.14.1/Submit) id l6PEYEmn087215; Wed, 25 Jul 2007 14:34:14 GMT (envelope-from nobody) Message-Id: <200707251434.l6PEYEmn087215@www.freebsd.org> Date: Wed, 25 Jul 2007 14:34:14 GMT From: Stefano Guidi To: freebsd-gnats-submit@FreeBSD.org X-Send-Pr-Version: www-3.0 Cc: Subject: kern/114899: bge0: watchdog timeout -- resetting X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 25 Jul 2007 14:40:07 -0000 >Number: 114899 >Category: kern >Synopsis: bge0: watchdog timeout -- resetting >Confidential: no >Severity: critical >Priority: high >Responsible: freebsd-bugs >State: open >Quarter: >Keywords: >Date-Required: >Class: sw-bug >Submitter-Id: current-users >Arrival-Date: Wed Jul 25 14:40:06 GMT 2007 >Closed-Date: >Last-Modified: >Originator: Stefano Guidi >Release: FreeBSD-5.4 >Organization: >Environment: FreeBSD fspa2 5.4-FBSD-Bombardier FreeBSD 5.4-FBSD-Bombardier #1: Fri Jul 22 14:50:49 UTC 2005 root@:/usr/obj/usr/src/sys/PREEMPTION i386 >Description: Hello, This is a problem report, as far as I understand, for an already known problem. Sometimes we experience a lock in the ethernet card that causes our application to halt. In the /var/log/messages I see the message "bge0: watchdog timeout -- resetting". I wrote this mail because I've seen that the problem is reported in many mails always for the FreeBSD-6.x, but we are using a previous version (5.4) and we got exactly the same problem. I hope that following informations can help somehow. I've not been able to find any solution in the mail threads I've read, so, if someone knows how to solve it, or, meanwhile, it has been solved... Thank you S.Guidi The machine is not highly loaded (usually < 10%) This is the output of vmstat -i: interrupt total rate irq28: mpt0 17742 0 irq1: atkbd0 2 0 irq8: rtc 24138043 128 irq13: npx0 1 0 irq14: ata0 46 0 irq16: bge0 bge1+ 23002998 121 irq0: clk 18856972 99 Total 66015804 350 And this is the output of dmesg (please look at the end of the file, at the line stating "Interrupt storm detected on "irq16: bge0 bge1+"; throttling interrupt source", what does it mean?) : Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.4-FBSD-Bombardier #1: Fri Jul 22 14:50:49 UTC 2005 root@:/usr/obj/usr/src/sys/PREEMPTION ACPI APIC Table: Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Xeon(TM) CPU 3.00GHz (3000.12-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf41 Stepping = 1 Features=0xbfebfbff Hyperthreading: 2 logical CPUs real memory = 1073532928 (1023 MB) avail memory = 1040973824 (992 MB) ioapic2 irqs 48-71 on motherboard ioapic1 irqs 24-47 on motherboard ioapic0 irqs 0-23 on motherboard npx0: on motherboard npx0: INT 16 interface acpi0: on motherboard acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR acpi0: Power Button (fixed) acpi_bus_number: can't get _ADR acpi_bus_number: can't get _ADR Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x588-0x58b on acpi0 cpu0: on acpi0 pcib0: on acpi0 pci0: on pcib0 pci0: at device 0.1 (no driver attached) pcib1: at device 2.0 on pci0 pci2: on pcib1 pcib2: at device 4.0 on pci0 pci3: on pcib2 pcib3: at device 0.0 on pci3 pci4: on pcib3 mpt0: port 0x4000-0x40ff mem 0xcffe0000-0xcffeffff,0xcfff0000-0xcfffffff irq 28 at device 1.0 on pci4 pcib4: at device 0.2 on pci3 pci5: on pcib4 pcib5: at device 6.0 on pci0 pci6: on pcib5 bge0: mem 0xcdff0000-0xcdffffff irq 16 at device 0.0 on pci6 miibus0: on bge0 brgphy0: on miibus0 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto bge0: Ethernet address: 00:11:25:a9:6e:64 pcib6: at device 7.0 on pci0 pci7: on pcib6 bge1: mem 0xcbff0000-0xcbffffff irq 16 at device 0.0 on pci7 miibus1: on bge1 brgphy1: on miibus1 brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto bge1: Ethernet address: 00:11:25:a9:6e:65 pci0: at device 8.0 (no driver attached) uhci0: port 0x2200-0x221f irq 16 at device 29.0 on pci0 usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: port 0x2600-0x261f irq 19 at device 29.1 on pci0 usb1: on uhci1 usb1: USB revision 1.0 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered pci0: at device 29.7 (no driver attached) pcib7: at device 30.0 on pci0 pci1: on pcib7 pci1: at device 1.0 (no driver attached) isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0x480-0x48f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.2 on pci0 ata0: channel #0 on atapci0 ata1: channel #1 on atapci0 pci0: at device 31.3 (no driver attached) atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 atkbd: unable to set the command byte. kbd0 at atkbd0 psm0: unable to set the command byte. sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A orm0: at iomem 0xce000-0xd1fff,0xcc800-0xcdfff,0xcb000-0xcc7ff,0xc0000-0xcafff on isa0 pmtimer0 on isa0 ppc0: parallel port not found. sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounter "TSC" frequency 3000122542 Hz quality 800 Timecounters tick every 10.000 msec acd0: DVDROM at ata0-master PIO4 Waiting 15 seconds for SCSI devices to settle Interrupt storm detected on "irq16: bge0 bge1+"; throttling interrupt source ses0 at mpt0 bus 0 target 8 lun 0 ses0: Fixed Processor SCSI-2 device ses0: 3.300MB/s transfers ses0: SAF-TE Compliant Device da0 at mpt0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-4 device da0: 320.000MB/s transfers (160.000MHz, offset 63, 16bit), Tagged Queueing Enabled da0: 70006MB (143374000 512 byte sectors: 255H 63S/T 8924C) Mounting root from ufs:/dev/da0s1a >How-To-Repeat: I'm not able to repeat the problem, it occurs quite seldom (~once a month), but when it happens it locks our application. >Fix: One suggested workaround was to disable acpi controller including the line hint.apic.0.disabled="1" in file /boot/loader.conf but it doesn't work because we got again the same problem. >Release-Note: >Audit-Trail: >Unformatted: