From owner-freebsd-stable@FreeBSD.ORG Wed Oct 31 01:51:13 2007 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5367116A418 for ; Wed, 31 Oct 2007 01:51:13 +0000 (UTC) (envelope-from pmurray@nevada.net.nz) Received: from bellagio.open2view.net (bellagio.open2view.net [210.48.79.75]) by mx1.freebsd.org (Postfix) with ESMTP id CEBA613C4A5 for ; Wed, 31 Oct 2007 01:51:12 +0000 (UTC) (envelope-from pmurray@nevada.net.nz) Received: from [10.58.3.94] (ip-58-28-203-221.ubs-dsl.xnet.co.nz [58.28.203.221]) by bellagio.open2view.net (Postfix) with ESMTP id 576F76DAA54; Wed, 31 Oct 2007 14:24:14 +1300 (NZDT) In-Reply-To: <2a41acea0710301016u4a0008dfjc83170257337863c@mail.gmail.com> References: <2a41acea0710301016u4a0008dfjc83170257337863c@mail.gmail.com> Mime-Version: 1.0 (Apple Message framework v752.3) Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-Id: <9B40D12A-4657-43F9-8746-6AE87EA31199@nevada.net.nz> Content-Transfer-Encoding: 7bit From: Philip Murray Date: Wed, 31 Oct 2007 14:24:28 +1300 To: Jack Vogel X-Mailer: Apple Mail (2.752.3) Cc: freebsd-stable@freebsd.org Subject: Re: em watchdog problem X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 31 Oct 2007 01:51:13 -0000 On 31/10/2007, at 6:16 AM, Jack Vogel wrote: > This morning I had an idea about what the source of the watchdog > problem is. Also, we have repro'd at least one type of watchdog > inhouse. > > One question, is this problem only happening for those running > STABLE with the 6.6.6 merged driver? > > We found the problem does not seem to happen on 7.0. > Sorry to burst your bubble, but it just happened to me on a 7.0-BETA1 machine running 6.5.3. This machine (Supermicro P4SC8 board) had been running without problems for almost a year previously. It's still happening often on a 6.2-STABLE machine (as previously reported). Was doing around 40Mbit/sec of Rsync traffic at the time. Again, em0 shares an interrupt (this time with atapci, not uhci) dmesg: Copyright (c) 1992-2007 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 7.0-BETA1 #0: Mon Oct 29 23:34:49 NZDT 2007 root@zzxyz.open2view.net:/usr/obj/usr/src/sys/GENERIC Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Pentium(R) 4 CPU 3.20GHz (3194.56-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf41 Stepping = 1 Features=0xbfebfbff Features2=0x441d real memory = 1072562176 (1022 MB) avail memory = 1035980800 (987 MB) ACPI APIC Table: ioapic0 irqs 0-23 on motherboard ioapic1 irqs 24-47 on motherboard kbd1 at kbdmux0 ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413) acpi0: on motherboard acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, 3fde0000 (3) failed Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 cpu0: on acpi0 p4tcc0: on cpu0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 pcib1: at device 3.0 on pci0 pci1: on pcib1 em0: port 0xc000-0xc01f mem 0xf2100000-0xf211ffff irq 18 at device 1.0 on pci1 em0: Ethernet address: 00:30:48:81:e1:9e em0: [FILTER] pcib2: at device 28.0 on pci0 pci2: on pcib2 pcib3: at device 1.0 on pci2 pci3: on pcib3 arcmsr0: mem 0xf2010000-0xf2010fff irq 26 at device 14.0 on pci3 ARECA RAID ADAPTER0: Driver Version 1.20.00.14 2007-2-05 ARECA RAID ADAPTER0: FIRMWARE VERSION V1.36 2005-3-31 arcmsr0: [ITHREAD] uhci0: port 0xe100-0xe11f irq 16 at device 29.0 on pci0 uhci0: [GIANT-LOCKED] uhci0: [ITHREAD] usb0: on uhci0 usb0: USB revision 1.0 uhub0: on usb0 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xe000-0xe01f irq 19 at device 29.1 on pci0 uhci1: [GIANT-LOCKED] uhci1: [ITHREAD] usb1: on uhci1 usb1: USB revision 1.0 uhub1: on usb1 uhub1: 2 ports with 2 removable, self powered pci0: at device 29.4 (no driver attached) ehci0: mem 0xf2200000-0xf22003ff irq 23 at device 29.7 on pci0 ehci0: [GIANT-LOCKED] ehci0: [ITHREAD] usb2: EHCI version 1.0 usb2: companion controllers, 2 ports each: usb0 usb1 usb2: on ehci0 usb2: USB revision 2.0 uhub2: on usb2 uhub2: 4 ports with 4 removable, self powered pcib4: at device 30.0 on pci0 pci4: on pcib4 vgapci0: port 0xd000-0xd0ff mem 0xf0000000-0xf0ffffff,0xf1040000-0xf1040fff irq 16 at device 9.0 on pci4 em1: port 0xd100-0xd13f mem 0xf1000000-0xf101ffff irq 19 at device 10.0 on pci4 em1: Ethernet address: 00:30:48:81:e1:9f em1: [FILTER] isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf000-0xf00f at device 31.1 on pci0 ata0: on atapci0 ata0: [ITHREAD] ata1: on atapci0 ata1: [ITHREAD] atapci1: port 0xe200-0xe207,0xe300-0xe303,0xe400-0xe407,0xe500-0xe503,0xe600-0xe60f irq 18 at device 31.2 on pci0 atapci1: [ITHREAD] ata2: on atapci1 ata2: [ITHREAD] ata3: on atapci1 ata3: [ITHREAD] pci0: at device 31.3 (no driver attached) acpi_tz0: on acpi0 fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FILTER] sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A sio0: [FILTER] sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 sio1: type 16550A sio1: [FILTER] atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] atkbd0: [ITHREAD] pmtimer0 on isa0 orm0: at iomem 0xc0000-0xc7fff pnpid ORM0000 on isa0 ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode ppbus0: on ppc0 ppi0: on ppbus0 plip0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppc0: [GIANT-LOCKED] ppc0: [ITHREAD] sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 Timecounter "TSC" frequency 3194562256 Hz quality 800 Timecounters tick every 1.000 msec Waiting 5 seconds for SCSI devices to settle acd0: CDROM at ata0-master UDMA33 da0 at arcmsr0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-3 device da0: 166.666MB/s transfers (83.333MHz, offset 32, 16bit) da0: 76293MB (156249600 512 byte sectors: 255H 63S/T 9726C) da1 at arcmsr0 bus 0 target 0 lun 1 da1: Fixed Direct Access SCSI-3 device da1: 166.666MB/s transfers (83.333MHz, offset 32, 16bit) da1: 2784728MB (5703123456 512 byte sectors: 255H 63S/T 355003C) GEOM_JOURNAL: Journal 490099366: da1p1 contains data. GEOM_JOURNAL: Journal 490099366: da1p1 contains journal. Trying to mount root from ufs:/dev/da0s1a WARNING: / was not properly dismounted GEOM_JOURNAL: Journal da1p1 consistent. WARNING: /tmp was not properly dismounted WARNING: /usr was not properly dismounted WARNING: /var was not properly dismounted arplookup 210.55.230.210 failed: host is not on local network ichsmb0: port 0x500-0x51f irq 17 at device 31.3 on pci0 ichsmb0: [GIANT-LOCKED] ichsmb0: [ITHREAD] smbus0: on ichsmb0 smb0: on smbus0 em0: watchdog timeout -- resetting em0: link state changed to DOWN em0: link state changed to UP