From owner-freebsd-bugs@FreeBSD.ORG Fri Oct 29 15:40:29 2004
Resent-Date: Fri, 29 Oct 2004 15:40:28 GMT
Resent-Message-Id: <200410291540.i9TFeSDL065455@freefall.freebsd.org>
Resent-From: FreeBSD-gnats-submit@FreeBSD.org (GNATS Filer)
Resent-To: freebsd-bugs@FreeBSD.org
Resent-Reply-To: FreeBSD-gnats-submit@FreeBSD.org, "O. Hartmann"
Message-Id: <200410291536.i9TFa69q001176@edda.geo.uni-mainz.de>
Date: Fri, 29 Oct 2004 17:36:06 +0200 (CEST)
From: "O. Hartmann"
To: FreeBSD-gnats-submit@FreeBSD.org
X-Send-Pr-Version: 3.113
Subject: kern/73289: [FreeBSD 5.3-STABLE] Interrupt storm on sym0
Reply-To: "O. Hartmann"
List-Id: Bug reports

>Number:         73289
>Category:       kern
>Synopsis:       [FreeBSD 5.3-STABLE] Interrupt storm on sym0
>Confidential:   no
>Severity:       critical
>Priority:       high
>Responsible:    freebsd-bugs
>State:          open
>Quarter:
>Keywords:
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Fri Oct 29 15:40:28 GMT 2004
>Closed-Date:
>Last-Modified:
>Originator:     O. Hartmann
>Release:        FreeBSD 5.3-STABLE i386
>Organization:
Department of Geophysics, Johannes Gutenberg-Universitaet, Mainz
>Environment:
System: FreeBSD edda.geo.uni-mainz.de 5.3-STABLE FreeBSD 5.3-STABLE #51: Fri Oct 29 08:49:09 UTC 2004 root@edda.geo.uni-mainz.de:/usr/obj/usr/src/sys/EDDA i386
>Description:
While investigating a mysterious SMP-related crash scenario, I made some
odd observations today. I tried to watch several system parameters via
'systat', and running 'systat -vmstat 1' gave me a strange error:

    The alternate system clock has died. Reverting to ``pigs'' display.

Shortly after this message, while I was using Mozilla, the box froze, as I
have reported many times in the past. After a reboot I ran the same command,
and this time the systat display came up. But suddenly, while I was doing
some work, I saw an 'interrupt storm' on interrupt 24, sym0 (the SCSI
controller in use), eating up all CPU time. Below is a snapshot taken via
cut and paste from the xterm. Is this a hardware problem (i.e. a failure or
defect) or a software bug?

    1 users    Load  1.44  0.83  0.40                  Oct 29 17:17

Mem:KB    REAL            VIRTUAL                     VN PAGER  SWAP PAGER
        Tot   Share      Tot    Share    Free         in  out     in  out
Act   70364    7108   160880     8924  810524 count
All  209492   10316  2490140    13196         pages
                                                          zfod  Interrupts
Proc:r p d s w  Csw  Trp  Sys  Int  Sof  Flt              cow    112906 total
1 2 48        104k   91 1121 113k   66    3        48248 wire            1: atkb
                                                    59312 act             6: fdc0
31.2%Sys  51.2%Intr  1.2%User  0.0%Nice 16.5%Idl   102628 inact     127  8: rtc
|    |    |    |    |    |    |    |    |    |        448 cache       3  9: acpi
================+++++++++++++++++++++++++>         810076 free       79 12: psm
                                                          daefr         13: npx
Namei         Name-cache    Dir-cache                     prcfr         17: ahc
    Calls     hits    %     hits    %                     react         19: pcm
                                                          pdwak       6 20: fxp
                                                          pdpgs  111693 24: sym
Disks   da0   da1   da2   cd0   cd1 pass0 pass1           intrn         25: sym
KB/t   0.00  0.00 16.00  0.00  0.00  0.00  0.00    113904 buf       998  0: clk
tps       0     0     3     0     0     0     0         3 dirtybuf
MB/s   0.00  0.00  0.05  0.00  0.00  0.00  0.00     69987 desiredvnodes
% busy    0     0     2     0     0     0     0     17496 numvnodes
                                                    13270 freevnodes

This is my mptable -dmesg -verbose -grope output, hope it is of help to you:

===============================================================================

MPTable, version 2.0.15

 looking for EBDA pointer @ 0x040e, found, searching EBDA @ 0x0009f000
 searching CMOS 'top of mem' @ 0x0009ec00 (635K)
 searching default 'top of mem' @ 0x0009fc00 (639K)
 searching BIOS @ 0x000f0000

 MP FPS found in BIOS @ physical addr: 0x000f5270

-------------------------------------------------------------------------------

MP Floating Pointer Structure:

  location:                     BIOS
  physical address:             0x000f5270
  signature:                    '_MP_'
  length:                       16 bytes
  version:                      1.4
  checksum:                     0xe3
  mode:                         Virtual Wire

-------------------------------------------------------------------------------

MP Config Table Header:

  physical address:             0x000f4e60
  signature:                    'PCMP'
  base table length:            300
  version:                      1.4
  checksum:                     0x75
  OEM ID:                       'OEM00000'
  Product ID:                   'PROD00000000'
  OEM table pointer:            0x00000000
  OEM table size:               0
  entry count:                  29
  local APIC address:           0xfee00000
  extended table length:        124
  extended table checksum:      200

-------------------------------------------------------------------------------

MP Config Base Table Entries:

--
Processors:     APIC ID  Version  State        Family  Model  Step  Flags
                      3  0x11     BSP, usable       6      8    10  0x387fbff
                      0  0x11     AP, usable        6      8     6  0x387fbff
--
Bus:            Bus ID  Type
                     0  PCI
                     1  PCI
                     2  ISA
--
I/O APICs:      APIC ID  Version  State   Address
                      2  0x11     usable  0xfec00000
                      3  0x11     usable  0xfec01000
--
I/O Ints:       Type    Polarity   Trigger   Bus ID   IRQ  APIC ID  PIN#
                ExtINT  conforms   conforms       2     0        2     0
                INT     conforms   conforms       2     1        2     1
                INT     conforms   conforms       2     0        2     2
                INT     conforms   conforms       2     3        2     3
                INT     conforms   conforms       2     4        2     4
                INT     conforms   conforms       2     6        2     6
                INT     conforms   conforms       2     7        2     7
                INT     conforms   conforms       2     8        2     8
                INT     conforms   conforms       2    12        2    12
                INT     conforms   conforms       2    13        2    13
                INT     conforms   conforms       2    14        2    14
                INT     conforms   conforms       2    15        2    15
                INT     active-lo  level          0   2:A        3     4
                INT     active-lo  level          0   4:A        3     1
                INT     active-lo  level          0   6:A        3     3
                INT     active-lo  level          0  15:A        3    14
                INT     active-lo  level          2     9        2     9
                INT     active-lo  level          1   3:A        3     6
                INT     active-lo  level          1   5:A        3     8
                INT     active-lo  level          1   5:B        3     9
--
Local Ints:     Type    Polarity   Trigger   Bus ID   IRQ  APIC ID  PIN#
                ExtINT  active-hi  edge           2     0      255     0
                NMI     active-hi  edge           2     0      255     1

-------------------------------------------------------------------------------

MP Config Extended Table Entries:

--
System Address Space
 bus ID: 0 address type: I/O address
 address base: 0x0
 address range: 0x10000
--
System Address Space
 bus ID: 0 address type: memory address
 address base: 0x40000000
 address range: 0xbebc0000
--
System Address Space
 bus ID: 0 address type: prefetch address
 address base: 0xfebc0000
 address range: 0xe9440000
--
System Address Space
 bus ID: 0 address type: memory address
 address base: 0xe8000000
 address range: 0x18000000
--
System Address Space
 bus ID: 0 address type: memory address
 address base: 0xa0000
 address range: 0x20000
--
Bus Heirarchy
 bus ID: 2 bus info: 0x01 parent bus ID: 0
--
Compatibility Bus Address
 bus ID: 0 address modifier: add
 predefined range: 0x00000000
--
Compatibility Bus Address
 bus ID: 0 address modifier: add
 predefined range: 0x00000001

-------------------------------------------------------------------------------

dmesg output:

Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 5.3-STABLE #51: Fri Oct 29 08:49:09 UTC 2004
    root@edda.geo.uni-mainz.de:/usr/obj/usr/src/sys/EDDA
ACPI APIC Table:
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel Pentium III (866.70-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x68a  Stepping = 10
  Features=0x387fbff
real memory = 1073721344 (1023 MB)
avail memory = 1041166336 (992 MB)
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID: 3
 cpu1 (AP): APIC ID: 0
ioapic0 irqs 0-15 on motherboard
ioapic1 irqs 16-31 on motherboard
netsmb_dev: loaded
npx0: [FAST]
npx0: on motherboard
npx0: INT 16 interface
acpi0: on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000
acpi_timer0: <32-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0
cpu0: on acpi0
cpu1: on acpi0
acpi_button0: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
fxp0: port 0xd800-0xd83f mem 0xfd800000-0xfd8fffff,0xfe000000-0xfe000fff irq 20 at device 2.0 on pci0
miibus0: on fxp0
inphy0: on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:e0:18:05:73:f4
ahc0: port 0xd400-0xd4ff mem 0xfd000000-0xfd000fff irq 17 at device 4.0 on pci0
ahc0: [GIANT-LOCKED]
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
pcm0: port 0xd000-0xd03f irq 19 at device 6.0 on pci0
pcm0:
pcm0: [GIANT-LOCKED]
pci0: at device 7.0 (no driver attached)
isab0: port 0xe800-0xe80f at device 15.0 on pci0
isa0: on isab0
atapci0: port 0xb400-0xb40f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 15.1 on pci0
ata0: channel #0 on atapci0
ata1: channel #1 on atapci0
pcib1: on acpi0
pci1: on pcib1
em0: port 0xb000-0xb03f mem 0xfa800000-0xfa81ffff irq 22 at device 3.0 on pci1
em0: Ethernet address: 00:07:e9:14:8f:7b
em0: Speed:N/A Duplex:N/A
sym0: <1010-33> port 0xa800-0xa8ff mem 0xf9800000-0xf9801fff,0xfa000000-0xfa0003ff irq 24 at device 5.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym0: [GIANT-LOCKED]
sym1: <1010-33> port 0xa400-0xa4ff mem 0xf8800000-0xf8801fff,0xf9000000-0xf90003ff irq 25 at device 5.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
sym1: [GIANT-LOCKED]
atkbdc0: port 0x64,0x60 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
psm0: irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: model IntelliMouse, device ID 3
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: port 0x778-0x77a,0x378-0x37f irq 7 drq 3 flags 0x8 on acpi0
ppc0: Generic chipset (ECP-only) in ECP mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0: on ppc0
lpt0: on ppbus0
lpt0: Interrupt-driven port
fdc0: port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
orm0: at iomem 0xd0000-0xd3fff,0xc0000-0xca7ff on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <8 virtual consoles, flags=0x300>
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
fb0 at vga0
Timecounters tick every 1.000 msec
Waiting 5 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
da0 at sym0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 62, 16bit), Tagged Queueing Enabled
da0: 17501MB (35843670 512 byte sectors: 255H 63S/T 2231C)
da1 at sym0 bus 0 target 1 lun 0
da1: Fixed Direct Access SCSI-3 device
da1: 160.000MB/s transfers (80.000MHz, offset 62, 16bit), Tagged Queueing Enabled
da1: 17501MB (35843670 512 byte sectors: 255H 63S/T 2231C)
da2 at sym0 bus 0 target 2 lun 0
da2: Fixed Direct Access SCSI-3 device
da2: 160.000MB/s transfers (80.000MHz, offset 62, 16bit), Tagged Queueing Enabled
da2: 17429MB (35694904 512 byte sectors: 255H 63S/T 2221C)
cd0 at ahc0 bus 0 target 0 lun 0
cd0: Removable CD-ROM SCSI-2 device
cd0: 20.000MB/s transfers (20.000MHz, offset 15)
cd0: Attempt to query device size failed: NOT READY, Medium not present
cd1 at ahc0 bus 0 target 1 lun 0
cd1: Removable CD-ROM SCSI-2 device
cd1: 10.000MB/s transfers (10.000MHz, offset 8)
cd1: Attempt to query device size failed: NOT READY, Medium not present
SMP: AP CPU #1 Launched!
Mounting root from ufs:/dev/da0s1a
WARNING: / was not properly dismounted
WARNING: /compat was not properly dismounted
WARNING: /homes was not properly dismounted
/homes: mount pending error: blocks 24 files 1
WARNING: /usr was not properly dismounted
WARNING: /usr/data was not properly dismounted
WARNING: /usr/local was not properly dismounted
WARNING: /usr/obj was not properly dismounted
WARNING: /usr/ports was not properly dismounted
WARNING: /usr/scratch was not properly dismounted
WARNING: /usr/src was not properly dismounted
WARNING: /var was not properly dismounted
pflog0: promiscuous mode enabled
fxp0: promiscuous mode enabled
fxp0: promiscuous mode disabled

===============================================================================
>How-To-Repeat:
>Fix:
>Release-Note:
>Audit-Trail:
>Unformatted:
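
As a rough cross-check of the systat snapshot in the description, the
per-source interrupt figures can be summed and compared against the "total"
line, which also shows how much of the load falls on irq 24. The sketch below
is only an illustration: the numbers are copied from the snapshot, the names
RATES, REPORTED_TOTAL and interrupt_shares are made up for this example, and
treating a blank count as 0 is an assumption about how systat displays an
idle source.

```python
# Interrupt figures copied from the Interrupts column of the systat
# snapshot in this report. Sources whose count field was blank are
# recorded as 0 (an assumption, see the note above).
RATES = {
    "0: clk": 998,
    "1: atkb": 0,
    "6: fdc0": 0,
    "8: rtc": 127,
    "9: acpi": 3,
    "12: psm": 79,
    "13: npx": 0,
    "17: ahc": 0,
    "19: pcm": 0,
    "20: fxp": 6,
    "24: sym": 111693,
    "25: sym": 0,
}

# The "total" line of the same column.
REPORTED_TOTAL = 112906


def interrupt_shares(rates):
    """Return (total, {source: fraction of total}) for a table of counts."""
    total = sum(rates.values())
    shares = {name: count / total for name, count in rates.items()}
    return total, shares


if __name__ == "__main__":
    total, shares = interrupt_shares(RATES)
    print(f"sum of sources: {total} (systat reported {REPORTED_TOTAL})")
    # List the active sources, largest share first.
    for name, share in sorted(shares.items(), key=lambda kv: -kv[1]):
        if share > 0:
            print(f"{name:>8}  {share:7.2%}")
```

If the figures are read correctly from the snapshot, the sources add up to
the reported total and irq 24 (sym0) accounts for just under 99% of all
interrupts, with the clock a distant second.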