Date: Mon, 06 Jul 2009 20:22:15 -0400
From: Dan Langille <dan@langille.org>
To: Dan Langille <dan@langille.org>
Cc: Victor Balada Diaz <victor@bsdes.net>, freebsd-stable@freebsd.org, Pete French <petefrench@ticketswitch.com>, "Marat N.Afanasyev" <amarat@ksu.ru>
Subject: Re: interrupt storm on MSI IXP600 based motherboards
Message-ID: <4A529537.8070206@langille.org>
In-Reply-To: <7697CDAB-B4E7-480A-B31A-1F54275B8D54@langille.org>
References: <E1LNnFa-0003ze-7k@dilbert.ticketswitch.com> <49774BAE.3000809@ksu.ru> <20090122071845.GF4881@alf.bsdes.net> <4978A10A.9060006@langille.org> <7697CDAB-B4E7-480A-B31A-1F54275B8D54@langille.org>

Dan Langille wrote:
>
> On Jan 22, 2009, at 11:38 AM, Dan Langille wrote:
>
>> Victor Balada Diaz wrote:
>>> On Wed, Jan 21, 2009 at 07:22:06PM +0300, Marat N.Afanasyev wrote:
>>>>>> trouble with onboard re(4) was resolved in -CURRENT and -STABLE,
>>>>>> but storms are not bound to ethernet only. storm may appear on any
>>>>>> device. if any device generates enough interrupts rate, storm will
>>>>>> arrive.
>>>>> Yes, I just got another storm, on my ATA controller this time. Ah
>>>>> well, so much for the idea of disabling unneeded devices!
>>>>>
>>>>> -pete.
>>>>>
>>>> it's a kind of magic, really. I built a new kernel with KDB and DDB
>>>> and after 1 day, 13:15 I'm still waiting for storm to arrive. And I
>>>> added
>>>> hw.acpi.osname="Linux" to /boot/loader.conf.
>>> Try doing lots of IO and you will get the problem soon. You might
>>> want to try:
>>> while true; do dd if=/dev/zero of=BAH bs=1M count=1024; sync; done
>>
>> FWIW, last night I changed the address of the comm port IO in my BIOS.
>> Then I ran the Bacula regression test suite (lots of IO). For my
>> machine, once the interrupt storm starts, it continues. I do not know
>> if that happens to everyone.
>>
>> Since changing the address, I have had no interrupt storms. I have
>> been running the above IO loop for about ten minutes.
>>
>> No storm yet (knock on wood).
>
>
> And it's back:
>
> Jan 22 17:21:46 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
> Jan 22 17:23:19 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
> Jan 22 17:28:20 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
> Jan 22 17:33:20 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
> Jan 22 17:38:20 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
>
> I shall try the hw.acpi.osname="Linux" option now.
>
> From dmsg: Jan 22 18:10:07 polo kernel: ACPI: Overriding _OS definition with "Linux"

The problem returns:

Jul 6 20:12:10 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source
Jul 6 20:12:41 polo last message repeated 31 times
Jul 6 20:14:42 polo last message repeated 121 times
Jul 6 20:17:09 polo last message repeated 147 times

FreeBSD polo.example.org 7.2-STABLE FreeBSD 7.2-STABLE #10: Mon Jun 1 19:19:13 EDT 2009 dan@example.org:/usr/obj/usr/src/sys/PHENOM amd64

Copyright (c) 1992-2009 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 7.2-STABLE #10: Mon Jun 1 19:19:13 EDT 2009
    dan@example.org:/usr/obj/usr/src/sys/PHENOM
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Phenom(tm) 9600 Quad-Core Processor (2300.17-MHz K8-class CPU)
  Origin = "AuthenticAMD" Id = 0x100f22 Stepping = 2
  Features=0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT>
  Features2=0x802009<SSE3,MON,CX16,<b23>>
  AMD Features=0xee500800<SYSCALL,NX,MMX+,FFXSR,Page1GB,RDTSCP,LM,3DNow!+,3DNow!>
  AMD Features2=0x7ff<LAHF,CMP,SVM,ExtAPIC,CR8,<b5>,<b6>,<b7>,Prefetch,<b9>,<b10>>
  TSC: P-state invariant
  Cores per package: 4
usable memory = 4281012224 (4082 MB)
avail memory = 4108423168 (3918 MB)
ACPI APIC Table: <122107 APIC0947>
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): APIC ID: 0
 cpu1 (AP): APIC ID: 1
 cpu2 (AP): APIC ID: 2
 cpu3 (AP): APIC ID: 3
ioapic0 <Version 2.1> irqs 0-23 on motherboard
kbd1 at kbdmux0
acpi0: <122107 RSDT0947> on motherboard
ACPI: Overriding _OS definition with "Linux"
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
acpi0: reservation of fee00000, 1000 (3) failed
acpi0: reservation of ffb80000, 80000 (3) failed
acpi0: reservation of 0, a0000 (3) failed
acpi0: reservation of 100000, cff00000 (3) failed
ACPI HPET table warning: Sequence is non-zero (2)
Timecounter "ACPI-safe" frequency 3579545 Hz quality 850
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0
acpi_hpet0: <High Precision Event Timer> iomem 0xfed00000-0xfed003ff on acpi0
Timecounter "HPET" frequency 14318180 Hz quality 900
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <ACPI PCI-PCI bridge> at device 11.0 on pci0
pci1: <ACPI PCI bus> on pcib1
vgapci0: <VGA-compatible display> port 0xd800-0xd87f mem 0xfd000000-0xfdffffff,0xd0000000-0xdfffffff,0xfa000000-0xfbffffff irq 19 at device 0.0 on pci1
atapci0: <ATI IXP600 SATA300 controller> port 0xc000-0xc007,0xb000-0xb003,0xa000-0xa007,0x9000-0x9003,0x8000-0x800f mem 0xf9fff800-0xf9fffbff irq 22 at device 18.0 on pci0
atapci0: [ITHREAD]
atapci0: AHCI Version 01.10 controller with 4 ports detected
ata2: <ATA channel 0> on atapci0
ata2: [ITHREAD]
ata3: <ATA channel 1> on atapci0
ata3: [ITHREAD]
ata4: <ATA channel 2> on atapci0
ata4: [ITHREAD]
ata5: <ATA channel 3> on atapci0
ata5: [ITHREAD]
ohci0: <OHCI (generic) USB controller> mem 0xf9ffe000-0xf9ffefff irq 16 at device 19.0 on pci0
ohci0: [GIANT-LOCKED]
ohci0: [ITHREAD]
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: <ATI OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb0
uhub0: 2 ports with 2 removable, self powered
ohci1: <OHCI (generic) USB controller> mem 0xf9ffd000-0xf9ffdfff irq 17 at device 19.1 on pci0
ohci1: [GIANT-LOCKED]
ohci1: [ITHREAD]
usb1: OHCI version 1.0, legacy support
usb1: SMM does not respond, resetting
usb1: <OHCI (generic) USB controller> on ohci1
usb1: USB revision 1.0
uhub1: <ATI OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb1
uhub1: 2 ports with 2 removable, self powered
ohci2: <OHCI (generic) USB controller> mem 0xf9ffc000-0xf9ffcfff irq 18 at device 19.2 on pci0
ohci2: [GIANT-LOCKED]
ohci2: [ITHREAD]
usb2: OHCI version 1.0, legacy support
usb2: SMM does not respond, resetting
usb2: <OHCI (generic) USB controller> on ohci2
usb2: USB revision 1.0
uhub2: <ATI OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb2
uhub2: 2 ports with 2 removable, self powered
ohci3: <OHCI (generic) USB controller> mem 0xf9ffb000-0xf9ffbfff irq 17 at device 19.3 on pci0
ohci3: [GIANT-LOCKED]
ohci3: [ITHREAD]
usb3: OHCI version 1.0, legacy support
usb3: SMM does not respond, resetting
usb3: <OHCI (generic) USB controller> on ohci3
usb3: USB revision 1.0
uhub3: <ATI OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb3
uhub3: 2 ports with 2 removable, self powered
ohci4: <OHCI (generic) USB controller> mem 0xf9ffa000-0xf9ffafff irq 18 at device 19.4 on pci0
ohci4: [GIANT-LOCKED]
ohci4: [ITHREAD]
usb4: OHCI version 1.0, legacy support
usb4: SMM does not respond, resetting
usb4: <OHCI (generic) USB controller> on ohci4
usb4: USB revision 1.0
uhub4: <ATI OHCI root hub, class 9/0, rev 1.00/1.00, addr 1> on usb4
uhub4: 2 ports with 2 removable, self powered
ehci0: <EHCI (generic) USB 2.0 controller> mem 0xf9fff000-0xf9fff0ff irq 19 at device 19.5 on pci0
ehci0: [GIANT-LOCKED]
ehci0: [ITHREAD]
usb5: EHCI version 1.0
usb5: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4
usb5: <EHCI (generic) USB 2.0 controller> on ehci0
usb5: USB revision 2.0
uhub5: <ATI EHCI root hub, class 9/0, rev 2.00/1.00, addr 1> on usb5
uhub5: 10 ports with 10 removable, self powered
pci0: <serial bus, SMBus> at device 20.0 (no driver attached)
atapci1: <ATI IXP600 UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xff00-0xff0f at device 20.1 on pci0
ata0: <ATA channel 0> on atapci1
ata0: [ITHREAD]
hdac0: <ATI SB600 High Definition Audio Controller> mem 0xf9ff4000-0xf9ff7fff irq 16 at device 20.2 on pci0
hdac0: HDA Driver Revision: 20090329_0131
hdac0: [ITHREAD]
isab0: <PCI-ISA bridge> at device 20.3 on pci0
isa0: <ISA bus> on isab0
pcib2: <ACPI PCI-PCI bridge> at device 20.4 on pci0
pci2: <ACPI PCI bus> on pcib2
fwohci0: <VIA Fire II (VT6306)> port 0xe800-0xe87f mem 0xfebff800-0xfebfffff irq 23 at device 0.0 on pci2
fwohci0: [FILTER]
fwohci0: OHCI version 1.10 (ROM=1)
fwohci0: No. of Isochronous channels is 4.
fwohci0: EUI64 00:dc:10:00:01:53:a0:bc
fwohci0: Phy 1394a available S400, 2 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: <IEEE1394(FireWire) bus> on fwohci0
dcons_crom0: <dcons configuration ROM> on firewire0
dcons_crom0: bus_addr 0x1464000
fwe0: <Ethernet over FireWire> on firewire0
if_fwe0: Fake Ethernet address: 02:dc:10:53:a0:bc
fwe0: Ethernet address: 02:dc:10:53:a0:bc
fwip0: <IP over FireWire> on firewire0
fwip0: Firewire address: 00:dc:10:00:01:53:a0:bc @ 0xfffe00000000, S400, maxrec 2048
sbp0: <SBP-2/SCSI over FireWire> on firewire0
fwohci0: Initiate bus reset
fwohci0: BUS reset
fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
fxp0: <Intel 82559 Pro/100 Ethernet> port 0xe400-0xe43f mem 0xfebfe000-0xfebfefff,0xfea00000-0xfeafffff irq 22 at device 2.0 on pci2
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> PHY 1 on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:04:ac:d3:78:23
fxp0: [ITHREAD]
acpi_button0: <Power Button> on acpi0
sio0: configured irq 3 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0: configured irq 3 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 flags 0x10 on acpi0
sio0: type 16550A
sio0: [FILTER]
fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FILTER]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
atkbd0: [ITHREAD]
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: [ITHREAD]
psm0: model IntelliMouse, device ID 3
cpu0: <ACPI CPU> on acpi0
acpi_throttle0: <ACPI CPU Throttling> on cpu0
acpi_throttle0: CLK_VAL field overlaps THT_EN bit
device_attach: acpi_throttle0 attach returned 6
cpu1: <ACPI CPU> on acpi0
cpu2: <ACPI CPU> on acpi0
cpu3: <ACPI CPU> on acpi0
ppc0: cannot reserve I/O port range
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
ugen0: <American Power Conversion Back-UPS XS 1200 FW:8.g1 .D USB FW:g1, class 0/0, rev 1.10/1.06, addr 2> on uhub4
Timecounters tick every 1.000 msec
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
acd0: CDRW <HL-DT-ST GCE-8523B/1.01> at ata0-slave UDMA33
ad4: 476940MB <WDC WD5000AAKS-00YGA0 12.01C02> at ata2-master SATA300
ad6: 476940MB <WDC WD5000AAKS-00YGA0 12.01C02> at ata3-master SATA300
hdac0: HDA Codec #0: Realtek ALC888
pcm0: <HDA Realtek ALC888 PCM #0 Analog> at cad 0 nid 1 on hdac0
pcm1: <HDA Realtek ALC888 PCM #1 Analog> at cad 0 nid 1 on hdac0
pcm2: <HDA Realtek ALC888 PCM #2 Digital> at cad 0 nid 1 on hdac0
GEOM_MIRROR: Device mirror/gm0 launched (2/2).
GEOM_LABEL: Label for provider mirror/gm0s1a is ufsid/47c95ad36ed40e80.
GEOM_LABEL: Label for provider mirror/gm0s1d is ufsid/47c95ae2ac3d5ead.
GEOM_LABEL: Label for provider mirror/gm0s1e is ufsid/47c95ad3fa3660aa.
GEOM_LABEL: Label for provider mirror/gm0s1f is ufsid/47c95ad37ffedafc.
acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=0x24 ascq=0x00
(probe0:ata0:0:1:0): TEST UNIT READY. CDB: 0 0 0 0 0 0
(probe0:ata0:0:1:0): CAM Status: SCSI Status Error
(probe0:ata0:0:1:0): SCSI Status: Check Condition
(probe0:ata0:0:1:0): NOT READY asc:3a,1
(probe0:ata0:0:1:0): Medium not present - tray closed
(probe0:ata0:0:1:0): Unretryable error
acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=0x24 ascq=0x00
cd0 at ata0 bus 0 target 1 lun 0
cd0: <HL-DT-ST CD-RW GCE-8523B 1.01> Removable CD-ROM SCSI-0 device
cd0: 33.000MB/s transfers
cd0: Attempt to query device size failed: NOT READY, Medium not present - tray closed
SMP: AP CPU #1 Launched!
SMP: AP CPU #2 Launched!
SMP: AP CPU #3 Launched!
Trying to mount root from ufs:/dev/mirror/gm0s1a
GEOM_LABEL: Label ufsid/47c95ad36ed40e80 removed.
GEOM_LABEL: Label for provider mirror/gm0s1a is ufsid/47c95ad36ed40e80.
GEOM_LABEL: Label ufsid/47c95ad3fa3660aa removed.
GEOM_LABEL: Label for provider mirror/gm0s1e is ufsid/47c95ad3fa3660aa.
GEOM_LABEL: Label ufsid/47c95ad37ffedafc removed.
GEOM_LABEL: Label for provider mirror/gm0s1f is ufsid/47c95ad37ffedafc.
GEOM_LABEL: Label ufsid/47c95ae2ac3d5ead removed.
GEOM_LABEL: Label for provider mirror/gm0s1d is ufsid/47c95ae2ac3d5ead.
GEOM_LABEL: Label ufsid/47c95ad36ed40e80 removed.
GEOM_LABEL: Label ufsid/47c95ad3fa3660aa removed.
GEOM_LABEL: Label ufsid/47c95ad37ffedafc removed.
GEOM_LABEL: Label ufsid/47c95ae2ac3d5ead removed.
[dan@polo:/usr/home/dan] $
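
For anyone comparing storm frequency before and after a change such as the hw.acpi.osname setting, here is a rough sketch of how the syslog excerpts in this message could be tallied per IRQ. It is not part of the original report. The function name count_storms and the two regular expressions are illustrative assumptions based only on the message formats quoted in this message, and the sketch assumes that a "last message repeated N times" line refers to the storm line immediately before it.

```python
import re
from collections import Counter

# Matches the kernel's storm message and captures the IRQ name, e.g. "irq22:".
# The pattern is an assumption based on the log lines quoted in this thread.
STORM_RE = re.compile(r'interrupt storm detected on "([^"]+)"')
# Matches syslog's summary line for identical consecutive messages.
REPEAT_RE = re.compile(r'last message repeated (\d+) times')


def count_storms(lines):
    """Return a Counter mapping IRQ name to the number of storm reports."""
    counts = Counter()
    last_irq = None  # IRQ named by the most recent storm line, if any
    for line in lines:
        storm = STORM_RE.search(line)
        if storm:
            last_irq = storm.group(1)
            counts[last_irq] += 1
            continue
        repeat = REPEAT_RE.search(line)
        if repeat:
            # Credit the repeats to the preceding storm line only.
            if last_irq is not None:
                counts[last_irq] += int(repeat.group(1))
        else:
            # Any unrelated message ends the run of repeats.
            last_irq = None
    return counts


if __name__ == "__main__":
    sample = [
        'Jul 6 20:12:10 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source',
        'Jul 6 20:12:41 polo last message repeated 31 times',
        'Jul 6 20:14:42 polo last message repeated 121 times',
        'Jul 6 20:17:09 polo last message repeated 147 times',
    ]
    for irq, total in count_storms(sample).items():
        print(irq, total)
```

Run against the July 6 excerpt above, this counts the first report plus the 31, 121 and 147 repeats for irq22. In the dmesg output, irq22 is the line shared by atapci0 and fxp0.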