From: "Daniel Eriksson"
Date: Fri, 16 Jul 2004 07:38:51 +0200
Organization: Home
In-Reply-To: <40F748F3.6020702@alumni.rice.edu>
Cc: freebsd-current@freebsd.org
Subject: RE: consistent hard lockup with recent kernels
List-Id: Discussions about the use of FreeBSD-current
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----=_NextPart_000_0000_01C46B07.F3CD7D10"

This is a multi-part message in MIME format.

------=_NextPart_000_0000_01C46B07.F3CD7D10
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit

Jon Noack wrote:
> Yeah, it is most likely bugs uncovered by native preemption. For now
> you can remask them by commenting out the "#define PREEMPTION" in
> src/sys//include/param.h and rebuilding your kernel. Robert
> Watson's comments on this:
> "This results in a quite usable kernel, although quite probably more
> poor interrupt handling latency, etc."

I just tried a new kernel (2004.07.16.01.00.00) with preemption turned on,
and my problems with interrupt storms and SATA still remain. With
preemption turned off the machine seems stable, but somewhat sluggish.

With preemption enabled I get interrupt storms and DMA problems with the
SATA devices, like the ones below, when trying to access all discs at once
(running fsck on all 11 file systems simultaneously). ad20 is one of two
SATA devices in the system.

Interrupt storm detected on "irq20: atapci4"; throttling interrupt source
ad20: FAILURE - already active DMA on this device
ad20: setting up DMA failed

Attached is the dmesg.boot from the old kernel (2004.07.12.22.00.00) with
preemption turned off that seems to be stable.

I'm using HZ=2000, DEVICE_POLLING and ZERO_COPY_SOCKETS, plus device apic.
I also have net.isr enabled.

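To see which interrupt sources are being throttled, and how often, the console messages quoted above can be tallied with a short script. What follows is a minimal sketch in Python, assuming the storm message keeps the format shown above; the function name count_storms and the sample text are illustrative only and are not part of any FreeBSD tool.

```python
import re
from collections import Counter

# Matches the console message quoted in this mail, for example:
#   Interrupt storm detected on "irq20: atapci4"; throttling interrupt source
# The exact wording is an assumption based on that single quoted line.
STORM_RE = re.compile(r'interrupt storm detected on "([^"]+)"', re.IGNORECASE)


def count_storms(log_text):
    """Return a Counter mapping interrupt source -> number of storm messages."""
    counts = Counter()
    for line in log_text.splitlines():
        match = STORM_RE.search(line)
        if match:
            counts[match.group(1).strip()] += 1
    return counts


if __name__ == "__main__":
    # Sample input copied from the messages in this mail.
    sample = (
        'Interrupt storm detected on "irq20: atapci4"; throttling interrupt source\n'
        "ad20: FAILURE - already active DMA on this device\n"
        "ad20: setting up DMA failed\n"
    )
    for source, hits in count_storms(sample).most_common():
        print(f"{source}: {hits} storm message(s)")
```

Feeding it the saved console log instead of the sample string would show whether atapci4 is the only source being throttled or whether other controllers are affected too.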
'vmstat -i' gives the following output:

interrupt                          total       rate
irq1: atkbd0                         248          0
irq0: clk                        8176853       1993
irq4: sio0                            12          0
irq6: fdc0                             6          0
irq8: rtc                         524902        127
irq13: npx0                            1          0
irq14: ata0                        33148          8
irq15: ata1                        24641          6
irq16: atapci0+                   867398        211
irq17: atapci2+                   296877         72
irq18: re0                             2          0
irq19: ahc0                        94035         22
irq20: atapci4                    184381         44
irq23: vr0                             2          0
Total                           10202506       2487

/Daniel Eriksson

------=_NextPart_000_0000_01C46B07.F3CD7D10
Content-Type: application/octet-stream; name="dmesg.boot"
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment; filename="dmesg.boot"

Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 5.2-CURRENT #0: Tue Jul 13 03:17:24 CEST 2004
    daniel@xxx.xxx.xxx:/usr/obj/usr/src/sys/FORTIFY
WARNING: Kernel preemption is disabled, expect reduced performance.
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Athlon(TM) XP 2500+ (1999.78-MHz 686-class CPU)
  Origin = "AuthenticAMD" Id = 0x6a0 Stepping = 0
  Features=0x383fbff
  AMD Features=0xc0400000
real memory = 1342156800 (1279 MB)
avail memory = 1304948736 (1244 MB)
ACPI APIC Table:
ioapic0: Changing APIC ID to 2
ioapic0 irqs 0-23 on motherboard
npx0: [FAST]
npx0: on motherboard
npx0: INT 16 interface
acpi0: on motherboard
acpi0: [GIANT-LOCKED]
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <32-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0
cpu0: on acpi0
acpi_button0: on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pci0: on pcib0
agp0: mem 0xf8000000-0xfbffffff at device 0.0 on pci0
pcib1: at device 1.0 on pci0
pci1: on pcib1
pci1: at device 0.0 (no driver attached)
atapci0: port 0xb400-0xb4ff,0xb800-0xb803,0xd000-0xd007,0xd400-0xd403,0xd800-0xd807 irq 16 at device 10.0 on pci0
ata2: at 0xd800 on atapci0
ata3: at 0xd000 on atapci0
atapci1: port 0x9800-0x98ff,0xa000-0xa003,0xa400-0xa407,0xa800-0xa803,0xb000-0xb007 irq 16 at device 10.1 on pci0
ata4: at 0xb000 on atapci1
ata5: at 0xa400 on atapci1
ahc0: port 0x9400-0x94ff mem 0xed800000-0xed800fff irq 19 at device 12.0 on pci0
ahc0: [GIANT-LOCKED]
aic7892: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs
atapci2: port 0x7800-0x78ff,0x8000-0x8003,0x8400-0x8407,0x8800-0x8803,0x9000-0x9007 irq 17 at device 14.0 on pci0
ata6: at 0x9000 on atapci2
ata7: at 0x8400 on atapci2
atapci3: port 0x6000-0x60ff,0x6400-0x6403,0x6800-0x6807,0x7000-0x7003,0x7400-0x7407 irq 17 at device 14.1 on pci0
ata8: at 0x7400 on atapci3
ata9: at 0x6800 on atapci3
atapci4: port 0x4000-0x40ff,0x4400-0x440f,0x4800-0x4803,0x5000-0x5007,0x5400-0x5403,0x5800-0x5807 irq 20 at device 15.0 on pci0
ata10: at 0x5800 on atapci4
ata11: at 0x5000 on atapci4
atapci5: port 0x3800-0x380f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 irq 20 at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci5
ata1: at 0x170 irq 15 on atapci5
uhci0: port 0x3400-0x341f irq 21 at device 16.0 on pci0
uhci0: [GIANT-LOCKED]
usb0: on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: port 0x3000-0x301f irq 21 at device 16.1 on pci0
uhci1: [GIANT-LOCKED]
usb1: on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2: port 0x2800-0x281f irq 21 at device 16.2 on pci0
uhci2: [GIANT-LOCKED]
usb2: on uhci2
usb2: USB revision 1.0
uhub2: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhci3: port 0x2400-0x241f irq 21 at device 16.3 on pci0
uhci3: [GIANT-LOCKED]
usb3: on uhci3
usb3: USB revision 1.0
uhub3: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub3: 2 ports with 2 removable, self powered
ehci0: mem 0xed000000-0xed0000ff irq 21 at device 16.4 on pci0
ehci0: [GIANT-LOCKED]
ehci_pci_attach: companion usb0
ehci_pci_attach: companion usb1
ehci_pci_attach: companion usb2
ehci_pci_attach: companion usb3
usb4: EHCI version 1.0
usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3
usb4: on ehci0
usb4: USB revision 2.0
uhub4: (0x1106) EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub4: 8 ports with 8 removable, self powered
isab0: at device 17.0 on pci0
isa0: on isab0
vr0: port 0x2000-0x20ff mem 0xec800000-0xec8000ff irq 23 at device 18.0 on pci0
miibus0: on vr0
rlphy0: on miibus0
rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
vr0: Ethernet address: 00:0e:a6:1f:29:1e
vr0: [GIANT-LOCKED]
re0: port 0x1800-0x18ff mem 0xec000000-0xec0000ff irq 18 at device 19.0 on pci0
miibus1: on re0
rgephy0: on miibus1
rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
re0: Ethernet address: 00:50:fc:f8:c6:81
re0: [GIANT-LOCKED]
fdc0: port 0x3f7,0x3f2-0x3f5 irq 6 drq 2 on acpi0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0 port 0x3f8-0x3ff irq 4 on acpi0
sio0: type 16550A
atkbdc0: port 0x64,0x60 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
orm0: at iomem 0xc0000-0xcafff on isa0
pmtimer0 on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 1999782834 Hz quality 800
Timecounters tick every 1.000 msec
ipfw2 initialized, divert enabled, rule-based forwarding enabled, default to accept, logging unlimited
ad0: 114473MB [232581/16/63] at ata0-master UDMA100
ad1: 114473MB [232581/16/63] at ata0-slave UDMA100
ad2: 117800MB [239340/16/63] at ata1-master UDMA100
ad3: 117800MB [239340/16/63] at ata1-slave UDMA100
ad4: 238475MB [484521/16/63] at ata2-master UDMA100
ad5: 238475MB [484521/16/63] at ata2-slave UDMA100
ad6: 239372MB [486344/16/63] at ata3-master UDMA133
ad7: 239372MB [486344/16/63] at ata3-slave UDMA133
ad8: 194481MB [395136/16/63] at ata4-master UDMA133
ad9: 194481MB [395136/16/63] at ata4-slave UDMA133
ad10: 194481MB [395136/16/63] at ata5-master UDMA133
ad11: 239372MB [486344/16/63] at ata5-slave UDMA133
ad12: 117800MB [239340/16/63] at ata6-master UDMA100
ad13: 117800MB [239340/16/63] at ata6-slave UDMA100
ad14: 117246MB [238216/16/63] at ata7-master UDMA133
ad15: 117246MB [238216/16/63] at ata7-slave UDMA133
ad16: 114473MB [232581/16/63] at ata8-master UDMA100
ad18: 26059MB [52946/16/63] at ata9-master UDMA66
ad20: 239372MB [486344/16/63] at ata10-master SATA150
ad22: 238475MB [484521/16/63] at ata11-master SATA150
ar0: 476950MB [60802/255/63] status: READY subdisks:
disk0 READY on ad4 at ata2-master
disk1 READY on ad5 at ata2-slave
ar1: 478744MB [61031/255/63] status: READY subdisks:
disk0 READY on ad6 at ata3-master
disk1 READY on ad7 at ata3-slave
ar2: 388962MB [49585/255/63] status: READY subdisks:
disk0 READY on ad9 at ata4-slave
disk1 READY on ad8 at ata4-master
Waiting 5 seconds for SCSI devices to settle
sa0 at ahc0 bus 0 target 5 lun 0
sa0: Removable Sequential Access SCSI-2 device
sa0: 10.000MB/s transfers (10.000MHz, offset 15)
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-3 device
da0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit), Tagged Queueing Enabled
da0: 8759MB (17938986 512 byte sectors: 255H 63S/T 1116C)
da1 at ahc0 bus 0 target 1 lun 0
da1: Fixed Direct Access SCSI-3 device
da1: 40.000MB/s transfers (20.000MHz, offset 31, 16bit), Tagged Queueing Enabled
da1: 8759MB (17938986 512 byte sectors: 255H 63S/T 1116C)
da2 at ahc0 bus 0 target 10 lun 0
da2: Fixed Direct Access SCSI-3 device
da2: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da2: 35003MB (71687340 512 byte sectors: 255H 63S/T 4462C)
da3 at ahc0 bus 0 target 11 lun 0
da3: Fixed Direct Access SCSI-3 device
da3: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da3: 35003MB (71687340 512 byte sectors: 255H 63S/T 4462C)
da4 at ahc0 bus 0 target 12 lun 0
da4: Fixed Direct Access SCSI-3 device
da4: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da4: 35003MB (71687340 512 byte sectors: 255H 63S/T 4462C)
da5 at ahc0 bus 0 target 13 lun 0
da5: Fixed Direct Access SCSI-3 device
da5: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da5: 35003MB (71687340 512 byte sectors: 255H 63S/T 4462C)
Mounting root from ufs:/dev/ad0s1a

------=_NextPart_000_0000_01C46B07.F3CD7D10--