From owner-freebsd-i386@FreeBSD.ORG Thu Jun 30 17:07:13 2005
Return-Path:
X-Original-To: freebsd-i386@freebsd.org
Delivered-To: freebsd-i386@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id 40EAD16A421
	for ; Thu, 30 Jun 2005 17:07:13 +0000 (GMT)
	(envelope-from jr@jrssite.com)
Received: from hermes.acsalaska.net (hermes.acsalaska.net [209.112.173.230])
	by mx1.FreeBSD.org (Postfix) with ESMTP id 3238E43D66
	for ; Thu, 30 Jun 2005 17:07:11 +0000 (GMT)
	(envelope-from jr@jrssite.com)
Received: from [192.168.77.37] (209-193-42-90-cdsl-rb1.sit.acsalaska.net [209.193.42.90])
	by hermes.acsalaska.net (8.13.4/8.13.4) with ESMTP id j5UH71DT015891
	for ; Thu, 30 Jun 2005 09:07:09 -0800 (AKDT)
	(envelope-from jr@jrssite.com)
Message-ID: <42C426C1.6090307@jrssite.com>
Date: Thu, 30 Jun 2005 09:07:13 -0800
From: JR Dalrymple
User-Agent: Mozilla Thunderbird 1.0.2 (X11/20050401)
X-Accept-Language: en-us, en
MIME-Version: 1.0
To: freebsd-i386@freebsd.org
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
X-ACS-Spam-Status: no
X-ACS-Scanned-By: MD 2.52; SA 3.0.4; spamdefang 1.112
Subject: ipi stuck problem on 5.3
X-BeenThere: freebsd-i386@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: I386-specific issues for FreeBSD
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
X-List-Received-Date: Thu, 30 Jun 2005 17:07:13 -0000

I am working on a Dell PowerEdge 6100/200 with four 200 MHz Pentium Pro CPUs. I built it with a GENERIC/SMP kernel and it handles normal day-to-day operations fine (smb/cifs, intranet, dns cache, ftp). Whenever I try a CPU-intensive operation (dump, or building a large port) it panics with "Panic: apic: previous ipi is stuck" and I have to reboot it.

I found this patch:
http://lists.freebsd.org/pipermail/freebsd-current/2004-November/043697.html

That patch seems to be included in the current source, so I updated all my source and was going to make buildworld, then rebuild the kernel. I rebooted into the non-SMP kernel, assuming that the problem was SMP related. When I ran make buildworld, it went through roughly the first couple of hours without a hitch. I had it running in a screen virtual terminal, so I disconnected and walked away (yes, I was working in multi-user mode). I got a call at 7:30 this morning saying no one could access their SMB shares. When I got to the console it showed the same panic message.

So the question is: is the problem SMP related or not? Really, that's immaterial. More importantly, how do I rebuild and patch if the machine panics whenever I try to do it?

I'll attach the dmesg at the bottom for good measure. What am I missing here? Suggestions welcome.

Thanks
JR

Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 5.3-RELEASE #0: Fri Nov 5 04:19:18 UTC 2004
    root@harlow.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC
MPTable:
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Pentium Pro (198.95-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x619 Stepping = 9
  Features=0xfbff
real memory = 536870912 (512 MB)
avail memory = 515698688 (491 MB)
ioapic0: Assuming intbase of 0
ioapic0 irqs 0-15 on motherboard
npx0: [FAST]
npx0: on motherboard
npx0: INT 16 interface
pcib0: pcibus 0 on motherboard
pci0: on pcib0
fxp0: port 0xff40-0xff5f mem 0xfe900000-0xfe9fffff,0xfe2ff000-0xfe2fffff irq 10 at device 12.0 on pci0
miibus0: on fxp0
inphy0: on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:a0:c9:89:ca:86
xl0: <3Com 3c905C-TX Fast Etherlink XL> port 0xfc80-0xfcff mem 0xfe8ffc00-0xfe8ffc7f irq 9 at device 13.0 on pci0
miibus1: on xl0
ukphy0: on miibus1
ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
xl0: Ethernet address: 00:04:75:ad:a2:a4
eisab0: at device 14.0 on pci0
eisa0: on eisab0
mainboard0: on eisa0 slot 0
isa0: on eisab0
pci0: at device 15.0 (no driver attached)
pci0: at device 20.0 (no driver attached)
pcib1: pcibus 1 on motherboard
pci1: on pcib1
amr0: port 0xec80-0xecff irq 11 at device 10.0 on pci1
amr0: [GIANT-LOCKED]
amr0: Firmware U.75, BIOS 1.44, 16MB RAM
ahc0: port 0xe800-0xe8ff mem 0xfe1fb000-0xfe1fbfff irq 11 at device 11.0 on pci1
ahc0: Using left over BIOS settings
ahc0: [GIANT-LOCKED]
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
ahc1: port 0xe400-0xe4ff mem 0xfe1fa000-0xfe1fafff irq 10 at device 12.0 on pci1
ahc1: Using left over BIOS settings
ahc1: [GIANT-LOCKED]
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
xl1: <3Com 3c905B-TX Fast Etherlink XL> port 0xec00-0xec7f mem 0xfe1f9c00-0xfe1f9c7f irq 3 at device 14.0 on pci1
miibus2: on xl1
xlphy0: <3Com internal media interface> on miibus2
xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
xl1: Ethernet address: 00:10:4b:69:75:6d
cpu0 on motherboard
orm0: at iomem 0xec000-0xeffff,0xea000-0xebfff,0xe8000-0xe9fff,0xcd800-0xcdfff,0xcd000-0xcd7ff,0xcc800-0xccfff,0xc0000-0xc7fff on isa0
pmtimer0 on isa0
ata0 at port 0x3f6,0x1f0-0x1f7 irq 14 on isa0
ata1 at port 0x376,0x170-0x177 irq 15 on isa0
atkbdc0: at port 0x64,0x60 on isa0
atkbd0: irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
fdc0: at port 0x3f0-0x3f5 irq 6 drq 2 on isa0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0: at port 0x378-0x37f irq 7 on isa0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0: on ppc0
plip0: on ppbus0
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
unknown: can't assign resources (port)
psmcpnp0: irq resource info is missing; assuming irq 12
unknown: can't assign resources (port)
unknown: can't assign resources (port)
unknown: can't assign resources (port)
unknown: can't assign resources (port)
unknown: can't assign resources (irq)
Timecounter "TSC" frequency 198948670 Hz quality 800
Timecounters tick every 10.000 msec
Waiting 15 seconds for SCSI devices to settle
amrd0: on amr0
amrd0: 8568MB (17547264 sectors) RAID 1 (optimal)
amrd1: on amr0
amrd1: 25704MB (52641792 sectors) RAID 5 (optimal)
sa0 at ahc0 bus 0 target 6 lun 0
sa0: Removable Sequential Access SCSI-2 device
sa0: 10.000MB/s transfers (10.000MHz, offset 15)
ses0 at amr0 bus 0 target 6 lun 0
ses0: Fixed Processor SCSI-2 device
ses0: SAF-TE Compliant Device
cd0 at ahc1 bus 0 target 6 lun 0
cd0: Removable CD-ROM SCSI-2 device
cd0: 3.300MB/s transfers
cd0: Attempt to query device size failed: NOT READY, Medium not present
Mounting root from ufs:/dev/amrd0s1a
WARNING: / was not properly dismounted
WARNING: /usr was not properly dismounted
/usr: superblock summary recomputed
WARNING: /var was not properly dismounted
/var: mount pending error: blocks 8 files 2