From owner-freebsd-current Mon Jul 16 23:26:57 2001 Delivered-To: freebsd-current@freebsd.org Received: from white.dogwood.com (white.dogwood.com [63.96.228.130]) by hub.freebsd.org (Postfix) with ESMTP id 8BFA537B406; Mon, 16 Jul 2001 23:26:51 -0700 (PDT) (envelope-from dave@dogwood.com) Received: (from dave@localhost) by white.dogwood.com (8.11.4/8.11.3) id f6H6Qhc08309; Mon, 16 Jul 2001 23:26:43 -0700 (PDT) (envelope-from dave) From: Dave Cornejo Message-Id: <200107170626.f6H6Qhc08309@white.dogwood.com> Subject: more on supermicro 6010H hang To: gibbs@scsiguy.com, jhb@freebsd.org Date: Mon, 16 Jul 2001 23:26:43 -0700 (PDT) Cc: freebsd-current@freebsd.org X-Mailer: ELM [version 2.4ME+ PL88 (25)] MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG I have isolated the point at which current no longer runs as Jan 31 - Feb 1 of this year. Prior version work fine, in Feb & Mar I get either "Kernel trap 9 with interrupts disabled" or I think the same thing with trap 26 (really not sure on that one). Next I took a brand new current from this evening and tried it - it still hangs, but a keypress on the keyboard pretty much always breaks it out of the hang and into a normal boot. Now, I finally got the equipment and time together to remote gdb the bad kernel and here's what I get: I set a breakpoint at cam_xpt.c::xpt_config() - this is where the "Waiting 15 seconds.." message is from and stepped down through it. I get through the first xpt_for_all_busses (xptconfigbuscountfunc,...) and then I hit the second one (~line 6749 of cam_xpt.c) I pass through several things, including the xptconfigfunc() and end up in subr_autoconf.c::run_interrupt_driven_config_hooks(). At the bottom of this function there is a tsleep that gets called - this is apparently where it hangs. If I hit a key on the keyboard it will continue on past this point and all seems to work fine from then on. This is my first time this deep into the kernel - can you suggest a further plan of attack? thanks! dave c here's the dmesg output for this system if this helps any: Copyright (c) 1992-2001 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.0-CURRENT #0: Mon Jul 16 22:32:23 PDT 2001 dave@juneau.dogwood.com:/usr/src/sys/i386/compile/SMP Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon/Celeron (999.53-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x686 Stepping = 6 Features=0x383fbff real memory = 1073676288 (1048512K bytes) avail memory = 1040248832 (1015868K bytes) Programming 16 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 Programming 16 pins in IOAPIC #1 FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 4, version: 0x000f0011, at 0xfec00000 io1 (APIC): apic id: 5, version: 0x000f0011, at 0xfec01000 Preloaded elf kernel "kernel" at 0xc0527000. Pentium Pro MTRR support enabled WARNING: Driver mistake: destroy_dev on 154/0 Using $PIR table, 7 entries at 0xc00f5370 npx0: on motherboard npx0: INT 16 interface pcib0: at pcibus 0 on motherboard IOAPIC #1 intpin 12 -> irq 2 IOAPIC #1 intpin 10 -> irq 5 IOAPIC #1 intpin 11 -> irq 7 IOAPIC #1 intpin 15 -> irq 9 pci0: on pcib0 pcib1: at device 0.1 on pci0 IOAPIC #1 intpin 14 -> irq 11 pci1: on pcib1 pci1: at 0.0 (no driver attached) fxp0: port 0xc800-0xc83f mem 0xfe800000-0xfe8fffff,0xfeafb000-0xfeafbfff irq 2 at device 4.0 on pci0 fxp0: Ethernet address 00:30:48:11:69:84 inphy0: on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto ahc0: port 0xd000-0xd0ff mem 0xfeafc000-0xfeafcfff irq 5 at device 5.0 on pci0 aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/255 SCBs ahc1: port 0xd800-0xd8ff mem 0xfeaff000-0xfeafffff irq 7 at device 5.1 on pci0 aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/255 SCBs fxp1: port 0xd400-0xd43f mem 0xfe900000-0xfe9fffff,0xfeafd000-0xfeafdfff irq 9 at device 6.0 on pci0 fxp1: Ethernet address 00:30:48:11:6e:27 inphy1: on miibus1 inphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto isab0: port 0x580-0x58f at device 15.0 on pci0 isa0: on isab0 atapci0: port 0xffa0-0xffaf at device 15.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 ohci0: mem 0xfeafe000-0xfeafefff irq 10 at device 15.2 on pci0 usb0: OHCI version 1.0, legacy support usb0: on ohci0 usb0: USB revision 1.0 uhub0: (unknown) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 4 ports with 4 removable, self powered pcib2: at pcibus 2 on motherboard pci2: on pcib2 orm0: