From owner-freebsd-stable Fri Feb 16 18:22: 0 2001 Delivered-To: freebsd-stable@freebsd.org Received: from boromir.vpop.net (boromir.vpop.net [206.117.147.2]) by hub.freebsd.org (Postfix) with ESMTP id AF39237B4EC for ; Fri, 16 Feb 2001 18:21:53 -0800 (PST) Received: from vpop.net ([209.102.16.48]) by boromir.vpop.net (8.11.2/8.11.2) with ESMTP id f1H2LnR44312; Fri, 16 Feb 2001 18:21:49 -0800 (PST) (envelope-from mreimer@vpop.net) Message-ID: <3A8DE046.1F64D37D@vpop.net> Date: Fri, 16 Feb 2001 18:21:58 -0800 From: Matthew Reimer Organization: VPOP Technologies, Inc. X-Mailer: Mozilla 4.76 [en] (X11; U; Linux 2.2.12 i386) X-Accept-Language: en MIME-Version: 1.0 To: freebsd-stable@freebsd.org Subject: Re: Reproducible kernel panic during boot; probably SCSI-related References: Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG Justin Gibbs just committed a fix for this problem: > gibbs 2001/02/16 17:53:59 PST > > Modified files: (Branch: RELENG_4) > sys/dev/aic7xxx aic7xxx_freebsd.c > Log: > MFC: Only honor the CHANNEL_B_PRIMARY flag for twin channel devices. > > Revision Changes Path > 1.3.2.15 +5 -3 src/sys/dev/aic7xxx/aic7xxx_freebsd.c Thanks to you who replied to my inquiry. Matt Matthew Reimer wrote: > > We've had a reproducible problem today with 4.2-STABLE from between > 2001/02/13 1500 PST to 2001/02/14 1500 PST, where the kernel would print > the message "Waiting 2 seconds for SCSI devices to settle" and then > panic (details below). This would happen every time. I rebuilt the > kernel several times to make sure there were no stale dependencies but > every time it panicked the same way. Eventually I cvsup'd /sys sources > from 2001/02/10 and the problem went away. I presume the problem is > still in -stable. > > I wasn't able to get a dump because *dumpdev = 0xffffffff. (Does anyone > know how to get dumpdev set properly this early in the boot process? In > -current one could use "show /disk/da0s1b" but we don't have that in > -stable.) > > This is running on an L440GX mobo with one extra 2940 controller, for a > total of three. > > Any ideas? > > Matt > > Fatal trap 12: page fault while in kernel mode > fault virtual address = 0x123 > fault code = supervisor read, page not present > instruction pointer = 0x8:0xc013a2b8 > stack pointer = 0x10:0xc03aea88 > frame pointer = 0x10:0xc03aeab4 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor eflags = interrupt enabled, resume, IOPL = 0 > current process = 0 (swapper) > interrupt mask = cam > kernel: type 12 trap, code=0 > Stopped at ahc_get_tran_settings+0xbc: movzbl 0x3(%esi),%eax > > db> trace > ahc_get_tran_settings(c163fa00,0,42,c03aeb84) at > ahc_get_tran_settings+0xbc > ahc_action(c163d740,c03aeb84) at ahc_action+0x636 > xpt_action(c03aeb84,c03aeb84,c0a34240,1,c03aec10) at xpt_action+0x376 > proberequestdefaultnegotiation(c16c2a00,c16c501c,c16c2a00,1,1) at > proberequestdefaultnegotiation+0x2c > probeschedule(c16c2a00,c16c2a00,c163d718,7d0,c16c2a00) at > probeschedule+0x6d > proberegister(c16c2a00,c16c5000,6c0000,c03aed60,c16c5000) at > proberegister+0x7c > cam_periph_alloc(c0122078,0,c01228e4,c01221cc,c02359d8) at > cam_periph_alloc+0x1e2 > xpt_scan_lun(c1640c00,c0a34260,10003201,c16c5000) at xpt_scan_lun+0x1c5 > xpt_action(c16c5000,c16c5000,c0a34260,1) at xpt_action+0x946 > xpt_scan_bus(c1640c00,c16c5800) at xpt_scan_bus+0x123 > xpt_action(c16c5800,c16c5800,c163d700,c011f8a4,c03aef28) at > xpt_action+0x92c > xpt_finishconfig(c1640c00,c16c5800,c163d380,c03aef68,c0a34280) at > xpt_finishconfig+0x30 > xptconfigfunc(c163d700,0,c03aef54,c011f723,c163d700) at > xptconfigfunc+0xfb > xptdefbusfunc(c163d700,c03aef68) at xptdefbusfunc+0x29 > xptbustraverse(0,c011f8a4,c03aef68,0,c0122d70) at xptbustraverse+0x2b > xpt_for_all_busses(c0122d70,0,c03aef94,c0160f9e,0) at > xpt_for_all_busses+0x29 > xpt_config(0) at xpt_config+0x50 > run_interrupt_driven_config_hooks(0,3acc00,3b3000,0,c011d414) at > run_interrupt_driven_config_hooks+0x1a > mi_startup(c03aefb4,b0202,ffe,3b3000,c0166da5) at mi_startup+0x70 > begin() at begin+0x4b > > --- dmesg of a working kernel from 2001.02.10.00.00.00 > > FreeBSD 4.2-STABLE #4: Wed Feb 14 17:06:26 PST 2001 > mreimer@thorin.vpop.net:/usr/obj/usr/src/sys/RING > Timecounter "i8254" frequency 1193182 Hz > Timecounter "TSC" frequency 547179703 Hz > CPU: Pentium III/Pentium III Xeon/Celeron (547.18-MHz 686-class CPU) > Origin = "GenuineIntel" Id = 0x673 Stepping = 3 > > Features=0x387fbff OV,PAT,PSE36,PN,MMX,FXSR,SSE> > real memory = 268369920 (262080K bytes) > avail memory = 257646592 (251608K bytes) > Preloaded elf kernel "kernel" at 0xc038d000. > Pentium Pro MTRR support enabled > npx0: on motherboard > npx0: INT 16 interface > pcib0: on motherboard > pci0: on pcib0 > pcib2: at device 1.0 on > pci0 > pci1: on pcib2 > pcib3: at device 15.0 on > pci1 > pci2: on pcib3 > ahc0: port 0x1400-0x14ff mem > 0xf4100000 > -0xf4100fff irq 11 at device 12.0 on pci0 > aic7896/97: Wide Channel A, SCSI Id=7, 32/255 SCBs > ahc1: port 0x1800-0x18ff mem > 0xf4101000 > -0xf4101fff irq 11 at device 12.1 on pci0 > aic7896/97: Wide Channel B, SCSI Id=7, 32/255 SCBs > fxp0: port 0x1080-0x10bf mem > 0xf4000000-0xf40f > ffff,0xf4102000-0xf4102fff irq 10 at device 14.0 on pci0 > fxp0: Ethernet address 00:a0:c9:fb:26:e6 > ahc2: port 0x2000-0x20ff mem > 0xf4103000-0xf4103fff i > rq 5 at device 16.0 on pci0 > aic7870: Wide Channel A, SCSI Id=7, 16/255 SCBs > isab0: at device 18.0 on pci0 > isa0: on isab0 > pci0: at 18.1 > pci0: at 18.2 irq 10 > chip1: port 0x1040-0x104f at > device > 18.3 on pci0 > pci0: at 20.0 > pcib1: on motherboard > pci3: on pcib1 > fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on > isa0 > fdc0: FIFO enabled, 8 bytes threshold > fd0: <1440-KB 3.5" drive> on fdc0 drive 0 > atkbdc0: at port 0x60,0x64 on isa0 > atkbd0: irq 1 on atkbdc0 > vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on > isa0 > sc0: on isa0 > sc0: VGA <16 virtual consoles, flags=0x0> > sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 > sio0: type 16550A, console > sio1: configured irq 3 not in bitmap of probed irqs 0 > IP packet filtering initialized, divert disabled, rule-based forwarding > enabled, > default to deny, logging disabled > Waiting 2 seconds for SCSI devices to settle > sa0 at ahc2 bus 0 target 0 lun 0 > sa0: Removable Sequential Access SCSI-2 device > sa0: 10.000MB/s transfers (10.000MHz, offset 15) > da1 at ahc0 bus 0 target 1 lun 0 > da1: Fixed Direct Access SCSI-3 device > da1: 40.000MB/s transfers (20.000MHz, offset 31, 16bit), Tagged Queueing > Enabled > da1: 35003MB (71687340 512 byte sectors: 255H 63S/T 4462C) > da0 at ahc0 bus 0 target 0 lun 0 > da0: Fixed Direct Access SCSI-3 device > da0: 40.000MB/s transfers (20.000MHz, offset 31, 16bit), Tagged Queueing > Enabled > da0: 8748MB (17916240 512 byte sectors: 255H 63S/T 1115C) > ch0 at ahc2 bus 0 target 1 lun 0 > ch0: Removable Changer SCSI-2 device > ch0: 3.300MB/s transfers > ch0: 8 slots, 1 drive, 1 picker, 0 portals > da2 at ahc1 bus 0 target 2 lun 0 > da2: Fixed Direct Access SCSI-3 device > da2: 40.000MB/s transfers (20.000MHz, offset 31, 16bit), Tagged Queueing > Enabled > da2: 35003MB (71687340 512 byte sectors: 255H 63S/T 4462C) > Mounting root from ufs:/dev/da0s1a > > To Unsubscribe: send mail to majordomo@FreeBSD.org > with "unsubscribe freebsd-stable" in the body of the message To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message