From owner-freebsd-stable Wed Dec 8 19:29: 1 1999
Delivered-To: freebsd-stable@freebsd.org
Received: from thelab.hub.org (nat196.191.mpoweredpc.net [142.177.196.191])
    by hub.freebsd.org (Postfix) with ESMTP id A56031526E;
    Wed, 8 Dec 1999 19:28:52 -0800 (PST)
    (envelope-from scrappy@hub.org)
Received: from localhost (scrappy@localhost)
    by thelab.hub.org (8.9.3/8.9.1) with ESMTP id XAA10564;
    Wed, 8 Dec 1999 23:28:51 -0400 (AST)
    (envelope-from scrappy@hub.org)
X-Authentication-Warning: thelab.hub.org: scrappy owned process doing -bs
Date: Wed, 8 Dec 1999 23:28:51 -0400 (AST)
From: The Hermit Hacker
To: freebsd-scsi@freebsd.org
Cc: mackley@tht.net, freebsd-stable@freebsd.org
Subject: SCSI problem ... OS or just bus?
Message-ID:
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-freebsd-stable@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.ORG

I recently did two upgrades in the course of a few days...upgraded my
3.3-STABLE to a more recent version, and added hard drives onto the
system...now I'm getting SCSI problems that make no sense :(

The machine just hung once more, which it's doing every few hours...I can
get down to the debugger, but a 'trace' doesn't appear to show anything,
so I panic...

==========
(da4:ahc0:0:8:0): Other SCB Timeout
(da4:ahc0:0:8:0): SCB 0xeb - timed out in dataout phase, SEQADDR == 0x10f
(da4:ahc0:0:8:0): Other SCB Timeout
(da2:ahc0:0:5:0): SCB 0x24 - timed out in dataout phase, SEQADDR == 0x10f
(da2:ahc0:0:5:0): BDR message in message buffer
(da2:ahc0:0:5:0): SCB 0x92 - timed out in dataout phase, SEQADDR == 0x10f
(da2:ahc0:0:5:0): no longer in timeout, status = 34b
ahc0: Issued Channel A Bus Reset. 98 SCBs aborted
^C^C^CStopped at siointr1+0xc5: jmp siointr1+0x1c9
db> where
No such command
db> list
No such command
db> ?
Bad character ?
db> help
print p examine x search set write w delete d break dwatch watch
step s continue c until next match trace call show ps gdb panic
db> trace
siointr1(c225d000,0,c021dc47,0,d6030010) at siointr1+0xc5
siointr(0,d6030010,77,d6034000,0) at siointr+0x13
Xfastintr4() at Xfastintr4+0x17
db> continue
^C^CStopped at siointr1+0xc5: jmp siointr1+0x1c9
db> trace
siointr1(c225d000,0,c021dc47,0,d6030010) at siointr1+0xc5
siointr(0,d6030010,65,d6031000,0) at siointr+0x13
Xfastintr4() at Xfastintr4+0x17
db> panic
panic: from debugger

syncing disks...

Fatal trap 12: page fault while in kernel mode
fault virtual address = 0x30
fault code = supervisor read, page not present
instruction pointer = 0x8:0xc01ec14c
stack pointer = 0x10:0xc025a5fc
frame pointer = 0x10:0xc025a600
code segment = base 0x0, limit 0xfffff, type 0x1b
 = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = Idle
interrupt mask = bio
trap number = 12
panic: page fault
(da0:ahc0:0:0:0): Synchronize cache failed, status == 0xb, scsi status == 0x0
======

I'm going to upgrade to the newest kernel, but none of the recent commit
messages *sound* like they are applicable...

A dmesg of the machine follows, just in case anything about the
controller/drives pops out at anyone:

======
Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
FreeBSD 3.3-STABLE #0: Wed Dec 1 09:53:53 EST 1999
root@hub.org:/usr/src/sys/compile/hub_org
Timecounter "i8254" frequency 1193182 Hz
Timecounter "TSC" frequency 501139140 Hz
CPU: Pentium III (501.14-MHz 686-class CPU)
Origin = "GenuineIntel" Id = 0x673 Stepping = 3
Features=0x387f9ff,MMX,FXSR,>
real memory = 805306368 (786432K bytes)
avail memory = 780460032 (762168K bytes)
Preloaded elf kernel "kernel" at 0xc02df000.
Pentium Pro MTRR support enabled
Probing for devices on PCI bus 0:
chip0: rev 0x02 on pci0.0.0
chip1: rev 0x02 on pci0.1.0
chip2: rev 0x02 on pci0.4.0
chip3: rev 0x02 on pci0.4.3
ahc0: rev 0x01 int a irq 14 on pci0.9.0
ahc0: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
ahc1: rev 0x01 int a irq 15 on pci0.10.0
ahc1: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs
xl0: <3Com 3c905-TX Fast Etherlink XL> rev 0x00 int a irq 12 on pci0.11.0
xl0: Ethernet address: 00:60:08:c8:36:05
xl0: autoneg complete, link status good (half-duplex, 10Mbps)
xl1: <3Com 3c905-TX Fast Etherlink XL> rev 0x00 int a irq 10 on pci0.12.0
xl1: Ethernet address: 00:60:97:d0:3c:f5
xl1: autoneg complete, link status good (half-duplex, 10Mbps)
Probing for devices on PCI bus 1:
vga0: rev 0x01 int a irq 10 on pci1.0.0
Probing for PnP devices:
CSN 1 Vendor ID: UMC9008 [0x0890a355] Serial 0xab8d1af0 Comp ID: PNP80d6 [0xd680d041]
ed1: address 00:c0:f0:1a:8d:ab, type NE2000 (16 bit)
ed1 (edpnp sn 0xab8d1af0) at 0x220-0x23f irq 11 on isa
Probing for devices on the ISA bus:
sc0 on isa
sc0: VGA color <4 virtual consoles, flags=0x0>
ed0 not found at 0x280
atkbdc0 at 0x60-0x6f on motherboard
atkbd0 irq 1 on isa
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A, console
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa
npx0 on motherboard
npx0: INT 16 interface
IP packet filtering initialized, divert enabled, rule-based forwarding disabled, default to accept, logging limited to 100 packets/entry by default
Waiting 2 seconds for SCSI devices to settle
cda2 at ahc0 bus 0 target 5 lun 0
da2: Fixed Direct Access SCSI-2 device
da2: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da2: 4341MB (8890760 512 byte sectors: 255H 63S/T 553C)
da3 at ahc0 bus 0 target 6 lun 0
da3: Fixed Direct Access SCSI-2 device
da3: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing
Enabled
da3: 4341MB (8890760 512 byte sectors: 255H 63S/T 553C)
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-2 device
da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da0: 4350MB (8910423 512 byte sectors: 255H 63S/T 554C)
da4 at ahc0 bus 0 target 8 lun 0
da4: Fixed Direct Access SCSI-3 device
da4: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da4: 17501MB (35843670 512 byte sectors: 255H 63S/T 2231C)
da1 at ahc0 bus 0 target 2 lun 0
da1: Fixed Direct Access SCSI-2 device
da1: 20.000MB/s transfers (10.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da1: 4095MB (8386733 512 byte sectors: 255H 63S/T 522C)
da8 at ahc0 bus 0 target 12 lun 0
da8: Fixed Direct Access SCSI-3 device
da8: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da8: 17366MB (35566000 512 byte sectors: 255H 63S/T 2213C)
da5 at ahc0 bus 0 target 9 lun 0
da5: Fixed Direct Access SCSI-2 device
da5: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da5: 17366MB (35566480 512 byte sectors: 255H 63S/T 2213C)
da7 at ahc0 bus 0 target 11 lun 0
da7: Fixed Direct Access SCSI-2 device
da7: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da7: 17366MB (35566480 512 byte sectors: 255H 63S/T 2213C)
da6 at ahc0 bus 0 target 10 lun 0
da6: Fixed Direct Access SCSI-2 device
da6: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da6: 17366MB (35566480 512 byte sectors: 255H 63S/T 2213C)
hanging root device to da0s1a
WARNING: / was not properly dismounted
=======

The QUANTUMs above have been in the machine for >1yr now and have worked,
from what I can tell, without a hitch...it's only since we added da4+ that
we've been seeing the problems...interaction problem between drive
manufacturers? I seem to recall this sort of thing being a problem on
older drives, but...

Marc G.
Fournier ICQ#7615664 IRC Nick: Scrappy
Systems Administrator @ hub.org
primary: scrappy@hub.org secondary: scrappy@{freebsd|postgresql}.org

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message