Date: Fri, 2 Jul 1999 11:46:41 -0500 (CDT) From: Joe Greco <jgreco@ns.sol.net> To: ken@plutotech.com (Kenneth D. Merry) Cc: scsi@freebsd.org Subject: Re: FreeBSD panics with Mylex DAC960SX Message-ID: <199907021646.LAA77311@aurora.sol.net> In-Reply-To: <199907011954.NAA44126@panzer.kdm.org> from "Kenneth D. Merry" at "Jul 1, 1999 1:54:18 pm"
next in thread | previous in thread | raw e-mail | index | archive | help
> Thanks for all the work on this! I talked to Justin for a minute, and I > think we've figured out what the problem is. >=20 > It's a little more complicated than this, but the simple explanation is > that we aren't doing the right thing when a command comes back with just a > sense key and no ASC or ASCQ. It's hard to believe we haven't run into > this before, but I think that's the problem. >=20 > Try applying the attached patch to scsi_all.c. It isn't the final patch > for this problem, the solution is probably a little more complicated than > this. But hopefully this will let us know whether the problem is what we > think it is. ~ Hit [Enter] to boot immediately, or any other key for command prompt. =0DBooting [kernel] in 14 seconds...=20 Type '?' for a list of commands, 'help' for more detailed help. disk1s1a:> boot -s /=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\= =08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08/kernel = text=3D0x10e9fc |=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-= =08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08= /=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\= =08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08= -=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08= /=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\= =08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08= -=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08dat= a=3D0x16124+0x1de98 |=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08syms=3D= [0x4+0x1de20|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08+0x= 4+0x20077\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08\=08|=08/=08-=08] Copyright (c) 1992-1999 FreeBSD Inc. Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. FreeBSD 3.1-RELEASE #5: Fri Jul 2 11:37:31 CDT 1999 root@:/usr/src/sys/compile/SPOOL-DDB Timecounter "i8254" frequency 1193182 Hz CPU: Pentium II/Xeon/Celeron (686-class CPU) Origin =3D "GenuineIntel" Id =3D 0x652 Stepping=3D2 Features=3D0x183fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE= ,MCA,CMOV,PAT,PSE36,MMX,<b24>> real memory =3D 536870912 (524288K bytes) avail memory =3D 519999488 (507812K bytes) Programming 24 pins in IOAPIC #0 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 1, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 0, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xf0283000. Probing for devices on PCI bus 0: chip0: <Intel 82443BX host to PCI bridge> rev 0x03 on pci0.0.0 chip1: <Intel 82443BX host to AGP bridge> rev 0x03 on pci0.1.0 chip2: <Intel 82371AB PCI to ISA bridge> rev 0x02 on pci0.4.0 chip3: <Intel 82371AB Power management controller> rev 0x02 on pci0.4.3 ahc0: <Adaptec aic7890/91 Ultra2 SCSI adapter> rev 0x00 int a irq 19 on pci= 0.6.0 ahc0: aic7890/91 Wide Channel A, SCSI Id=3D7, 16/255 SCBs chip4: <PCI to PCI bridge (vendor=3D1011 device=3D0024)> rev 0x03 on pci0.1= 0.0 Probing for devices on PCI bus 1: Probing for devices on PCI bus 2: de0: <Digital 21140A Fast Ethernet> rev 0x22 int a irq 18 on pci2.4.0 de0: SMC 9332BDT 21140A [10-100Mb/s] pass 2.2 de0: address 00:e0:29:2b:e1:08 de1: <Digital 21140A Fast Ethernet> rev 0x22 int a irq 19 on pci2.5.0 de1: SMC 9332BDT 21140A [10-100Mb/s] pass 2.2 de1: address 00:e0:29:2b:e1:09 Probing for devices on the ISA bus: sc0 on isa sc0: VGA color <16 virtual consoles, flags=3D0x0> ed0 not found at 0x280 atkbdc0 at 0x60-0x6f on motherboard atkbd0 irq 1 on isa psm0 not found sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa sio0: type 16550A, console sio1 at 0x2f8-0x2ff irq 3 on isa sio1: type 16550A sio2: configured irq 5 not in bitmap of probed irqs 0 sio2 not found at 0x3e8 sio3: configured irq 9 not in bitmap of probed irqs 0 sio3 not found at 0x2e8 fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa fdc0: FIFO enabled, 8 bytes threshold fd0: 1.44MB 3.5in ppc0 at 0x378 irq 7 on isa ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/9 bytes threshold nlpt0: <generic printer> on ppbus 0 nlpt0: Interrupt-driven port ppi0: <generic parallel i/o> on ppbus 0 plip0: <PLIP network interface> on ppbus 0 vga0 at 0x3b0-0x3df maddr 0xa0000 msize 131072 on isa npx0 on motherboard npx0: INT 16 interface we0 at 0x2e8 on isa we0: kernel is keeping watchdog alive APIC_IO: Testing 8254 interrupt delivery APIC_IO: routing 8254 via pin 2 IP packet filtering initialized, divert disabled, rule-based forwarding dis= abled, logging limited to 100 packets/entry Waiting 2 seconds for SCSI devices to settle SMP: AP CPU #1 Launched! de1: enabling 100baseTX port changing root device t(da0:ahc0:0:0:0): about to print announcement da0 at ahc0 bus 0 target 0 lun 0 da0: <SEAGATE ST34371W 0484> Fixed Direct Access SCSI-2 device=20 da0: 40.0MB/s transfers (20.0MHz, offset 15, 16bit), Tagged Queueing Enabled da0: 4148MB (8496884 512 byte sectors: 255H 63S/T 528C) (da0:ahc0:0:0:0): printed announcement o da0s1a (da0:ahc0:0:0:0): read capacity returned 0 (da1:ahc0:0:1:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da1:ahc0:0:1:0): NOT READY (da1:ahc0:0:1:0): fatal error, failed to attach to device (da1:ahc0:0:1:0): lost device (da1:ahc0:0:1:0): about to print announcement (da1:ahc0:0:1:0): printed announcement (da1:ahc0:0:1:0): removing device entry (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 Enter full pathname of shell or RETURN for /bin/sh:=20 erase ^H, kill ^U, intr ^C /sbin/camcontrol cmd -n da -u 1 -v -c 25 0 0 0 0 0 0 0 0 0 -i 8 i4 i4 camcontrol: cam_lookup_pass: CAMGETPASSTHRU ioctl failed cam_lookup_pass: No such file or directory cam_lo(okup_pdass: either the apass driver isn'0t in your kernel: cam_lookup_pasas: or da1 doesn'ht exist end of ccamcontrol 0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 (da0:ahc0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 /dev/rda0s1a: (FILESYSTEM CLEANd; SKIPPING CHECKaS /dev/rda0s1a: 0clean, 127240 fr:ee (296 frags, 1a5868 blocks, 0.2h% fragmen= tation)c Can't open /de0v/rda1s1e: Devic:e not configured0 /dev/rda1s1e: :CAN'T CHECK FILE0 SYSTEM. /dev/r:da1s1e: UNEXPECT0ED INCONSISTENCY); RUN fsck MANUA:LLY. read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 (da0:ahc0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 /dev/rda0s1h: (FILESYSTEM CLEANd; SKIPPING CHECKaS 0/dev/rda0s1h: :clean, 232315 fraee (131 frags, 2h9023 blocks, 0.1c% fragme= ntation)0 :0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 (da0:ahc0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 /dev/rda0s1e: (FILESYSTEM CLEANd; SKIPPING CHECKaS 0/dev/rda0s1e: :clean, 51019 freae (8139 frags, 5d360 blocks, 4.1%e fragmen= tation)=0D0 : autosense failed: cable problem? hc0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 (da0:ahc0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 /dev/rda0s1f: (FILESYSTEM CLEANd; SKIPPING CHECKaS /dev/rda0s1f: 0clean, 112455 fr:ee (239 frags, 1a4027 blocks, 0.2h% fragmen= tation)c 0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 (da0:ahc0:0:0:0): read capacity returned 0 (da0:ahc0:0:0:0): READ CAPACITY. CDB: 25 0 0 0 0 0 0 0 0 0=20 (da0:ahc0:0:0:0): NOT READY (da0:ahc0:0:0:0): address =3D 8496883, length =3D 512 /dev/rda0s1g: FILESYSTEM CLEAN; SKIPPING CHECKS /dev/rda0s1g: clean, 1308429 free (309 frags, 163515 blocks, 0.0% fragmenta= tion) THE FOLLOWING FILE SYSTEM HAD AN UNEXPECTED INCONSISTENCY: /dev/rda1s1e (/news) #=20 I take it the "fatal error, failed to attach to device" is what you were trying for? > You should be able to boot okay with this patch, although you probably > won't be able to fsck or mount the Mylex array until it's ready to run. I would expect that. If you have an elegant (or correct) solution to deal with this - for me, at least, preferably from userland - I'm all ears. What I'm thinking is just sitting there querying the thing every few seconds until it reports a size, then have CAM re-query the device. Does that seem reasonable? I'm actually not interested in a solution more complex than that because in the event the RAID fails, it does the same sort of thing, and I'd like to be able to get into the machine from remote and talk to the Mylex. ... Joe ---------------------------------------------------------------------------= ---- Joe Greco - Systems Administrator jgreco@ns.sol.net Solaria Public Access UNIX - Milwaukee, WI 414/342-4847 To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-scsi" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?199907021646.LAA77311>