Date: Fri, 8 Nov 2013 15:07:14 -0800 From: Drew Tomlinson <drew@mykitchentable.net> To: freebsd-questions@freebsd.org Subject: Confirmation Of Drive Failure Message-ID: <BLU0-SMTP153A32DBB7E8902E8FE03B3B3F20@phx.gbl>
next in thread | raw e-mail | index | archive | help
I've been running FBSD on this home server for about 13 years. Finally after a power outage, it will no longer boot. The main drives were two SCSI drives striped using gstripe. I am fairly certain da0 is dead and the reason it won't boot. I know there was a working IDE or IDE via firewire enclosure drive before the crash. I had backups on that drive made from Bacula and I'm hoping to be able to recover them. If I'm interpreting all of the below dmesg output correctly, I think I should have an ad1 drive that I can mount. I'm hoping someone can confirm or deny that and help me get it mounted if ada1 should be there. My version of FBSD on the box was 6.4. I am now booted from the 9.2 Live CD. I'm checking dmesg to see what devices are seen: da0 at ahc0 bus 0 scbus2 target 0 lun 0 da0: <SEAGATE SX19171W 9D32> Fixed Direct Access SCSI-2 device da0: 11.626MB/s transfers (5.813MHz, offset 8, 16bit) da0: Command Queueing enabled da0: 8683MB (17783112 512 byte sectors: 64H 32S/T 8683C) da1 at ahc0 bus 0 scbus2 target 2 lun 0 da1: <SEAGATE SX19171W 9D32> Fixed Direct Access SCSI-2 device da1: 11.626MB/s transfers (5.813MHz, offset 8, 16bit) da1: Command Queueing enabled da1: 8683MB (17783112 512 byte sectors: 64H 32S/T 8683C) cd0 at ata1 bus 0 scbus1 target 0 lun 0 cd0: <HITACHI CDR-8435 0010> Removable CD-ROM SCSI-0 device cd0: 16.700MB/s transfers (WDMA2, ATAPI 12bytes, PIO 65534bytes) cd0: cd present [274042 x 2048 byte records] ada0 at ata0 bus 0 scbus0 target 0 lun 0 ada0: <GENERIC GENERIC A08.1500> ATA-5 device ada0: 33.300MB/s transfers (UDMA2, PIO 8192bytes) ada0: 76319MB (156301488 512 byte sectors: 15H 63S/T 16383C) ada0: Previously was known as ad0 ada1 at ata0 bus 0 scbus0 target 1 lun 0 ada1: <WDC WD800AB 03.06A> ATA-0 device ada1: 3.300MB/s transfers (PIO0, PIO 8192bytes) ada1: 0MB (0 512 byte sectors: 16H 63S/T 16383C) ada1: Previously was known as ad1 SMP: AP CPU #1 Launched! Thus if I'm reading this right, it's seeing two internal IDE drives, a CD drive, and two SCSI drives? Although I'm booted from the CD drive, apparently it is having problems based upon many of these messages: (cd0:ata1:0:0:0): READ(10). CDB: 28 00 00 04 2e 78 00 00 01 00 (cd0:ata1:0:0:0): CAM status: SCSI Status Error (cd0:ata1:0:0:0): SCSI status: Check Condition (cd0:ata1:0:0:0): SCSI sense: ILLEGAL REQUEST asc:64,0 (Illegal mode for this track) (cd0:ata1:0:0:0): Info: 0x42e78 (cd0:ata1:0:0:0): Error 6, Unretryable error (cd0:ata1:0:0:0): cddone: got error 0x6 back (cd0:ata1:0:0:0): READ(10). CDB: 28 00 00 04 2e 78 00 00 01 00 Then lots of these messages which tells me ada0 drives is dead? (ada0:ata0:0:0:0): READ_DMA. ACB: c8 00 80 00 00 40 00 00 00 00 10 00 (ada0:ata0:0:0:0): CAM status: ATA Status Error (ada0:ata0:0:0:0): ATA status: 51 (DRDY SERV ERR), error: 40 (UNC ) (ada0:ata0:0:0:0): RES: 51 40 80 00 00 00 00 00 00 10 00 (ada0:ata0:0:0:0): Retrying command Then it looks like there is hope for the stripe: GEOM_STRIPE: Device data created (id=2880277341). GEOM_STRIPE: Disk da0s1d attached to data. GEOM_STRIPE: Disk da1s1d attached to data. GEOM_STRIPE: Device stripe/data activated. But then no hope as this sequence repeats itself: GEOM_STRIPE: Disk da0s1d removed from data. GEOM_STRIPE: Device stripe/data deactivated. GEOM_STRIPE: Disk da0s1d attached to data. GEOM_STRIPE: Device stripe/data activated. GEOM_STRIPE: Disk da0s1d removed from data. GEOM_STRIPE: Device stripe/data deactivated Then I load the sbp module to see if I have any drives in the firewire enclosure: fwohci0: <VIA Fire II (VT6306)> port 0x1c00-0x1c7f mem 0xfc104000-0xfc1047ff irq 16 at device 10.0 on pci0 fwohci0: OHCI version 1.0 (ROM=1) fwohci0: No. of Isochronous channels is 8. fwohci0: EUI64 00:40:63:00:00:00:07:ff fwohci0: Phy 1394a available S400, 3 ports. fwohci0: Link S400, max_rec 2048 bytes. firewire0: <IEEE1394(FireWire) bus> on fwohci0 fwohci0: Initiate bus reset fwohci0: fwohci_intr_core: BUS reset fwohci0: fwohci_intr_core: node_id=0x00000001, SelfID Count=1, CYCLEMASTER mode firewire0: 2 nodes, maxhop <= 1 cable IRM irm(1) (me) firewire0: bus manager 1 firewire0: fw_explore_node: Pre 1394a-2000 detected firewire0: New S400 device ID:0030e002ee4000a6 sbp0: <SBP-2/SCSI over FireWire> on firewire0 sbp0: sbp_show_sdev_info: sbp0:0:0: ordered:1 type:14 EUI:0030e002ee4000a6 node:0 speed:2 maxrec:8 sbp0: sbp_show_sdev_info: sbp0:0:0 'Oxford ' '911 ' '000037' sbp0: sbp_show_sdev_info: sbp0:0:1: ordered:1 type:14 EUI:0030e002ee4000a6 node:0 speed:2 maxrec:8 sbp0: sbp_show_sdev_info: sbp0:0:1 'Oxford ' '911 ' '000037' sbp0: sbp_timeout:sbp0:0:0 request timeout(cmd orb:0x282fa154) ... agent reset (probe0:sbp0:0:0:0): INQUIRY. CDB: 12 00 00 00 24 00 (probe0:sbp0:0:0:0): CAM status: Command timeout (probe0:sbp0:0:0:0): Retrying command I suspect this means no working drives were found in the firewire enclosure? So I check /dev and see if it sees ad1: root@:~ # ll /dev/ad* lrwxr-xr-x 1 root wheel 4 Nov 2 18:07 /dev/ad0@ -> ada0 lrwxr-xr-x 1 root wheel 5 Nov 8 13:38 /dev/ad0d@ -> ada0d lrwxr-xr-x 1 root wheel 4 Nov 2 18:07 /dev/ad1@ -> ada1 crw-r----- 1 root operator 0x59 Nov 8 13:36 /dev/ada0 crw-r----- 1 root operator 0x75 Nov 8 13:37 /dev/ada0d crw-r----- 1 root operator 0x5b Nov 2 18:07 /dev/ada1 root@:~ # So I created /mnt/data and tried to mount ada1: root@:~ # mount /dev/ada1 /mnt/data mount: /dev/ada1: Device not configured So then I try: root@:~ # bsdlabel /dev/ada1 bsdlabel: cannot get disk geometry: No such file or directory So does this mean I really don't have an ada1 drive? Or is there some step I'm missing to make it accessible. I really appreciate you reading this far and any help you might give. Cheers, Drew
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?BLU0-SMTP153A32DBB7E8902E8FE03B3B3F20>