Date: Fri, 08 Nov 2013 17:30:29 -0600 From: dweimer <dweimer@dweimer.net> To: freebsd-questions@freebsd.org Subject: Re: Confirmation Of Drive Failure Message-ID: <8f6c8132c42415d4dc955e24403beeb3@dweimer.net> In-Reply-To: <BLU0-SMTP153A32DBB7E8902E8FE03B3B3F20@phx.gbl> References: <BLU0-SMTP153A32DBB7E8902E8FE03B3B3F20@phx.gbl>
next in thread | previous in thread | raw e-mail | index | archive | help
On 11/08/2013 5:07 pm, Drew Tomlinson wrote: > I've been running FBSD on this home server for about 13 years. Finally > after a power outage, it will no longer boot. > > The main drives were two SCSI drives striped using gstripe. I am > fairly certain da0 is dead and the reason it won't boot. > > I know there was a working IDE or IDE via firewire enclosure drive > before the crash. I had backups on that drive made from Bacula and > I'm hoping to be able to recover them. > > If I'm interpreting all of the below dmesg output correctly, I think I > should have an ad1 drive that I can mount. I'm hoping someone can > confirm or deny that and help me get it mounted if ada1 should be > there. > > My version of FBSD on the box was 6.4. I am now booted from the 9.2 > Live CD. > > I'm checking dmesg to see what devices are seen: > > da0 at ahc0 bus 0 scbus2 target 0 lun 0 > da0: <SEAGATE SX19171W 9D32> Fixed Direct Access SCSI-2 device > da0: 11.626MB/s transfers (5.813MHz, offset 8, 16bit) > da0: Command Queueing enabled > da0: 8683MB (17783112 512 byte sectors: 64H 32S/T 8683C) > da1 at ahc0 bus 0 scbus2 target 2 lun 0 > da1: <SEAGATE SX19171W 9D32> Fixed Direct Access SCSI-2 device > da1: 11.626MB/s transfers (5.813MHz, offset 8, 16bit) > da1: Command Queueing enabled > da1: 8683MB (17783112 512 byte sectors: 64H 32S/T 8683C) > cd0 at ata1 bus 0 scbus1 target 0 lun 0 > cd0: <HITACHI CDR-8435 0010> Removable CD-ROM SCSI-0 device > cd0: 16.700MB/s transfers (WDMA2, ATAPI 12bytes, PIO 65534bytes) > cd0: cd present [274042 x 2048 byte records] > ada0 at ata0 bus 0 scbus0 target 0 lun 0 > ada0: <GENERIC GENERIC A08.1500> ATA-5 device > ada0: 33.300MB/s transfers (UDMA2, PIO 8192bytes) > ada0: 76319MB (156301488 512 byte sectors: 15H 63S/T 16383C) > ada0: Previously was known as ad0 > ada1 at ata0 bus 0 scbus0 target 1 lun 0 > ada1: <WDC WD800AB 03.06A> ATA-0 device > ada1: 3.300MB/s transfers (PIO0, PIO 8192bytes) > ada1: 0MB (0 512 byte sectors: 16H 63S/T 16383C) > ada1: Previously was known as ad1 > SMP: AP CPU #1 Launched! > > Thus if I'm reading this right, it's seeing two internal IDE drives, a > CD drive, and two SCSI drives? > > Although I'm booted from the CD drive, apparently it is having > problems based upon many of these messages: > > (cd0:ata1:0:0:0): READ(10). CDB: 28 00 00 04 2e 78 00 00 01 00 > (cd0:ata1:0:0:0): CAM status: SCSI Status Error > (cd0:ata1:0:0:0): SCSI status: Check Condition > (cd0:ata1:0:0:0): SCSI sense: ILLEGAL REQUEST asc:64,0 (Illegal mode > for this track) > (cd0:ata1:0:0:0): Info: 0x42e78 > (cd0:ata1:0:0:0): Error 6, Unretryable error > (cd0:ata1:0:0:0): cddone: got error 0x6 back > (cd0:ata1:0:0:0): READ(10). CDB: 28 00 00 04 2e 78 00 00 01 00 > > Then lots of these messages which tells me ada0 drives is dead? > > (ada0:ata0:0:0:0): READ_DMA. ACB: c8 00 80 00 00 40 00 00 00 00 10 00 > (ada0:ata0:0:0:0): CAM status: ATA Status Error > (ada0:ata0:0:0:0): ATA status: 51 (DRDY SERV ERR), error: 40 (UNC ) > (ada0:ata0:0:0:0): RES: 51 40 80 00 00 00 00 00 00 10 00 > (ada0:ata0:0:0:0): Retrying command > > Then it looks like there is hope for the stripe: > > GEOM_STRIPE: Device data created (id=2880277341). > GEOM_STRIPE: Disk da0s1d attached to data. > GEOM_STRIPE: Disk da1s1d attached to data. > GEOM_STRIPE: Device stripe/data activated. > > But then no hope as this sequence repeats itself: > > GEOM_STRIPE: Disk da0s1d removed from data. > GEOM_STRIPE: Device stripe/data deactivated. > GEOM_STRIPE: Disk da0s1d attached to data. > GEOM_STRIPE: Device stripe/data activated. > GEOM_STRIPE: Disk da0s1d removed from data. > GEOM_STRIPE: Device stripe/data deactivated > > Then I load the sbp module to see if I have any drives in the firewire > enclosure: > > fwohci0: <VIA Fire II (VT6306)> port 0x1c00-0x1c7f mem > 0xfc104000-0xfc1047ff irq > 16 at device 10.0 on pci0 > fwohci0: OHCI version 1.0 (ROM=1) > fwohci0: No. of Isochronous channels is 8. > fwohci0: EUI64 00:40:63:00:00:00:07:ff > fwohci0: Phy 1394a available S400, 3 ports. > fwohci0: Link S400, max_rec 2048 bytes. > firewire0: <IEEE1394(FireWire) bus> on fwohci0 > fwohci0: Initiate bus reset > fwohci0: fwohci_intr_core: BUS reset > fwohci0: fwohci_intr_core: node_id=0x00000001, SelfID Count=1, > CYCLEMASTER mode > firewire0: 2 nodes, maxhop <= 1 cable IRM irm(1) (me) > firewire0: bus manager 1 > firewire0: fw_explore_node: Pre 1394a-2000 detected > firewire0: New S400 device ID:0030e002ee4000a6 > sbp0: <SBP-2/SCSI over FireWire> on firewire0 > sbp0: sbp_show_sdev_info: sbp0:0:0: ordered:1 type:14 > EUI:0030e002ee4000a6 node:0 speed:2 maxrec:8 > sbp0: sbp_show_sdev_info: sbp0:0:0 'Oxford ' '911 ' '000037' > sbp0: sbp_show_sdev_info: sbp0:0:1: ordered:1 type:14 > EUI:0030e002ee4000a6 node:0 speed:2 maxrec:8 > sbp0: sbp_show_sdev_info: sbp0:0:1 'Oxford ' '911 ' '000037' > sbp0: sbp_timeout:sbp0:0:0 request timeout(cmd orb:0x282fa154) ... > agent reset > (probe0:sbp0:0:0:0): INQUIRY. CDB: 12 00 00 00 24 00 > (probe0:sbp0:0:0:0): CAM status: Command timeout > (probe0:sbp0:0:0:0): Retrying command > > I suspect this means no working drives were found in the firewire > enclosure? > > So I check /dev and see if it sees ad1: > > root@:~ # ll /dev/ad* > lrwxr-xr-x 1 root wheel 4 Nov 2 18:07 /dev/ad0@ -> ada0 > lrwxr-xr-x 1 root wheel 5 Nov 8 13:38 /dev/ad0d@ -> ada0d > lrwxr-xr-x 1 root wheel 4 Nov 2 18:07 /dev/ad1@ -> ada1 > crw-r----- 1 root operator 0x59 Nov 8 13:36 /dev/ada0 > crw-r----- 1 root operator 0x75 Nov 8 13:37 /dev/ada0d > crw-r----- 1 root operator 0x5b Nov 2 18:07 /dev/ada1 > root@:~ # > > So I created /mnt/data and tried to mount ada1: > > root@:~ # mount /dev/ada1 /mnt/data > mount: /dev/ada1: Device not configured > > So then I try: > > root@:~ # bsdlabel /dev/ada1 > bsdlabel: cannot get disk geometry: No such file or directory > > So does this mean I really don't have an ada1 drive? Or is there some > step I'm missing to make it accessible. > > I really appreciate you reading this far and any help you might give. > > Cheers, > > Drew > > > > > _______________________________________________ > freebsd-questions@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to > "freebsd-questions-unsubscribe@freebsd.org" ada1 at ata0 bus 0 scbus0 target 1 lun 0 ada1: <WDC WD800AB 03.06A> ATA-0 device ada1: 3.300MB/s transfers (PIO0, PIO 8192bytes) <<== really 3.3MB/S, that's slower than it should be. ada1: 0MB (0 512 byte sectors: 16H 63S/T 16383C) <<== Shows 0MB size, that's Definitely not right. ada1: Previously was known as ad1 I think either the IDE drive is bad as well, or its not compatible with the IDE controller. -- Thanks, Dean E. Weimer http://www.dweimer.net/
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?8f6c8132c42415d4dc955e24403beeb3>