Date: Wed, 14 Oct 2015 15:40:58 -0300 From: Juan Bernhard <juan@inti.gob.ar> To: freebsd-questions@freebsd.org Subject: Re: zfs pool ssd cache drive dropping off Message-ID: <561EA1BA.2030402@inti.gob.ar> In-Reply-To: <CAPYfQ9yn4Xvcv09rKedXA5J2MGnpKhKiWbhBw1mUP03C%2By5EkQ@mail.gmail.com>
index | next in thread | previous in thread | raw e-mail
El 14/10/2015 a las 02:24 p.m., cruxpot escribió: > I recently added a Crucial 64GB SSD drive that I had lying around to my zfs > pool. unfortunately, it keeps dropping off and I'm not sure why. The drive > wasn't failed when I removed it from an old laptop. It has happened twice > and only system restart brings it back. Here are the log messages, they > repeat but here is the base mess: > > > zpool status > pool: zrewt > state: ONLINE > status: One or more devices has been removed by the administrator. > Sufficient replicas exist for the pool to continue functioning in a > degraded state. > action: Online the device using 'zpool online' or replace the device with > 'zpool replace'. > scan: none requested > config: > > NAME STATE READ WRITE CKSUM > zrewt ONLINE 0 0 0 > raidz1-0 ONLINE 0 0 0 > ada0 ONLINE 0 0 0 > ada1 ONLINE 0 0 0 > ada2 ONLINE 0 0 0 > ada3 ONLINE 0 0 0 > cache > 16818205039835910221 REMOVED 0 0 0 was /dev/ada4 > > errors: No known data errors > > kernel: > Trying to mount root from zfs:zrewt []... > ahcich4: Timeout on slot 0 port 0 > ahcich4: is 00000000 cs 00000000 ss 00000001 rs 00000001 tfd 40 serr > 00000000 cmd 0004c017 > (ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 38 78 2f 05 40 00 00 00 > 00 00 00 > (ada4:ahcich4:0:0:0): CAM status: Command timeout > (ada4:ahcich4:0:0:0): Retrying command > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Timeout on slot 1 port 0 > ahcich4: is 00000000 cs 00000002 ss 00000000 rs 00000002 tfd 80 serr > 00000000 cmd 0004c117 > (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 > 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Retrying command > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Timeout on slot 2 port 0 > ahcich4: is 00000000 cs 00000004 ss 00000000 rs 00000004 tfd 80 serr > 00000000 cmd 0004c217 > (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 > 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Timeout on slot 3 port 0 > ahcich4: is 00000000 cs 00000008 ss 00000000 rs 00000008 tfd 80 serr > 00000000 cmd 0004c317 > (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 > 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Error 5, Retry was blocked > ada4 at ahcich4 bus 0 scbus6 target 0 lun 0 > ada4: <M4-CT064M4SSD2 0009> s/n 0000000011290314E425 detached > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Timeout on slot 4 port 0 > ahcich4: is 00000000 cs 00000010 ss 00000000 rs 00000010 tfd 80 serr > 00000000 cmd 0004c417 > (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 > 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Retrying command > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Timeout on slot 5 port 0 > ahcich4: is 00000000 cs 00000020 ss 00000000 rs 00000020 tfd 80 serr > 00000000 cmd 0004c517 > (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 > 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Poll timeout on slot 7 port 0 > ahcich4: is 00000000 cs 00000080 ss 00000000 rs 00000080 tfd 80 serr > 00000000 cmd 0004c717 > (aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted > ahcich4: Timeout on slot 8 port 0 > ahcich4: is 00000000 cs 00000100 ss 00000000 rs 00000100 tfd 80 serr > 00000000 cmd 0004c817 > (ada4:ahcich4:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 > 00 00 00 00 00 > (ada4:ahcich4:0:0:0): CAM status: Command timeout > (ada4:ahcich4:0:0:0): Error 5, Periph was invalidated > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Poll timeout on slot 10 port 0 > ahcich4: is 00000000 cs 00000400 ss 00000000 rs 00000400 tfd 80 serr > 00000000 cmd 0004ca17 > (aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted > ahcich4: Timeout on slot 11 port 0 > ahcich4: is 00000000 cs 00000800 ss 00000800 rs 00000800 tfd 80 serr > 00000000 cmd 0004cb17 > (ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 38 78 2f 05 40 00 00 00 > 00 00 00 > (ada4:ahcich4:0:0:0): CAM status: Command timeout > (ada4:ahcich4:0:0:0): Error 5, Periph was invalidated > (ada4:ahcich4:0:0:0): Periph destroyed > ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080) > ahcich4: Poll timeout on slot 13 port 0 > ahcich4: is 00000000 cs 00002000 ss 00000000 rs 00002000 tfd 80 serr > 00000000 cmd 0004cd17 > (aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00 > (aprobe0:ahcich4:0:0:0): CAM status: Command timeout > (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted The SSD takes 31 seconds to respond. Try to use it as a regular disk, run some bechmarcks on it to test it with load. If the disk was working on another computer, che the cable and the sata port. Saludos, Juanhome | help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?561EA1BA.2030402>
