From owner-freebsd-questions@freebsd.org Wed Oct 14 18:56:06 2015
Delivered-To: freebsd-questions@mailman.ysv.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1])
	by mailman.ysv.freebsd.org (Postfix) with ESMTP id 291E0A1511B;
	Wed, 14 Oct 2015 18:56:06 +0000 (UTC)
	(envelope-from juan@inti.gob.ar)
Received: from sbg-out.inti.gob.ar (sbg-out.inti.gob.ar [200.10.161.69])
	by mx1.freebsd.org (Postfix) with ESMTP id A9CBF157C;
	Wed, 14 Oct 2015 18:56:05 +0000 (UTC)
	(envelope-from juan@inti.gob.ar)
X-AuditID: c80aa145-f79586d000001bfa-ec-561ea1b8e6aa
Received: from [200.10.161.55] (jb.inti.gob.ar [200.10.161.55])
	(using TLS with cipher DHE-RSA-AES128-SHA (128/128 bits))
	(Client did not present a certificate)
	by sbg-out.inti.gob.ar (SMTP_INTI) with SMTP id 02.43.07162.8B1AE165;
	Wed, 14 Oct 2015 15:40:56 -0300 (ART)
Subject: Re: zfs pool ssd cache drive dropping off
To: freebsd-questions@freebsd.org
From: Juan Bernhard
Message-ID: <561EA1BA.2030402@inti.gob.ar>
Date: Wed, 14 Oct 2015 15:40:58 -0300
User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.2.0
MIME-Version: 1.0
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 8bit
X-BeenThere: freebsd-questions@freebsd.org
X-Mailman-Version: 2.1.20
Precedence: list
List-Id: User questions
X-List-Received-Date: Wed, 14 Oct 2015 18:56:06 -0000

On 14/10/2015 at 02:24 p.m., cruxpot wrote:
> I recently added a Crucial 64GB SSD drive that I had lying around to my zfs
> pool. unfortunately, it keeps dropping off and I'm not sure why. The drive
> wasn't failed when I removed it from an old laptop. It has happened twice
> and only system restart brings it back. Here are the log messages, they
> repeat but here is the base mess:
>
>
> zpool status
>   pool: zrewt
>  state: ONLINE
> status: One or more devices has been removed by the administrator.
>         Sufficient replicas exist for the pool to continue functioning in a
>         degraded state.
> action: Online the device using 'zpool online' or replace the device with
>         'zpool replace'.
>   scan: none requested
> config:
>
>         NAME                      STATE     READ WRITE CKSUM
>         zrewt                     ONLINE       0     0     0
>           raidz1-0                ONLINE       0     0     0
>             ada0                  ONLINE       0     0     0
>             ada1                  ONLINE       0     0     0
>             ada2                  ONLINE       0     0     0
>             ada3                  ONLINE       0     0     0
>         cache
>           16818205039835910221    REMOVED      0     0     0  was /dev/ada4
>
> errors: No known data errors
>
> kernel:
> Trying to mount root from zfs:zrewt []...
> ahcich4: Timeout on slot 0 port 0
> ahcich4: is 00000000 cs 00000000 ss 00000001 rs 00000001 tfd 40 serr
> 00000000 cmd 0004c017
> (ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 38 78 2f 05 40 00 00 00
> 00 00 00
> (ada4:ahcich4:0:0:0): CAM status: Command timeout
> (ada4:ahcich4:0:0:0): Retrying command
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Timeout on slot 1 port 0
> ahcich4: is 00000000 cs 00000002 ss 00000000 rs 00000002 tfd 80 serr
> 00000000 cmd 0004c117
> (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY.
> ACB: ec 00 00 00 00 40 00 00 00 00
> 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Retrying command
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Timeout on slot 2 port 0
> ahcich4: is 00000000 cs 00000004 ss 00000000 rs 00000004 tfd 80 serr
> 00000000 cmd 0004c217
> (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00
> 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Timeout on slot 3 port 0
> ahcich4: is 00000000 cs 00000008 ss 00000000 rs 00000008 tfd 80 serr
> 00000000 cmd 0004c317
> (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00
> 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Error 5, Retry was blocked
> ada4 at ahcich4 bus 0 scbus6 target 0 lun 0
> ada4: s/n 0000000011290314E425 detached
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Timeout on slot 4 port 0
> ahcich4: is 00000000 cs 00000010 ss 00000000 rs 00000010 tfd 80 serr
> 00000000 cmd 0004c417
> (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00
> 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Retrying command
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Timeout on slot 5 port 0
> ahcich4: is 00000000 cs 00000020 ss 00000000 rs 00000020 tfd 80 serr
> 00000000 cmd 0004c517
> (aprobe0:ahcich4:0:0:0): ATA_IDENTIFY.
> ACB: ec 00 00 00 00 40 00 00 00 00
> 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Poll timeout on slot 7 port 0
> ahcich4: is 00000000 cs 00000080 ss 00000000 rs 00000080 tfd 80 serr
> 00000000 cmd 0004c717
> (aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
> ahcich4: Timeout on slot 8 port 0
> ahcich4: is 00000000 cs 00000100 ss 00000000 rs 00000100 tfd 80 serr
> 00000000 cmd 0004c817
> (ada4:ahcich4:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00
> 00 00 00 00 00
> (ada4:ahcich4:0:0:0): CAM status: Command timeout
> (ada4:ahcich4:0:0:0): Error 5, Periph was invalidated
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Poll timeout on slot 10 port 0
> ahcich4: is 00000000 cs 00000400 ss 00000000 rs 00000400 tfd 80 serr
> 00000000 cmd 0004ca17
> (aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
> ahcich4: Timeout on slot 11 port 0
> ahcich4: is 00000000 cs 00000800 ss 00000800 rs 00000800 tfd 80 serr
> 00000000 cmd 0004cb17
> (ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 38 78 2f 05 40 00 00 00
> 00 00 00
> (ada4:ahcich4:0:0:0): CAM status: Command timeout
> (ada4:ahcich4:0:0:0): Error 5, Periph was invalidated
> (ada4:ahcich4:0:0:0): Periph destroyed
> ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
> ahcich4: Poll timeout on slot 13 port 0
> ahcich4: is 00000000 cs 00002000 ss 00000000 rs 00002000 tfd 80 serr
> 00000000 cmd 0004cd17
> (aprobe0:ahcich4:0:0:0): NOP.
> ACB: 00 00 00 00 00 00 00 00 00 00 00 00
> (aprobe0:ahcich4:0:0:0): CAM status: Command timeout
> (aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted

The SSD is taking more than 31 seconds to respond: every AHCI reset in the
log gives up with "device not ready after 31000ms". Try using it as a
regular disk and run some benchmarks on it to test it under load. If the
disk was working in another computer, check the cable and the SATA port.

Regards,
Juan
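
For anyone who wants to check the same thing in their own logs, here is a rough sketch of how the 31-second figure can be read out of the kernel messages quoted above. It is only an illustration: the function name `summarize_ahci_log` and the `SAMPLE` text are made up for this example, and the patterns are written against the exact message wording shown in this thread, which may differ on other FreeBSD versions.

```python
import re

# Matches lines such as:
#   ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
RESET_RE = re.compile(
    r"(?P<chan>ahcich\d+): AHCI reset: device not ready after (?P<ms>\d+)ms"
)
# Matches lines such as:
#   ahcich4: Timeout on slot 0 port 0
#   ahcich4: Poll timeout on slot 7 port 0
TIMEOUT_RE = re.compile(
    r"(?P<chan>ahcich\d+): (?:Poll timeout|Timeout) on slot \d+"
)

# A few lines copied from the log in this thread (hypothetical sample input).
SAMPLE = """\
ahcich4: Timeout on slot 0 port 0
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Poll timeout on slot 7 port 0
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
"""


def summarize_ahci_log(text):
    """Count command timeouts and failed resets per AHCI channel.

    Assumes one kernel message per line; a leading "> " quote marker
    is harmless because the patterns are searched, not anchored.
    """
    summary = {}
    for line in text.splitlines():
        reset = RESET_RE.search(line)
        timeout = TIMEOUT_RE.search(line)
        match = reset or timeout
        if match is None:
            continue  # not an AHCI timeout/reset message
        entry = summary.setdefault(
            match.group("chan"),
            {"timeouts": 0, "failed_resets": 0, "max_wait_ms": 0},
        )
        if reset:
            entry["failed_resets"] += 1
            entry["max_wait_ms"] = max(entry["max_wait_ms"], int(reset.group("ms")))
        else:
            entry["timeouts"] += 1
    return summary


if __name__ == "__main__":
    for chan, stats in summarize_ahci_log(SAMPLE).items():
        print(chan, stats)
```

If all the failed resets land on a single channel (here `ahcich4`), that points at that one port, cable or drive rather than at the controller as a whole, which is why swapping the cable and the SATA port is a sensible first test.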