From: cruxpot
To: freebsd-questions@freebsd.org
Date: Wed, 14 Oct 2015 12:24:13 -0500
Subject: zfs pool ssd cache drive dropping off

I recently added a Crucial 64GB SSD that I had lying around to my zfs pool as a cache device. Unfortunately, it keeps dropping off and I'm not sure why. The drive had not failed when I removed it from an old laptop. This has happened twice, and only a system restart brings the drive back.

Here are the zpool status output and the kernel log messages. The messages repeat, but this is the basic sequence:

zpool status
  pool: zrewt
 state: ONLINE
status: One or more devices has been removed by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
  scan: none requested
config:

        NAME                      STATE     READ WRITE CKSUM
        zrewt                     ONLINE       0     0     0
          raidz1-0                ONLINE       0     0     0
            ada0                  ONLINE       0     0     0
            ada1                  ONLINE       0     0     0
            ada2                  ONLINE       0     0     0
            ada3                  ONLINE       0     0     0
        cache
          16818205039835910221    REMOVED      0     0     0  was /dev/ada4

errors: No known data errors

kernel: Trying to mount root from zfs:zrewt []...
ahcich4: Timeout on slot 0 port 0
ahcich4: is 00000000 cs 00000000 ss 00000001 rs 00000001 tfd 40 serr 00000000 cmd 0004c017
(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 38 78 2f 05 40 00 00 00 00 00 00
(ada4:ahcich4:0:0:0): CAM status: Command timeout
(ada4:ahcich4:0:0:0): Retrying command
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Timeout on slot 1 port 0
ahcich4: is 00000000 cs 00000002 ss 00000000 rs 00000002 tfd 80 serr 00000000 cmd 0004c117
(aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Retrying command
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Timeout on slot 2 port 0
ahcich4: is 00000000 cs 00000004 ss 00000000 rs 00000004 tfd 80 serr 00000000 cmd 0004c217
(aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Timeout on slot 3 port 0
ahcich4: is 00000000 cs 00000008 ss 00000000 rs 00000008 tfd 80 serr 00000000 cmd 0004c317
(aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Error 5, Retry was blocked
ada4 at ahcich4 bus 0 scbus6 target 0 lun 0
ada4: s/n 0000000011290314E425 detached
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Timeout on slot 4 port 0
ahcich4: is 00000000 cs 00000010 ss 00000000 rs 00000010 tfd 80 serr 00000000 cmd 0004c417
(aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Retrying command
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Timeout on slot 5 port 0
ahcich4: is 00000000 cs 00000020 ss 00000000 rs 00000020 tfd 80 serr 00000000 cmd 0004c517
(aprobe0:ahcich4:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Poll timeout on slot 7 port 0
ahcich4: is 00000000 cs 00000080 ss 00000000 rs 00000080 tfd 80 serr 00000000 cmd 0004c717
(aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
ahcich4: Timeout on slot 8 port 0
ahcich4: is 00000000 cs 00000100 ss 00000000 rs 00000100 tfd 80 serr 00000000 cmd 0004c817
(ada4:ahcich4:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00
(ada4:ahcich4:0:0:0): CAM status: Command timeout
(ada4:ahcich4:0:0:0): Error 5, Periph was invalidated
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Poll timeout on slot 10 port 0
ahcich4: is 00000000 cs 00000400 ss 00000000 rs 00000400 tfd 80 serr 00000000 cmd 0004ca17
(aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted
ahcich4: Timeout on slot 11 port 0
ahcich4: is 00000000 cs 00000800 ss 00000800 rs 00000800 tfd 80 serr 00000000 cmd 0004cb17
(ada4:ahcich4:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 38 78 2f 05 40 00 00 00 00 00 00
(ada4:ahcich4:0:0:0): CAM status: Command timeout
(ada4:ahcich4:0:0:0): Error 5, Periph was invalidated
(ada4:ahcich4:0:0:0): Periph destroyed
ahcich4: AHCI reset: device not ready after 31000ms (tfd = 00000080)
ahcich4: Poll timeout on slot 13 port 0
ahcich4: is 00000000 cs 00002000 ss 00000000 rs 00002000 tfd 80 serr 00000000 cmd 0004cd17
(aprobe0:ahcich4:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
(aprobe0:ahcich4:0:0:0): CAM status: Command timeout
(aprobe0:ahcich4:0:0:0): Error 5, Retries exhausted