From: Wojciech Puchar
To: freebsd-hackers@freebsd.org
Date: Wed, 13 Feb 2013 18:41:01 +0100 (CET)
Subject: disk errors on heavy write I/O

When doing lots of writes (a large file), I get the errors below after a few tens of gigabytes.
smartctl -t long (full surface test) reports no errors.

My disk is:

ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
ada0: ATA-7 SATA 2.x device
ada0: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
ada0: Command Queueing enabled
ada0: 1430799MB (2930277168 512 byte sectors: 16H 63S/T 16383C)

I have the same drive model at another site and ran into the same situation there once. Any ideas what this is? I cannot find any firmware update.

Feb 13 18:29:20 wojtek kernel: ahcich0: Timeout on slot 25 port 0
Feb 13 18:29:20 wojtek kernel: ahcich0: is 00000000 cs fdffffff ss ffffffff rs ffffffff tfd c0 serr 00000000 cmd 0000da17
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 00 b0 9c bf 40 02 00 00 08 00 00
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): CAM status: Command timeout
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Retrying command
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 10 74 0a 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 d0 31 10 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=17530617856, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 90 ef 15 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=17723260928, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 50 ad 1b 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=17915904000, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 10 6b 21 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=18108547072, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 d0 28 27 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=18301190144, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 90 e6 2c 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=18493833216, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 50 a4 32 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=18686476288, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 10 62 38 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=18879119360, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 d0 1f 3e 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=19071762432, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 20 90 dd 43 40 02 00 00 00 00 00
Feb 13 18:29:20 wojtek kernel: g_vfs_done():(ada0:ahcich0:0:0:0): CAM status: CCB request was invalid
Feb 13 18:29:20 wojtek kernel: ada0a[WRITE(offset=19264405504, length=16384)]error = 22
Feb 13 18:29:20 wojtek kernel: (ada0:ahcich0:0:0:0): Error 22, Unretryable error
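
To help read the ACB bytes in the log, here is a minimal Python sketch that turns one 12-byte dump into an LBA and a transfer length. It assumes the bytes are printed in the order command, features, lba_low, lba_mid, lba_high, device, lba_low_exp, lba_mid_exp, lba_high_exp, features_exp, sector_count, sector_count_exp, and that for WRITE FPDMA QUEUED (0x61) the sector count is carried in the features fields. The function name `decode_acb` and the 512-byte sector size are illustrative choices, not anything taken from the kernel.

```python
def decode_acb(acb, sector_size=512):
    """Decode a 12-byte 'ACB:' hex dump into LBA and transfer length.

    Assumed byte order (an assumption about the log format):
    command, features, lba_low, lba_mid, lba_high, device,
    lba_low_exp, lba_mid_exp, lba_high_exp, features_exp,
    sector_count, sector_count_exp.
    """
    b = [int(tok, 16) for tok in acb.split()]
    if len(b) != 12:
        raise ValueError("expected 12 hex bytes, got %d" % len(b))
    (command, features, lba_low, lba_mid, lba_high, device,
     lba_low_exp, lba_mid_exp, lba_high_exp, features_exp,
     sector_count, sector_count_exp) = b

    # 48-bit LBA assembled from the low bytes and the "exp" (high) bytes.
    lba = (lba_high_exp << 40 | lba_mid_exp << 32 | lba_low_exp << 24
           | lba_high << 16 | lba_mid << 8 | lba_low)

    # For NCQ writes the sector count sits in the features fields;
    # a raw value of 0 means 65536 sectors.
    sectors = (features_exp << 8 | features) or 65536

    return {
        "command": command,
        "lba": lba,
        "sectors": sectors,
        "byte_offset": lba * sector_size,
        "length": sectors * sector_size,
    }


if __name__ == "__main__":
    # First rejected write from the log.
    print(decode_acb("61 20 10 74 0a 40 02 00 00 00 00 00"))
```

Under those assumptions the first rejected write decodes to LBA 34239504 and 32 sectors, which is 16384 bytes and matches the length in the g_vfs_done lines. Its whole-disk byte offset, 17530626048, is 8192 bytes above the first logged ada0a offset (17530617856). That would fit a partition starting 16 sectors into the disk, but this is an inference from the numbers, not something the log states.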