From owner-freebsd-stable@FreeBSD.ORG Tue Apr 26 19:09:11 2011 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D8258106567A for ; Tue, 26 Apr 2011 19:09:11 +0000 (UTC) (envelope-from gkontos.mail@gmail.com) Received: from mail-iw0-f182.google.com (mail-iw0-f182.google.com [209.85.214.182]) by mx1.freebsd.org (Postfix) with ESMTP id A2C828FC19 for ; Tue, 26 Apr 2011 19:09:11 +0000 (UTC) Received: by iwn33 with SMTP id 33so1064691iwn.13 for ; Tue, 26 Apr 2011 12:09:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:date:message-id:subject:from:to :content-type; bh=YM4QWQpeSCIg8BtnW1z1/NtKFX2lgz+e2/TknriLxyQ=; b=B4K0tAT/udNo2lxH9l7fMfoKVM0IFYPMlbbOcaVobJiVBhf2+/iylCxeITs33fvitf eWWnOLDkposwSkVg/627H7Bp061700TyqZAGATorCZFW71BR2pJX8ACFC53VqqbwVtOM QPmFRIOaiKvvX9OZIa6mqcur//ANK2nRU5TfY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=I8O2THX6fs7p8A4YyDu874//ybc7es268gc0nlckUiUuN9wQcKSHpRhQONTZAP3SaE 9oX4R+OSbtEOIuodOePcqHEDyN6i28pdLn07SwezXc2YBCTsoZrYNEU8aLz81R1QVp0I WwA8Gy/sQN/fYPflQAQ8avJ5+vB6F+zH/yH9E= MIME-Version: 1.0 Received: by 10.231.171.205 with SMTP id i13mr884687ibz.181.1303844950771; Tue, 26 Apr 2011 12:09:10 -0700 (PDT) Received: by 10.231.35.2 with HTTP; Tue, 26 Apr 2011 12:09:10 -0700 (PDT) Date: Tue, 26 Apr 2011 22:09:10 +0300 Message-ID: From: George Kontostanos To: freebsd-stable@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: Promise SATA controller issues... X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 26 Apr 2011 19:09:12 -0000 Greetings all, I have a Promise PDC40718 SATA300 controller running on a box from 8.0-Release since now now on 8.2-Stable 8.2-STABLE FreeBSD 8.2-STABLE #3: Thu Apr 21 15:23:08 EEST 2011. The controller is in jbod mode with 3 WD drives in Raidz1. ad6: 610480MB at ata3-master UDMA100 SATA 3Gb/s ad8: 610480MB at ata4-master UDMA100 SATA 3Gb/s ad10: 610480MB at ata5-master UDMA100 SATA 3Gb/s Today the box became unresponsive so I had to do a hard reset. From /var/log/messages: Apr 25 22:08:35 hp kernel: ata4: SIGNATURE: ffffffff Apr 25 22:08:35 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:35 hp kernel: ata4: error issuing SETFEATURES SET TRANSFER MODE command Apr 25 22:08:35 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:35 hp kernel: ata4: error issuing SETFEATURES ENABLE RCACHE command Apr 25 22:08:35 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:35 hp kernel: ata4: error issuing SETFEATURES ENABLE WCACHE command Apr 25 22:08:35 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:35 hp kernel: ata4: error issuing SET_MULTI command Apr 25 22:08:41 hp kernel: ata4: SIGNATURE: ffffffff Apr 25 22:08:41 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:41 hp kernel: ata4: error issuing SETFEATURES SET TRANSFER MODE command Apr 25 22:08:41 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:41 hp kernel: ata4: error issuing SETFEATURES ENABLE RCACHE command Apr 25 22:08:41 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:41 hp kernel: ata4: error issuing SETFEATURES ENABLE WCACHE command Apr 25 22:08:41 hp kernel: ata4: timeout waiting to issue command Apr 25 22:08:41 hp kernel: ata4: error issuing SET_MULTI command Apr 25 22:08:44 hp kernel: ata4: SIGNATURE: ffffffff Apr 25 22:08:44 hp kernel: ata4: timeout waiting to issue command ....... Apr 26 20:18:41 hp smartd[1049]: Device: /dev/ad8, failed to read SMART Attribute Data Apr 26 20:18:42 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:42 hp kernel: ata4: error issuing SMART command Apr 26 20:18:47 hp kernel: ata4: SIGNATURE: ffffffff Apr 26 20:18:47 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:47 hp kernel: ata4: error issuing SETFEATURES SET TRANSFER MODE command Apr 26 20:18:47 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:47 hp kernel: ata4: error issuing SETFEATURES ENABLE RCACHE command Apr 26 20:18:47 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:47 hp kernel: ata4: error issuing SETFEATURES ENABLE WCACHE command Apr 26 20:18:47 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:47 hp kernel: ata4: error issuing SET_MULTI command Apr 26 20:18:48 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:48 hp kernel: ata4: error issuing SMART command Apr 26 20:18:54 hp kernel: ata4: SIGNATURE: ffffffff Apr 26 20:18:54 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:54 hp kernel: ata4: error issuing SETFEATURES SET TRANSFER MODE command Apr 26 20:18:54 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:54 hp kernel: ata4: error issuing SETFEATURES ENABLE RCACHE command Apr 26 20:18:54 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:54 hp kernel: ata4: error issuing SETFEATURES ENABLE WCACHE command Apr 26 20:18:54 hp kernel: ata4: timeout waiting to issue command Apr 26 20:18:54 hp kernel: ata4: error issuing SET_MULTI command Apr 26 20:47:49 hp kernel: ata4: SIGNATURE: ffffffff Apr 26 20:47:49 hp kernel: ata4: timeout waiting to issue command Apr 26 20:47:49 hp kernel: ata4: error issuing SETFEATURES SET TRANSFER MODE command Apr 26 20:47:49 hp kernel: ata4: timeout waiting to issue command Apr 26 20:47:49 hp kernel: ata4: error issuing SETFEATURES ENABLE RCACHE command Apr 26 20:47:49 hp kernel: ata4: timeout waiting to issue command Apr 26 20:47:49 hp kernel: ata4: error issuing SETFEATURES ENABLE WCACHE command Apr 26 20:47:49 hp kernel: ata4: timeout waiting to issue command Apr 26 20:47:49 hp kernel: ata4: error issuing SET_MULTI command Apr 26 20:47:49 hp kernel: ad8: TIMEOUT - READ_DMA retrying (1 retry left) LBA=216798592 It appears from the logs that the problem lasted for a full day! However, after the reboot the drive did not perform any resilver and no data loss occurred. I have scrubbed my pool successfully and run smartmon tests also. SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 1466 - It doesn't appear to be a drive issue so I was wondering if the recent changes in controllers that appeared a few days ago might be related. Thanks -- George Kontostanos aisecure.net