From owner-freebsd-stable@FreeBSD.ORG Thu Jun 10 17:20:00 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 5FB9E106566B for ; Thu, 10 Jun 2010 17:20:00 +0000 (UTC) (envelope-from robin@icir.org) Received: from fruitcake.ICSI.Berkeley.EDU (fruitcake.ICSI.Berkeley.EDU [192.150.186.11]) by mx1.freebsd.org (Postfix) with ESMTP id 406478FC12 for ; Thu, 10 Jun 2010 17:20:00 +0000 (UTC) Received: from empire.icsi.berkeley.edu (empire.ICSI.Berkeley.EDU [192.150.186.169]) by fruitcake.ICSI.Berkeley.EDU (8.12.11.20060614/8.12.11) with ESMTP id o5AGTKYO002270 for ; Thu, 10 Jun 2010 09:29:20 -0700 (PDT) Received: by empire.icsi.berkeley.edu (Postfix, from userid 502) id 78D6929284B; Thu, 10 Jun 2010 09:29:19 -0700 (PDT) Date: Thu, 10 Jun 2010 09:29:19 -0700 From: Robin Sommer To: freebsd-stable@freebsd.org Message-ID: <20100610162918.GA23022@icir.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.19 (2009-01-05) Subject: File system trouble with ICH9 controller X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 10 Jun 2010 17:20:00 -0000 I'm running 8.0-RELEASE-p2 (amd64) on a larger number of Supermicro SBI-7425C-T3 blades. Each of the blades has 2 x 500GB disks striped into a single volume via the on-board ICH9 RAID controller. However, after running fine for a while (days), the blades crash eventually with file system problems such as the one below. Initially I thought that must be a bad disk, but by now 5 different blades have shown similar problems so I'm suspecting some OS issue. Has anybody seen something similar before? Could this be an incompatibility with the RAID controller (I haven't found much recent on Google but there are a number of older threads indicating that it might not be well supported. Not sure though whether those still apply). Any other thoughts? Thanks, Robin --------- syslog ------------------------------------------------------- Jun 9 10:00:02 blade19 kernel: ar0s1a[WRITE(offset=704187858944, length=114688)]error = 5 Jun 9 10:00:02 blade19 kernel: g_vfs_done():ar0s1a[WRITE(offset=704188219392, length=131072)]error = 5 Jun 9 10:00:02 blade19 kernel: g_vfs_done():ar0s1a[WRITE(offset=704188891136, length=114688)]error = 5 Jun 9 10:00:02 blade19 kernel: g_vfs_done():ar0s1a[WRITE(offset=704189382656, length=114688)]error = 5 Jun 9 10:00:02 blade19 kernel: g_vfs_done():ar0s1a[WRITE(offset=704189743104, length=131072)] Jun 9 10:00:02 blade19 kernel: error = 5 --------- system information ------------------------------------------ # uname -a FreeBSD blade5 8.0-RELEASE-p2 FreeBSD 8.0-RELEASE-p2 #0: Tue Jan 5 21:11:58 UTC 2010 root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC amd64 # pciconf -lv | grep SATA device = '82801IB/IR/IH (ICH9 Family) SATA RAID Controller' # atacontrol list ATA channel 2: Master: ad4 SATA revision 2.x Slave: no device present ATA channel 3: Master: ad6 SATA revision 2.x Slave: no device present # dmesg | grep ata atapci0: port 0x1c50-0x1c57,0x1c44-0x1c47,0x1c48-0x1c4f,0x1c40-0x1c43,0x18e0-0x18ff mem 0xfcc00000-0xfcc007ff irq 17 at device 31.2 on pci0 atapci0: [ITHREAD] atapci0: AHCI called from vendor specific driver atapci0: AHCI v1.20 controller with 6 3Gbps ports, PM supported ata2: on atapci0 ata2: [ITHREAD] ata3: on atapci0 ata3: [ITHREAD] ata4: on atapci0 ata4: stopping AHCI engine failed ata4: [ITHREAD] ata5: on atapci0 ata5: stopping AHCI engine failed ata5: [ITHREAD] ata6: on atapci0 ata6: [ITHREAD] ata7: on atapci0 ata7: [ITHREAD] ad4: 476940MB at ata2-master SATA300 ad6: 476940MB at ata3-master SATA300 ar0: writing of DDF metadata is NOT supported yet ar0: disk0 READY using ad4 at ata2-master ar0: disk1 READY using ad6 at ata3-master -- Robin Sommer * Phone +1 (510) 666-2886 * robin@icir.org ICSI/LBNL * Fax +1 (510) 666-2956 * www.icir.org