From: Matt Thyer
To: Tim Gustafson
Cc: freebsd-current@freebsd.org
Date: Wed, 7 Sep 2011 13:29:08 +0930
Subject: Re: RELENG_8 / mpt / zpool Errors

On Sep 7, 2011 8:53 AM, "Tim Gustafson" wrote:
>
> Hi all,
>
> I'm running RELENG_8:
>
> ----------
> root@bsd-03: uname -a
> FreeBSD bsd-03 8.2-STABLE FreeBSD 8.2-STABLE #0: Mon Aug 22 14:58:58 PDT 2011 root@bsd-03:/usr/obj/usr/src/sys/GENERIC amd64
> ----------
>
> We've got an MPT controller installed with 32 drives attached:
>
> ----------
> root@bsd-03: dmesg | grep mpt
> mpt0: port 0xec00-0xecff mem 0xef3fc000-0xef3fffff,0xef3e0000-0xef3effff irq 32 at device 0.0 on pci3
> mpt0: [ITHREAD]
> mpt0: MPI Version=1.5.19.0
> ses0 at mpt0 bus 0 scbus1 target 32 lun 0
> ses1 at mpt0 bus 0 scbus1 target 33 lun 0
> da5 at mpt0 bus 0 scbus1 target 0 lun 0
> .....SNIP.....
> da36 at mpt0 bus 0 scbus1 target 31 lun 0
> ----------
>
> We have a zpool on those drives configured into one large zfs file system:

[snip]

> We're seeing some occasional oddness. About every two weeks it seems the controller temporarily loses connectivity with the drives and the zpool goes a bit bonkers and reports a dozen or so corrupted files. A "zpool scrub" goes through and reports that everything's been fixed and everything seems OK again (although I have not 100% confirmed that there is no file corruption yet, but I'm giving ZFS's check-summing logic the benefit of the doubt here).

[snip]

> So, is this an OS/driver issue? Is it a bad controller? Bad cables? Bad disks?
>
> As always, any help is greatly appreciated. Thanks!
>
> -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
> Tim Gustafson                                                 tjg@soe.ucsc.edu
> Baskin School of Engineering                                      831-459-5354
> UC Santa Cruz                                         Baskin Engineering 317B
> -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

What are the drives exactly?
You may have issues like TLER or frequent head parking.
Are these SATA, SCSI or SAS and are port multipliers in use?
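
For anyone following up on the questions above, here is a rough shell sketch of how the drive model, the TLER setting and the head-parking count might be checked. It is only an illustration under a few assumptions: smartmontools is installed (smartctl is not in the base system), the drives answer SMART queries through the mpt controller, and the function names load_cycle_count and check_drive are made up for this example.

```shell
#!/bin/sh
# Hypothetical helpers; assumes smartmontools (smartctl) is installed.

# Print the raw Load_Cycle_Count (head parking counter, SMART attribute 193)
# from `smartctl -A` output supplied on stdin. Prints nothing if the
# attribute is not present in the input.
load_cycle_count() {
    awk '$2 == "Load_Cycle_Count" { print $10 }'
}

# Show identity, SCT error recovery (TLER) setting and load cycle count
# for one disk. The device path is passed as the first argument.
check_drive() {
    dev="$1"
    smartctl -i "$dev"            # model, firmware, interface type
    smartctl -l scterc "$dev"     # SCT Error Recovery Control, i.e. TLER
    count=$(smartctl -A "$dev" | load_cycle_count)
    echo "Load_Cycle_Count: ${count:-not reported}"
}
```

Something like "check_drive /dev/da5" would then cover one disk (da5 is just the first device from the dmesg output), and "camcontrol devlist" from the base system lists what is attached to the controller. SAS drives do not report ATA attributes, so the load cycle count may simply be missing for them.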