From owner-svn-src-all@FreeBSD.ORG Sat Mar 6 19:09:28 2010 Return-Path: Delivered-To: svn-src-all@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 48440106564A for ; Sat, 6 Mar 2010 19:09:28 +0000 (UTC) (envelope-from mcdouga9@egr.msu.edu) Received: from mx.egr.msu.edu (surfnturf.egr.msu.edu [35.9.37.164]) by mx1.freebsd.org (Postfix) with ESMTP id 08D528FC12 for ; Sat, 6 Mar 2010 19:09:27 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by mx.egr.msu.edu (Postfix) with ESMTP id 1D59519A157 for ; Sat, 6 Mar 2010 14:09:27 -0500 (EST) X-Virus-Scanned: amavisd-new at egr.msu.edu Received: from mx.egr.msu.edu ([127.0.0.1]) by localhost (surfnturf.egr.msu.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id oV6NqZtBVOER for ; Sat, 6 Mar 2010 14:09:26 -0500 (EST) Received: from [35.9.44.65] (daemon.egr.msu.edu [35.9.44.65]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: mcdouga9) by mx.egr.msu.edu (Postfix) with ESMTPSA id D279919A154 for ; Sat, 6 Mar 2010 14:09:26 -0500 (EST) Message-ID: <4B92A866.8070108@egr.msu.edu> Date: Sat, 06 Mar 2010 14:09:26 -0500 From: Adam McDougall User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; en-US; rv:1.9.1.8) Gecko/20100304 Thunderbird/3.0.3 MIME-Version: 1.0 To: svn-src-all@freebsd.org References: <201002141938.o1EJcRpx065470@svn.freebsd.org> <4B7D4962.8070706@freebsd.org> <4B7EC763.4090507@FreeBSD.org> <4B81C41F.2080601@freebsd.org> <4B822CD4.5080604@FreeBSD.org> <4B822FEB.5030901@freebsd.org> In-Reply-To: <4B822FEB.5030901@freebsd.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Subject: Re: svn commit: r203889 - in stable/8/sys: cam cam/ata cam/scsi dev/ahci dev/asr dev/ata dev/ciss dev/hptiop dev/hptrr dev/mly dev/mpt dev/ppbus dev/siis dev/trm dev/twa dev/usb/storage X-BeenThere: svn-src-all@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "SVN commit messages for the entire src tree \(except for " user" and " projects" \)" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 06 Mar 2010 19:09:28 -0000 On 02/22/10 02:19, Lawrence Stewart wrote: > On 02/22/10 18:05, Alexander Motin wrote: >> Lawrence Stewart wrote: >>> On 02/20/10 04:16, Alexander Motin wrote: >>>> Lawrence Stewart wrote: >>>>> I compiled DDB into my r203889 kernel. Unfortunately my ILO emulates a >>>>> USB keyboard so I can't do anything in DDB which is a huge pain, but >>>>> here's the info I did get (hand transcribed): >>>>> >>>>> Fatal trap 12: page fault while in kernel mode >>>>> current process: mpt_raid0 >>>>> Stopped at xpt_rescan+0x1d: movq 0x10(%rsi),%rdx >>>>> >>>>> 1. Any thoughts on how to resolve the regression in the mpt driver >>>>> with >>>>> the r203889 commit? >>>> >>> Perhaps this commit should be backed out of 8-STABLE until we get a >>> chance to diagnose a bit more? >> >> I also have successful reports with this driver, so problem is not >> common. So I don't think it is reasonable to back-out it now. As soon as >> you are the only complaining now, it is only you who can debug the >> issue. So could you be so kind to provide more info? Even without >> keyboard you should be able to get verbose boot messages and full panic >> message. > > Fair enough, if I'm the only person complaining and you've had other > success reports, then that's cool. > > I will get some more info and report back but will have to wait until > later in the week before the machine can be scheduled for some > out-of-hours downtime. > > Cheers, > Lawrence > Short version of my tale below: it might be bad disks... It might be worth mentioning that I discovered similar (but not identical) symptoms today on a Sun Fire v20z with parallel SCSI, no hardware mirroring being used (just zfs/gmirror). I had just reinstalled it to a feb 27 build of 8-stable and encountered trouble while running portsnap extract. I was seeing timeouts, resets, and errors involving both drives. I tried reinstalling with older versions even 8.0-release and still ran into trouble, although on one fresh boot I was watching the console before it started complaining and saw a short burst of messages relating to da1 only: (da1:mpt0:0:1:0): WRITE(10). CDB: 2a 0 0 40 ad 22 0 0 80 0 (da1:mpt0:0:1:0): CAM Status: SCSI Status Error (da1:mpt0:0:1:0): SCSI Status: Check Condition (da1:mpt0:0:1:0): HARDWARE FAILURE info:40ad22 asc:80,87 (da1:mpt0:0:1:0): Vendor Specific ASC field replaceable unit: be (da1:mpt0:0:1:0): Retrying Command (per Sense Data) I haven't looked up the vendor code but I replaced both disks (couldn't easily find the same model as da0 so I replaced both) and so far its fine with the Feb 27 build.