From owner-freebsd-scsi@FreeBSD.ORG Sun May 11 01:07:04 2008 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2BCCF106566C for ; Sun, 11 May 2008 01:07:04 +0000 (UTC) (envelope-from sbruno@miralink.com) Received: from plato.miralink.com (mail.miralink.com [70.103.185.20]) by mx1.freebsd.org (Postfix) with ESMTP id F07F58FC22 for ; Sun, 11 May 2008 01:07:03 +0000 (UTC) (envelope-from sbruno@miralink.com) Received: from localhost (localhost.localdomain [127.0.0.1]) by plato.miralink.com (Postfix) with ESMTP id 9A85E1C802A; Sat, 10 May 2008 18:07:03 -0700 (PDT) Received: from plato.miralink.com ([127.0.0.1]) by localhost (plato.miralink.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 21632-03; Sat, 10 May 2008 18:07:02 -0700 (PDT) Received: from [10.47.1.126] (vpn.office.miralink.com [10.0.0.5]) by plato.miralink.com (Postfix) with ESMTP id 099581C800D; Sat, 10 May 2008 18:07:01 -0700 (PDT) Message-ID: <482646B5.807@miralink.com> Date: Sat, 10 May 2008 18:07:01 -0700 From: Sean Bruno User-Agent: Thunderbird 2.0.0.14 (X11/20080501) MIME-Version: 1.0 To: Graham Allan References: <20080509011028.GV25577@physics.umn.edu> <20080509215621.GX25577@physics.umn.edu> In-Reply-To: <20080509215621.GX25577@physics.umn.edu> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-DSPAM-Result: Innocent X-DSPAM-Processed: Sat May 10 18:07:03 2008 X-DSPAM-Confidence: 0.9997 X-DSPAM-Probability: 0.0000 X-DSPAM-Signature: 482646b7270421337958704 X-DSPAM-Factors: 27, X-Virus-Scanned: amavisd-new at X-Spam-Status: No, score=-4.499 tagged_above=-10 required=6.6 autolearn=ham tests=[ALL_TRUSTED=-1.8, BAYES_00=-2.599, DSPAM_HAM=-0.1] X-Spam-Score: -4.499 X-Spam-Level: Cc: freebsd-scsi@freebsd.org Subject: Re: Hang on boot in isp with QLA2342 after upgrading to 6.3 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 11 May 2008 01:07:04 -0000 Graham Allan wrote: > On Thu, May 08, 2008 at 08:10:28PM -0500, Graham Allan wrote: > >> Hi, >> >> I've been trying to figure out a problem on a system which I just >> upgraded from FreeBSD 6.1 to 6.3. It's a Dell 1750 with a QLA2342 >> (isp2312) HBA, connected to an EMC DS-16B2 (aka Brocade 3800) switch, >> and from there to a couple of AC&NC Jetstor arrays. It's all been >> working fine like this for some time under 6.1. >> >> When I boot the system under 6.3, though I get a hang like this: >> >> isp0: Interrupting Mailbox Command (0x6f) Timeout (500000us) >> isp0: PLOGI 0x031a00 failed >> isp0: Interrupting Mailbox Command (0x6e) Timeout (10000000us) >> isp0: Mailbox Command 'SEND SNS' failed (TIMEOUT) >> isp0: isp_pdb_sync: isp_scan_fabric failed >> isp0: Interrupting Mailbox Command (0x6e) Timeout (10000000us) >> isp0: Mailbox Command 'SEND SNS' failed (TIMEOUT) >> isp0: isp_pdb_sync: isp_scan_fabric failed >> isp0: Mailbox Command 'SEND SNS' failed (COMMAND ERROR) >> isp0: isp_pdb_sync: isp_scan_fabric failed >> >> after which the system stays hung. >> > > I've done some more testing on this and am left with a headache because > nothing makes sense! Would certainly be grateful if any fiber channel > gurus might comment. > > I tested again with FreeBSD 7.0 and got the same result as above. > > Next I tried swapping components in and out of the SAN to see if > different combinations have different results. These tests were with > 7.0 since that was what I still had installed. The SAN switch has the > following devices connected: > > 1) AC&NC Jetstor 416F > 2) AC&NC Jetstor 516F > 3) Alphaserver ES40, Tru64 5.1B-6, KGPSA-CA (Emulex) HBA > 4) Dell 1750, (the system I'm writing about), Qlogic 2342 HBA > 5) Dell 1750, Windows 2003, LSI 7202P HBA > > There's no zoning on the switch since all three servers connect to the > two Jetstors. Access to the RAID volumes is controlled by host filters > on the Jetstors. > > So I tested with different combinations of devices connected to the > switch, with perplexing results: > > FreeBSD + 416F only - boots fine > FreeBSD + 516F only - boots fine > FreeBSD + 416F + 516F - boots fine > FreeBSD + 416F + 516F + windows - boots fine > FreeBSD + 416F + 516F + ES40 - hangs with the above error > but, continuing... > FreeBSD + 516F + ES40 - boots fine > FreeBSD + 416F + ES40 - boots fine > FreeBSD + 416F + 516F + ES40 - hangs again > > I can't make any sense of this... there are so many different systems > involved that there's no way to know where the problem really lies. > Although as it did work with FreeBSD 6.1, it feels to me like something > is wrong in the newer isp driver, but I have no solid knowledge to base > that on. > > Thanks for any ideas, > > Graham > I see that you tested 6.1 but not 6.2 ... if you could, can you check 6.2? I'm trying to limit the code searching and that would help a bit. Sean