From owner-freebsd-scsi@FreeBSD.ORG Mon Apr 30 11:07:41 2012 Return-Path: Delivered-To: freebsd-scsi@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id C7DF010656D6 for ; Mon, 30 Apr 2012 11:07:41 +0000 (UTC) (envelope-from owner-bugmaster@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:4f8:fff6::28]) by mx1.freebsd.org (Postfix) with ESMTP id B1C558FC21 for ; Mon, 30 Apr 2012 11:07:41 +0000 (UTC) Received: from freefall.freebsd.org (localhost [127.0.0.1]) by freefall.freebsd.org (8.14.5/8.14.5) with ESMTP id q3UB7fHE054021 for ; Mon, 30 Apr 2012 11:07:41 GMT (envelope-from owner-bugmaster@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.14.5/8.14.5/Submit) id q3UB7eIB054019 for freebsd-scsi@FreeBSD.org; Mon, 30 Apr 2012 11:07:40 GMT (envelope-from owner-bugmaster@FreeBSD.org) Date: Mon, 30 Apr 2012 11:07:40 GMT Message-Id: <201204301107.q3UB7eIB054019@freefall.freebsd.org> X-Authentication-Warning: freefall.freebsd.org: gnats set sender to owner-bugmaster@FreeBSD.org using -f From: FreeBSD bugmaster To: freebsd-scsi@FreeBSD.org Cc: Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 30 Apr 2012 11:07:41 -0000

Note: to view an individual PR, use:
  http://www.freebsd.org/cgi/query-pr.cgi?pr=(number).

The following is a listing of current problems submitted by FreeBSD users.
These represent problem reports covering all versions including
experimental development code and obsolete releases.

S Tracker      Resp.      Description
--------------------------------------------------------------------------------
o kern/165982  scsi       [mpt] mpt instability, drive resets, and losses on Fre
o kern/165740  scsi       [cam] SCSI code must drain callbacks before free
o kern/163713  scsi       [aic7xxx] [patch] Add Adaptec29329LPE to aic79xx_pci.c
o kern/162256  scsi       [mpt] QUEUE FULL EVENT and 'mpt_cam_event: 0x0'
o kern/161809  scsi       [cam] [patch] set kern.cam.boot_delay via build option
o kern/159412  scsi       [ciss] 7.3 RELEASE: ciss0 ADAPTER HEARTBEAT FAILED err
o kern/157770  scsi       [iscsi] [panic] iscsi_initiator panic
o kern/154432  scsi       [xpt] run_interrupt_driven_hooks: still waiting after
o kern/153514  scsi       [cam] [panic] CAM related panic
o kern/153361  scsi       [ciss] Smart Array 5300 boot/detect drive problem
o kern/152250  scsi       [ciss] [patch] Kernel panic when hw.ciss.expose_hidden
o kern/151564  scsi       [ciss] ciss(4) should increase CISS_MAX_LOGICAL to 10
o docs/151336  scsi       Missing documentation of scsi_ and ata_ functions in c
s kern/149927  scsi       [cam] hard drive not stopped before removing power dur
o kern/148083  scsi       [aac] Strange device reporting
o kern/147704  scsi       [mpt] sys/dev/mpt: new chip revision, partially unsupp
o kern/146287  scsi       [ciss] ciss(4) cannot see more than one SmartArray con
o kern/145768  scsi       [mpt] can't perform I/O on SAS based SAN disk in freeb
o kern/144648  scsi       [aac] Strange values of speed and bus width in dmesg
o kern/144301  scsi       [ciss] [hang] HP proliant server locks when using ciss
o kern/142351  scsi       [mpt] LSILogic driver performance problems
o kern/134488  scsi       [mpt] MPT SCSI driver probes max. 8 LUNs per device
o kern/132250  scsi       [ciss] ciss driver does not support more then 15 drive
o kern/132206  scsi       [mpt] system panics on boot when mirroring and 2nd dri
o kern/130621  scsi       [mpt] tranfer rate is inscrutable slow when use lsi213
o kern/129602  scsi       [ahd] ahd(4) gets confused and wedges SCSI bus
o kern/128452  scsi       [sa] [panic] Accessing SCSI tape drive randomly crashe
o kern/128245  scsi       [scsi] "inquiry data fails comparison at DV1 step" [re
o kern/127927  scsi       [isp] isp(4) target driver crashes kernel when set up
o kern/127717  scsi       [ata] [patch] [request] - support write cache toggling
o kern/123674  scsi       [ahc] ahc driver dumping
o kern/123520  scsi       [ahd] unable to boot from net while using ahd
o sparc/121676 scsi       [iscsi] iscontrol do not connect iscsi-target on sparc
o kern/120487  scsi       [sg] scsi_sg incompatible with scanners
o kern/120247  scsi       [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s
o kern/114597  scsi       [sym] System hangs at SCSI bus reset with dual HBAs
o kern/110847  scsi       [ahd] Tyan U320 onboard problem with more than 3 disks
o kern/99954   scsi       [ahc] reading from DVD failes on 6.x [regression]
o kern/92798   scsi       [ahc] SCSI problem with timeouts
o kern/90282   scsi       [sym] SCSI bus resets cause loss of ch device
o kern/76178   scsi       [ahd] Problem with ahd and large SCSI Raid system
o kern/74627   scsi       [ahc] [hang] Adaptec 2940U2W Can't boot 5.3
s kern/61165   scsi       [panic] kernel page fault after calling cam_send_ccb
o kern/60641   scsi       [sym] Sporadic SCSI bus resets with 53C810 under load
o kern/60598   scsi       wire down of scsi devices conflicts with config
s kern/57398   scsi       [mly] Current fails to install on mly(4) based RAID di
o kern/52638   scsi       [panic] SCSI U320 on SMP server won't run faster than
o kern/44587   scsi       dev/dpt/dpt.h is missing defines required for DPT_HAND
o kern/39388   scsi       ncr/sym drivers fail with 53c810 and more than 256MB m
o kern/35234   scsi       World access to /dev/pass? (for scanner) requires acce

50 problems total.
From owner-freebsd-scsi@FreeBSD.ORG Thu May 3 05:51:18 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CD6AD106566B for ; Thu, 3 May 2012 05:51:18 +0000 (UTC) (envelope-from trent@snakebite.org) Received: from exchange.liveoffice.com (exchla3.liveoffice.com [64.70.67.188]) by mx1.freebsd.org (Postfix) with ESMTP id AE4F88FC0C for ; Thu, 3 May 2012 05:51:18 +0000 (UTC) Received: from EXHUB03.exchhosting.com (192.168.11.104) by exhub09.exchhosting.com (192.168.11.107) with Microsoft SMTP Server (TLS) id 8.3.213.0; Wed, 2 May 2012 22:50:10 -0700 Received: from EXMBX10.exchhosting.com ([fe80::9c37:32f6:a508:a44f]) by EXHUB03.exchhosting.com ([fe80::ac41:fbe5:3959:ad64%12]) with mapi; Wed, 2 May 2012 22:50:11 -0700 From: Trent Nelson To: "freebsd-scsi@freebsd.org" Date: Wed, 2 May 2012 22:50:09 -0700 Thread-Topic: Programmatically getting a FC drive's WWNN, WWPN and PortID Thread-Index: Ac0o8JmdTdfGf10bRCydae+Sqzy33g== Message-ID: <6C58B329-2DFC-4F2A-9FE1-F47B142748FD@snakebite.org> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Subject: Programmatically getting a FC drive's WWNN, WWPN and PortID X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 03 May 2012 05:51:18 -0000 How can I programmatically get the WWNN, WWPN and PortID of a drive? 
The info is available via dmesg when the drive is detected:

    % dmesg | grep da31
    da31 at isp1 bus 0 scbus6 target 13 lun 0
    da31: Fixed Direct Access SCSI-3 device
    da31: 200.000MB/s transfers WWNN 0x20000004cf83a4b2 WWPN 0x21000004cf83a4b2 PortID 0xcd
    da31: Command Queueing enabled
    da31: 35003MB (71687371 512 byte sectors: 255H 63S/T 4462C)

I figured `camcontrol inquiry` would be my best shot, but no dice:

    % camcontrol inquiry da31
    pass32: Fixed Direct Access SCSI-3 device
    pass32: Serial Number 3FP1G51L00007240R1BS
    pass32: 200.000MB/s transfers, Command Queueing Enabled

It would be ace if camcontrol inquiry could be taught about WWNNs, WWPNs
and PortIDs. In the meantime, is there any `camcontrol cmd`-type magic
I could use to get at the values?

Use case: some Python code I'm working on that basically does this:
    <1. knowledge of which disks are in which JBOD arrays> +
    <2. knowledge of JBOD ports->SAN switch ports> +
    <3. knowledge of localhost HBA->SAN switch port> +
    <4. knowledge of localhost HBA->accessible disks by WWPN> +
    <5. knowledge of port bandwidth limits> +
    <6. knowledge of other systems' 'claimed' disks+paths>

    = Automatic optimal gmultipath/zpool configuration

Step 4 needs to be able to get at disk WWNN/WWPN/PortIDs for a given HBA
in a programmatic/reliable fashion (`dmesg | grep da33` isn't reliable).

If there isn't some sort of magic `camcontrol cmd` I can send in the
interim, I'm all ears for what I'd need to hack in order to implement
this functionality :-)

Regards,

    Trent.
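
As a stopgap for step 4, the three identifiers can be scraped out of the attach message quoted above. The following is only a rough Python sketch under stated assumptions: it assumes the "transfers WWNN ... WWPN ... PortID ..." line format exactly as it appears in that dmesg excerpt, `parse_fc_ids` is a hypothetical helper name, and it inherits every weakness of reading dmesg (the message buffer can wrap or be cleared), so it does not replace a proper interface.

```python
import re

# Matches the transfer line printed at attach time, e.g.
# "da31: 200.000MB/s transfers WWNN 0x... WWPN 0x... PortID 0xcd"
# (format assumed from the dmesg excerpt in the message above).
FC_LINE = re.compile(
    r"^(?P<dev>da\d+): .*?"
    r"WWNN (?P<wwnn>0x[0-9a-fA-F]+) "
    r"WWPN (?P<wwpn>0x[0-9a-fA-F]+) "
    r"PortID (?P<portid>0x[0-9a-fA-F]+)"
)


def parse_fc_ids(dmesg_text):
    """Return {device: (wwnn, wwpn, portid)} as ints; the last attach wins."""
    found = {}
    for line in dmesg_text.splitlines():
        m = FC_LINE.match(line.strip())
        if m:
            found[m.group("dev")] = tuple(
                int(m.group(key), 16) for key in ("wwnn", "wwpn", "portid")
            )
    return found


sample = """\
da31 at isp1 bus 0 scbus6 target 13 lun 0
da31: 200.000MB/s transfers WWNN 0x20000004cf83a4b2 WWPN 0x21000004cf83a4b2 PortID 0xcd
da31: Command Queueing enabled
"""

ids = parse_fc_ids(sample)
print({dev: tuple(hex(v) for v in vals) for dev, vals in ids.items()})
```

Keeping the values as integers rather than strings makes it straightforward to compare them against WWPNs gathered from other sources, such as the switch side, whatever their textual formatting.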
From owner-freebsd-scsi@FreeBSD.ORG Thu May 3 13:34:52 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id EC8491065675 for ; Thu, 3 May 2012 13:34:52 +0000 (UTC) (envelope-from mj@feral.com) Received: from ns1.feral.com (ns1.feral.com [192.67.166.1]) by mx1.freebsd.org (Postfix) with ESMTP id BB0A08FC1D for ; Thu, 3 May 2012 13:34:52 +0000 (UTC) Received: from [192.168.135.100] (c-76-126-166-136.hsd1.ca.comcast.net [76.126.166.136]) (authenticated bits=0) by ns1.feral.com (8.14.4/8.14.4) with ESMTP id q43DYpIU029504 (version=TLSv1/SSLv3 cipher=DHE-RSA-CAMELLIA256-SHA bits=256 verify=NO); Thu, 3 May 2012 06:34:51 -0700 (PDT) (envelope-from mj@feral.com) Message-ID: <4FA28976.2070607@feral.com> Date: Thu, 03 May 2012 06:34:46 -0700 From: Matthew Jacob Organization: Feral Software User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:12.0) Gecko/20120428 Thunderbird/12.0.1 MIME-Version: 1.0 To: Trent Nelson References: <20120502203948.GA1141@snakebite.org> <4FA1DF1C.2030706@feral.com> <20120503050956.GA18983@snakebite.org> In-Reply-To: <20120503050956.GA18983@snakebite.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.7 (ns1.feral.com [192.67.166.1]); Thu, 03 May 2012 06:34:52 -0700 (PDT) Cc: freebsd-scsi@freebsd.org Subject: Re: Missing sysctl options for isp driver X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: mj@feral.com List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 03 May 2012 13:34:53 -0000 > I, uh, I don't think I'm using target mode. I've got four Xyratex RS- > 1600-FC2 JBODs (16x146GB + 48x36GB) connected to two-but-soon-to-be-four > HP StorageWorks SAN Switch 2/16s. 
Each FreeBSD box has a minimum of two > FC HBA ports; each HBA port goes to a different switch, and zoning config > on the switches controls which disks each hosts sees (although more on > this later). > > So, uh, I think that constitutes fabric mode right? The switch reports > the HBA ports as F-Ports and, when zoning's enabled, all the FLOGI/PLOGI > stuff seems to work properly. > > (My SAN/FC-fu has been cobbled together from trial-and-error, so I'm not > entirely sure what target mode is or whether or not target mode bugs are > likely to affect me.) Target Mode is where the FreeBSD box can pretend to be a disk. Yes, that's fabric. > > >> Did you have specific bugs you were concerned with? > Well... as I'm still playing around with the setup, I'm not sure if some > of the oddities I'm seeing are bugs or not. Things seem to work when I've > got zoning enabled on the SAN switches. > > When zoning's disabled and the HBAs are plugged into the fabric, things > sort of go a bit ballistic... The isp driver doesn't seem to handle getting > 128 RSCNs (2 for each dual-ported FC drive) in one go very well. Mailbox > timeouts, hung `camcontrol rescan all`'s, lots of isp[n] dmesg errors and > warnings -- even the SAN switch console starts barfing errors. 
> > This sort of stuff (queue random dmesg snippets from earlier): > > (da37:isp1:0:88:0): got CAM status 0x4a > (da37:isp1:0:88:0): fatal error, failed to attach to device > (da37:isp1:0:88:0): removing device entry > (da2:isp0:0:9:0): Synchronize cache failed, status == 0x1b, scsi status == 0x0 > (da38:isp1:0:102:0): got CAM status 0x4a > (da38:isp1:0:102:0): fatal error, failed to attach to device > (da38:isp1:0:102:0): removing device entry > isp0: Chan 0 PLOGI 0x020500 failed > isp0: Chan 0 PLOGI 0x020600 failed > isp0: Chan 0 PLOGI 0x020800 failed > isp0: Mailbox Command 'BUS RESET' failed (COMMAND ERROR) > isp1: Mailbox Command 'BUS RESET' failed (COMMAND ERROR) > (da119:isp1:0:3:0): lost device - 0 outstanding, 0 refs > (da119:isp1:0:3:0): removing device entry > isp1: Chan 0 PLOGI 0x0201ca failed > isp1: Chan 0 PLOGI 0x0201cb failed > isp0: Chan 0 PLOGI 0x0101e0 failed > isp0: Chan 0 PLOGI 0x0101e1 failed > isp0: Chan 0 PortID 0x0102ad already at 512 handle 35 state 5 > isp0: Chan 0 PortID 0x0102ae already at 512 handle 34 state 5 > isp0: Chan 0 PortID 0x0102b1 already at 512 handle 33 state 5 > isp0: Chan 0 PortID 0x0102b2 already at 512 handle 32 state 5 > isp0: Chan 0 PortID 0x0102b3 already at 512 handle 31 state 5 > > If I plug the HBA directly into a port on the JBOD, I get no such issues; > all /dev/da* device nodes are created instantly and everything just seems > to work. (I presume HBA->JBOD would result in an FC-AL topology rather > than an F-port topology, which I figure translates into a different code > path in the isp driver.) > > When I enabled zoning and configured a HBA to only see a handful of disks, > everything behaved properly. I'm going to update to head and get a debug > build going with all the ISP debug bells and whistles; I'll report back on > HBA->fabric behaviour when zoning is off after that. 
> > (FWIW, I've got two HP-UX (PA-RISC and IA64) boxes with two HBAs also > plugged into the fabric; they don't go into fits like FreeBSD/isp seems > to when zoning is turned off.) > What kind of switches are you using? This is definitely wrong, and probably means that I need to do some work again here. From owner-freebsd-scsi@FreeBSD.ORG Thu May 3 13:37:47 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3B5E8106566B for ; Thu, 3 May 2012 13:37:47 +0000 (UTC) (envelope-from mj@feral.com) Received: from ns1.feral.com (ns1.feral.com [192.67.166.1]) by mx1.freebsd.org (Postfix) with ESMTP id E37DF8FC16 for ; Thu, 3 May 2012 13:37:46 +0000 (UTC) Received: from [192.168.135.100] (c-76-126-166-136.hsd1.ca.comcast.net [76.126.166.136]) (authenticated bits=0) by ns1.feral.com (8.14.4/8.14.4) with ESMTP id q43DbkeC029525 (version=TLSv1/SSLv3 cipher=DHE-RSA-CAMELLIA256-SHA bits=256 verify=NO) for ; Thu, 3 May 2012 06:37:46 -0700 (PDT) (envelope-from mj@feral.com) Message-ID: <4FA28A25.1010004@feral.com> Date: Thu, 03 May 2012 06:37:41 -0700 From: Matthew Jacob Organization: Feral Software User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:12.0) Gecko/20120428 Thunderbird/12.0.1 MIME-Version: 1.0 To: freebsd-scsi@freebsd.org References: <6C58B329-2DFC-4F2A-9FE1-F47B142748FD@snakebite.org> In-Reply-To: <6C58B329-2DFC-4F2A-9FE1-F47B142748FD@snakebite.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.7 (ns1.feral.com [192.67.166.1]); Thu, 03 May 2012 06:37:46 -0700 (PDT) Subject: Re: Programmatically getting a FC drive's WWNN, WWPN and PortID X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: mj@feral.com List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: 
List-Subscribe: , X-List-Received-Date: Thu, 03 May 2012 13:37:47 -0000

On 5/2/2012 10:50 PM, Trent Nelson wrote:
> How can I programmatically get the WWNN, WWPN and PortID of a drive? The
> info is available via dmesg when the drive is detected:
>
> % dmesg | grep da31
> da31 at isp1 bus 0 scbus6 target 13 lun 0
> da31: Fixed Direct Access SCSI-3 device
> da31: 200.000MB/s transfers WWNN 0x20000004cf83a4b2 WWPN 0x21000004cf83a4b2 PortID 0xcd
> da31: Command Queueing enabled
> da31: 35003MB (71687371 512 byte sectors: 255H 63S/T 4462C)
>
> I figured `camcontrol inquiry` would be my best shot, but no dice:
>
> % camcontrol inquiry da31
> pass32: Fixed Direct Access SCSI-3 device
> pass32: Serial Number 3FP1G51L00007240R1BS
> pass32: 200.000MB/s transfers, Command Queueing Enabled
>
> It would be ace if camcontrol inquiry could be taught about WWNNs, WWPNs
> and PortIDs. In the meantime, is there any `camcontrol cmd`-type magic
> I could use to get at the values?
>
> Use case: some Python code I'm working on that basically does this:
> <1. knowledge of which disks are in which JBOD arrays> +
> <2. knowledge of JBOD ports->SAN switch ports> +
> <3. knowledge of localhost HBA->SAN switch port> +
> <4. knowledge of localhost HBA->accessible disks by WWPN> +
> <5. knowledge of port bandwidth limits> +
> <6. knowledge of other systems' 'claimed' disks+paths>
>
> = Automatic optimal gmultipath/zpool configuration
>
> Step 4 needs to be able to get at disk WWNN/WWPN/PortIDs for a given HBA
> in a programmatic/reliable fashion (`dmesg | grep da33` isn't reliable).
>
> If there isn't some sort of magic `camcontrol cmd` I can send in the
> interim, I'm all ears for what I'd need to hack in order to implement
> this functionality :-)
>
> Regards,

There's a lot of functionality like this that has never been completed in
FreeBSD/camcontrol. Linux sort of does this under sysfs, but really the
right approach would have been to implement the SNIA HBAAPI interfaces.
That said, there are ioctls for some of this stuff for isp. See http://people.freebsd.org/~mjacob/isp_tools.tgz > Trent. > > _______________________________________________ > freebsd-scsi@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-scsi > To unsubscribe, send any mail to "freebsd-scsi-unsubscribe@freebsd.org" > From owner-freebsd-scsi@FreeBSD.ORG Thu May 3 14:39:48 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 154771065670 for ; Thu, 3 May 2012 14:39:48 +0000 (UTC) (envelope-from mj@feral.com) Received: from ns1.feral.com (ns1.feral.com [192.67.166.1]) by mx1.freebsd.org (Postfix) with ESMTP id DA0D48FC12 for ; Thu, 3 May 2012 14:39:47 +0000 (UTC) Received: from [192.168.135.100] (c-76-126-166-136.hsd1.ca.comcast.net [76.126.166.136]) (authenticated bits=0) by ns1.feral.com (8.14.4/8.14.4) with ESMTP id q43Edkqf029937 (version=TLSv1/SSLv3 cipher=DHE-RSA-CAMELLIA256-SHA bits=256 verify=NO); Thu, 3 May 2012 07:39:47 -0700 (PDT) (envelope-from mj@feral.com) Message-ID: <4FA298AD.9030706@feral.com> Date: Thu, 03 May 2012 07:39:41 -0700 From: Matthew Jacob Organization: Feral Software User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:12.0) Gecko/20120428 Thunderbird/12.0.1 MIME-Version: 1.0 To: Trent Nelson References: <20120502203948.GA1141@snakebite.org> <4FA1DF1C.2030706@feral.com> <20120503050956.GA18983@snakebite.org> In-Reply-To: <20120503050956.GA18983@snakebite.org> X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.2.7 (ns1.feral.com [192.67.166.1]); Thu, 03 May 2012 07:39:47 -0700 (PDT) Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: freebsd-scsi@freebsd.org Subject: Re: Missing sysctl options for isp driver X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: 
mj@feral.com List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 03 May 2012 14:39:48 -0000 Oh, one thing to always check is to make sure you're using loaded firmware for isp rather than resident random BIOS firmware. Make sure you have ispfw_load="YES" in your /boot/loader.conf You can also, if you just have 2300's, do isp_2300_load="YES" instead. From owner-freebsd-scsi@FreeBSD.ORG Thu May 3 16:58:44 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id C06831065670 for ; Thu, 3 May 2012 16:58:44 +0000 (UTC) (envelope-from trent@snakebite.org) Received: from exchange.liveoffice.com (exchla3.liveoffice.com [64.70.67.188]) by mx1.freebsd.org (Postfix) with ESMTP id 9B8C18FC0C for ; Thu, 3 May 2012 16:58:44 +0000 (UTC) Received: from EXHUB03.exchhosting.com (192.168.11.104) by exhub14.exchhosting.com (192.168.11.123) with Microsoft SMTP Server (TLS) id 8.3.213.0; Thu, 3 May 2012 09:58:44 -0700 Received: from EXMBX10.exchhosting.com ([fe80::9c37:32f6:a508:a44f]) by EXHUB03.exchhosting.com ([fe80::ac41:fbe5:3959:ad64%12]) with mapi; Thu, 3 May 2012 09:58:43 -0700 From: Trent Nelson To: "mj@feral.com" Date: Thu, 3 May 2012 09:58:40 -0700 Thread-Topic: Missing sysctl options for isp driver Thread-Index: Ac0pTf75NPJ/UMmTRoqq9owxmKGvww== Message-ID: <3B12476E-569B-415D-9F2A-BF67652BB518@snakebite.org> References: <20120502203948.GA1141@snakebite.org> <4FA1DF1C.2030706@feral.com> <20120503050956.GA18983@snakebite.org> <4FA28976.2070607@feral.com> In-Reply-To: <4FA28976.2070607@feral.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Cc: "freebsd-scsi@freebsd.org" Subject: Re: Missing sysctl options for isp driver X-BeenThere: 
freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 03 May 2012 16:58:44 -0000

>> (da37:isp1:0:88:0): got CAM status 0x4a
>> (da37:isp1:0:88:0): fatal error, failed to attach to device
>> (da37:isp1:0:88:0): removing device entry
>> (da2:isp0:0:9:0): Synchronize cache failed, status == 0x1b, scsi status == 0x0
>> (da38:isp1:0:102:0): got CAM status 0x4a
>> (da38:isp1:0:102:0): fatal error, failed to attach to device
>> (da38:isp1:0:102:0): removing device entry
>> isp0: Chan 0 PLOGI 0x020500 failed
>> isp0: Chan 0 PLOGI 0x020600 failed
>> isp0: Chan 0 PLOGI 0x020800 failed
>> isp0: Mailbox Command 'BUS RESET' failed (COMMAND ERROR)
>> isp1: Mailbox Command 'BUS RESET' failed (COMMAND ERROR)
>> (da119:isp1:0:3:0): lost device - 0 outstanding, 0 refs
>> (da119:isp1:0:3:0): removing device entry
>> isp1: Chan 0 PLOGI 0x0201ca failed
>> isp1: Chan 0 PLOGI 0x0201cb failed
>> isp0: Chan 0 PLOGI 0x0101e0 failed
>> isp0: Chan 0 PLOGI 0x0101e1 failed
>> isp0: Chan 0 PortID 0x0102ad already at 512 handle 35 state 5
>> isp0: Chan 0 PortID 0x0102ae already at 512 handle 34 state 5
>> isp0: Chan 0 PortID 0x0102b1 already at 512 handle 33 state 5
>> isp0: Chan 0 PortID 0x0102b2 already at 512 handle 32 state 5
>> isp0: Chan 0 PortID 0x0102b3 already at 512 handle 31 state 5
>>
>> If I plug the HBA directly into a port on the JBOD, I get no such issues;
>> all /dev/da* device nodes are created instantly and everything just seems
>> to work. (I presume HBA->JBOD would result in an FC-AL topology rather
>> than an F-port topology, which I figure translates into a different code
>> path in the isp driver.)
>>
>> When I enabled zoning and configured a HBA to only see a handful of disks,
>> everything behaved properly.
>> I'm going to update to head and get a debug
>> build going with all the ISP debug bells and whistles; I'll report back on
>> HBA->fabric behaviour when zoning is off after that.
>>
>> (FWIW, I've got two HP-UX (PA-RISC and IA64) boxes with two HBAs also
>> plugged into the fabric; they don't go into fits like FreeBSD/isp seems
>> to when zoning is turned off.)
>>
>
> What kind of switches are you using? This is definitely wrong, and
> probably means that I need to do some work again here.

They're Brocade Silkworms (3000 model?) branded by HP/Compaq as "HP
StorageWorks SAN Switch 2/16". Both switches are running Brocade's Fabric
OS v3.2.1c, which is basically the latest (and last) firmware issued in
the 3.x series. The 4.x firmware/OS only runs on newer hardware.

I got my main boxes to netboot off a head debug build before leaving the
lab, I'll try and replicate the nutty-behaviour-when-unzoned over the next
couple of days and get a bit more info for you. (You're more than welcome
to full access to the network if it helps debugging/testing things.)

    Trent.
From owner-freebsd-scsi@FreeBSD.ORG Thu May 3 17:01:09 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 3D016106564A for ; Thu, 3 May 2012 17:01:09 +0000 (UTC) (envelope-from trent@snakebite.org) Received: from EXHUB04.exchhosting.com (exchla3.liveoffice.com [64.70.67.188]) by mx1.freebsd.org (Postfix) with ESMTP id 19D458FC1C for ; Thu, 3 May 2012 17:01:09 +0000 (UTC) Received: from EXMBX10.exchhosting.com ([fe80::9c37:32f6:a508:a44f]) by EXHUB04.exchhosting.com ([fe80::e08b:16b6:14a0:73b0%12]) with mapi; Thu, 3 May 2012 10:01:09 -0700 From: Trent Nelson To: "mj@feral.com" Date: Thu, 3 May 2012 10:01:06 -0700 Thread-Topic: Missing sysctl options for isp driver Thread-Index: Ac0pTlU/5Xf5CcAaSjuekn5v65MBbg== Message-ID: <837F20D9-3374-4ED8-9C0D-81051B17508C@snakebite.org> References: <20120502203948.GA1141@snakebite.org> <4FA1DF1C.2030706@feral.com> <20120503050956.GA18983@snakebite.org> <4FA298AD.9030706@feral.com> In-Reply-To: <4FA298AD.9030706@feral.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: "freebsd-scsi@freebsd.org" Subject: Re: Missing sysctl options for isp driver X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 03 May 2012 17:01:09 -0000

Yup, I've had that in loader.conf from the beginning. (I've flashed all
the cards to the latest QLogic firmware anyway -- 3.03 or something --
which is identical to the version ispfw loads, FWIW...)
On May 3, 2012, at 10:39 AM, Matthew Jacob wrote:

Oh, one thing to always check is to make sure you're using loaded firmware
for isp rather than resident random BIOS firmware.

Make sure you have

ispfw_load="YES"

in your /boot/loader.conf

You can also, if you just have 2300's, do

isp_2300_load="YES"

instead.

From owner-freebsd-scsi@FreeBSD.ORG Thu May 3 17:08:33 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DC99D1065678 for ; Thu, 3 May 2012 17:08:33 +0000 (UTC) (envelope-from trent@snakebite.org) Received: from exchange.liveoffice.com (exchla3.liveoffice.com [64.70.67.188]) by mx1.freebsd.org (Postfix) with ESMTP id BCD9D8FC1D for ; Thu, 3 May 2012 17:08:33 +0000 (UTC) Received: from exhub13.exchhosting.com (192.168.11.122) by exhub14.exchhosting.com (192.168.11.123) with Microsoft SMTP Server (TLS) id 8.3.213.0; Thu, 3 May 2012 10:08:33 -0700 Received: from EXMBX10.exchhosting.com ([fe80::9c37:32f6:a508:a44f]) by exhub13.exchhosting.com ([::1]) with mapi; Thu, 3 May 2012 10:08:32 -0700 From: Trent Nelson To: "mj@feral.com" Date: Thu, 3 May 2012 10:08:29 -0700 Thread-Topic: Programmatically getting a FC drive's WWNN, WWPN and PortID Thread-Index: Ac0pT15LK+VcCrvqRPCCw2pi15tnNA== Message-ID: References: <6C58B329-2DFC-4F2A-9FE1-F47B142748FD@snakebite.org> <4FA28A25.1010004@feral.com> In-Reply-To: <4FA28A25.1010004@feral.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Cc: "freebsd-scsi@freebsd.org" Subject: Re: Programmatically getting a FC drive's WWNN, WWPN and PortID X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date:
Thu, 03 May 2012 17:08:33 -0000

On May 3, 2012, at 9:37 AM, Matthew Jacob wrote:
>
> See http://people.freebsd.org/~mjacob/isp_tools.tgz
>

Hot dang, what a goldmine. Thanks!

    Trent.

From owner-freebsd-scsi@FreeBSD.ORG Fri May 4 05:29:20 2012 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D9E841065670 for ; Fri, 4 May 2012 05:29:20 +0000 (UTC) (envelope-from spork@bway.net) Received: from xena.bway.net (xena.bway.net [216.220.96.26]) by mx1.freebsd.org (Postfix) with ESMTP id 772DC8FC0C for ; Fri, 4 May 2012 05:29:20 +0000 (UTC) Received: (qmail 64828 invoked by uid 0); 4 May 2012 05:29:14 -0000 Received: from smtp.bway.net (216.220.96.25) by xena.bway.net with ESMTPS (DHE-RSA-AES256-SHA encrypted); 4 May 2012 05:29:14 -0000 Received: (qmail 64824 invoked by uid 90); 4 May 2012 05:29:13 -0000 Received: from unknown (HELO ?10.3.2.41?) (spork@96.57.144.66) by smtp.bway.net with ESMTPA; 4 May 2012 05:29:13 -0000 From: Charles Sprickman Content-Type: text/plain; charset=us-ascii Message-Id: Date: Fri, 4 May 2012 01:29:13 -0400 To: freebsd-scsi@freebsd.org Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Apple Message framework v1084) X-Mailer: Apple Mail (2.1084) Subject: mfi and "copy out failed" messages X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 04 May 2012 05:29:20 -0000

I'm wondering if anyone has some interest in this issue. I think I
recently tracked down a long-standing fs corruption and panic issue on a
Dell 2970 that I was never able to solve:

http://lists.freebsd.org/pipermail/freebsd-fs/2010-July/008858.html
(there are other threads, but that's the gist of the issue)

I'd read in various threads that the "mfiX: Copy out failed" was a
harmless message.
But recently I started thinking that there had to be
some relation between those messages and the panics. The timing fits -
I had megacli performing a status check on the controller in a periodic
script that kicked off with the daily run. Most of my panics were
during or shortly after the daily run. The "Copy out failed" messages
always corresponded to megacli being run.

132 days ago I removed the daily megacli check and the box has not had a
kernel panic since then. Previous to this my longest uptime was not
more than a few months. While this is by no means 100% definitive, it
sure seems like something is going on here. My best guess is that
megacli and/or the mfi driver are interacting in a bad way and that the
"Copy out failed" message is indicating something did not hit the
controller that should have. My earlier assumption was that it was just
some control message megacli was sending that didn't make it, but now
I'm thinking it's some request to write actual data to the drive that's
failing.

As a reminder, the card in question is:

mfi0: port 0xec00-0xecff mem 0xe9f80000-0xe9fbffff,0xe9fc0000-0xe9ffffff irq 37 at device 0.0 on pci7
mfi0: 3049 (boot + 3s/0x0020/info) - Firmware version 1.22.02-0612
mfi0: 3051 (boot + 23s/0x0020/info) - Controller hardware revision ID (0x0)
mfi0: 3052 (boot + 23s/0x0020/info) - Package version 6.2.0-0013

If anyone with knowledge of the mfi driver would like to comment, I'd
very much appreciate it. This box is going to be repurposed in the
coming months as an ESXi host to hold some backup/standby VMs, but
before that I would not mind taking some time to test any patches, extra
debug printfs in mfi, etc. I suspect I can probably trigger the panic
pretty easily by mimicking the daily run conditions - just kick off a
find from "/" and then repeatedly loop the megacli command to check the
array health.

The box is still on 7.3, but I'd gladly upgrade to 8.3 and test there if
needed once the box is freed up.

Thanks,

Charles

--
Charles Sprickman
NetEng/SysAdmin
Bway.net - New York's Best Internet www.bway.net
spork@bway.net - 212.655.9344
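
The reproduction recipe described above (a `find` from "/" running while the controller status check loops) could be scripted roughly as follows. This is a sketch and not something taken from the thread: `run_repro` and its parameters are hypothetical names, and the exact megacli invocation is deliberately left to the caller because the message does not give it. On an affected box the whole point is to provoke the panic, so it only makes sense on a machine that can afford to crash.

```python
import subprocess


def run_repro(status_cmd, find_root="/", max_checks=1000):
    """Walk the filesystem with find(1) while repeatedly running status_cmd.

    status_cmd is an argv list for the controller status check (whatever
    the periodic script ran). Returns the number of status checks that
    completed before find exited or max_checks was reached.
    """
    if not status_cmd:
        raise ValueError("status_cmd must be a non-empty argv list")

    # Background filesystem walk, mimicking the load of the daily run.
    walker = subprocess.Popen(
        ["find", find_root],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    checks = 0
    try:
        while walker.poll() is None and checks < max_checks:
            # Exit status is ignored on purpose: the interesting signal is
            # in the kernel log ("Copy out failed") or a panic.
            subprocess.run(
                status_cmd,
                stdout=subprocess.DEVNULL,
                stderr=subprocess.DEVNULL,
                check=False,
            )
            checks += 1
    finally:
        # Do not leave the walk running if the loop ends first.
        if walker.poll() is None:
            walker.terminate()
        walker.wait()
    return checks
```

Watching the console or `dmesg` in another terminal while this runs would show whether each status check still coincides with a "Copy out failed" message.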