From: Karl Pielorz
To: freebsd-scsi@freebsd.org
Date: Mon, 08 May 2006 09:47:11 +0100
Subject: 6.1RC-2 ciss Driver hangs on Rebuild for Internal Drives with external MSA20 attached
Message-ID: <39E0E0A8ACF54E31B690E879@unsupported>

Hi All,

We've recently added an MSA20 external SATA enclosure to our HP ProLiant DL380 server. While testing, we found the following problem:

If you fail an internal RAID array, then when the system starts rebuilding it, any disk access to ciss0 will 'hang', killing the server. The rebuild does complete OK (I guess that's the controller continuing to do it in the background), but the machine never recovers from the hang.

If you remove the MSA20, internal drives can be failed and will rebuild fine.

Additionally, if you fail a drive in the MSA20 when it's attached, it will fail and rebuild perfectly OK (no hangs).

I've also noticed the following appears in the log when the machine is about to hang:

  ciss0: ** Hot-plug drive inserted: SCSI port 2 ID 5
  ciss0: ** State change, logical drive 2
  [server hangs at this point - after ~30 sec you get...]
  ciss0: error sending 195 LUN command (35)   <---- Presumably not good :)
  ciss0: Warning, cannot get physical lun list
  ciss0: logical drive 2 (da2) changed status interim recovery->ready for recovery, spare status 0x0

Any suggestions? We've checked that the firmware on both the server and the MSA20 is the latest. The cable is the HP-supplied one.

Thanks,

-Karl

You can find a full verbose boot for this machine, admittedly under 6.1RC1 (which had the same problem), at:

http://www.tdx.com/verbose_6.1rc1.txt