From owner-freebsd-stable@FreeBSD.ORG Fri Apr 29 08:06:35 2011 Return-Path: Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 73CCE1065672; Fri, 29 Apr 2011 08:06:35 +0000 (UTC) (envelope-from marck@rinet.ru) Received: from woozle.rinet.ru (woozle.rinet.ru [195.54.192.68]) by mx1.freebsd.org (Postfix) with ESMTP id F0C608FC20; Fri, 29 Apr 2011 08:06:34 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by woozle.rinet.ru (8.14.4/8.14.4) with ESMTP id p3T7pLc0029344; Fri, 29 Apr 2011 11:51:21 +0400 (MSD) (envelope-from marck@rinet.ru) Date: Fri, 29 Apr 2011 11:51:21 +0400 (MSD) From: Dmitry Morozovsky To: ken@FreeBSD.org Message-ID: User-Agent: Alpine 2.00 (BSF 1167 2008-08-23) X-NCC-RegID: ru.rinet X-OpenPGP-Key-ID: 6B691B03 MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.2.6 (woozle.rinet.ru [0.0.0.0]); Fri, 29 Apr 2011 11:51:21 +0400 (MSD) Cc: freebsd-stable@FreeBSD.org Subject: mps driver instability under stable/8 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 29 Apr 2011 08:06:35 -0000 Dear Ken, I have SuperMicro Server with mps driver you managed, with 24 SATA disks under SAS x36 expander with large ZFS Sometimes, under random disk load such as daily find, it lost all its devices: [-- MARK -- Fri Apr 29 03:00:00 2011] mps0: IOC Fault 0x40005900, Resetting^M (pass20:mps0:0:22:0): SCSI command timeout on device handle 0x0020 SMID 442^M mps0: IOC Fault 0x40001500, Resetting^M (da19:mps0:0:21:0): SCSI command timeout on device handle 0x001f SMID 172^M (da19:mps0:0:21:0): SCSI command timeout on device handle 0x001f SMID 511^M (da20:mps0:0:20:0): SCSI command timeout on device handle 0x001e SMID 240^M .. (da4:mps0:0:0:0): SCSI command timeout on device handle 0x000a SMID 844^M (da22:mps0:0:23:0): SCSI command timeout on device handle 0x0021 SMID 713^M (da18:mps0:0:22:0): SCSI command timeout on device handle 0x0020 SMID 603^M and hangs there forever (in zio state). I've prepared debugging kernel with DDB and would be glad to help catch the situation. Thanks! -- Sincerely, D.Marck [DM5020, MCK-RIPE, DM3-RIPN] [ FreeBSD committer: marck@FreeBSD.org ] ------------------------------------------------------------------------ *** Dmitry Morozovsky --- D.Marck --- Wild Woozle --- marck@rinet.ru *** ------------------------------------------------------------------------