From: Alexander Motin
Date: Fri, 12 Mar 2010 20:38:31 +0200
To: Kai Gallasch
Cc: freebsd-fs@FreeBSD.org, freebsd-stable@FreeBSD.org, Pawel Jakub Dawidek
Subject: Re: proliant server lockups with freebsd-amd64-stable (2010-03-10)
Message-ID: <4B9A8A27.8050608@FreeBSD.org>
References: <20100311133916.42ba69b0@orwell.free.de> <20100312115028.GG1819@garage.freebsd.pl>
In-Reply-To: <20100312115028.GG1819@garage.freebsd.pl>

Pawel Jakub Dawidek wrote:
> On Thu, Mar 11, 2010 at 01:39:16PM +0100, Kai Gallasch wrote:
>> I have some trouble with an opteron server locking up spontaneously.
>> It loses all network connectivity, and even through the console I can
>> get no shell.
>>
>> Lockups occur mostly under disk load (periodic daily, bacula backup
>> running, make buildworld/buildkernel) and I can provoke them easily.
> [...]
>> 4 0 0 0 LL *cissmtx 0xffffff04ed820c00 [g_down]
> [...]
>> 100046 L *cissmtx 0xffffff04ed820c00 [irq257: ciss0]
> [...]
>
> I was analyzing a similar problem as a potential ZFS bug. It turned out
> to be a bug in ciss(4), and I believe mav@ (CCed) has a fix for that.

That patch of mine has been in 8-STABLE since r204873 of 2010-03-08.
Make sure you have it.

In this case the trap stopped the process in ciss_get_request(), which is
indeed called with the cissmtx lock held. But there is no place to sleep
or loop there, so maybe it was just a coincidence. With the bugs I was
fixing, there was a chance of looping indefinitely between ciss and CAM
under a resource constraint, which increases the chance of catching the
system in such a state.

You may also try looking at what is going on with `top -HS` and
`systat -vm 1`.

-- 
Alexander Motin
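
A note on "make sure you have it": one way to check is to compare the revision of the source tree the kernel was built from against r204873. The sketch below is only an illustration, not part of the original fix. The function names are hypothetical, and it assumes the tree is a Subversion checkout of the 8-STABLE branch and that you pass in a plain revision string such as "r204873" or "204900M" (the kind of string `svnversion` prints). Mixed-revision strings and trees fetched by other means are not handled.

```python
import re

# Revision named in the message above as the point where the ciss(4)
# fix reached 8-STABLE.
FIX_REVISION = 204873


def parse_revision(text):
    """Return the integer revision from a string like 'r204873' or '204900M'.

    Hypothetical helper: accepts an optional leading 'r' and optional
    trailing svnversion flag letters (M, S, P). Anything else is rejected.
    """
    match = re.fullmatch(r"r?(\d+)[MSP]*", text.strip())
    if match is None:
        raise ValueError(f"not a plain revision string: {text!r}")
    return int(match.group(1))


def has_ciss_fix(revision_text, fix_revision=FIX_REVISION):
    """True if the given tree revision is at or after the fix revision.

    Assumes the tree tracks the 8-STABLE branch, so any revision at or
    after the fix includes it.
    """
    return parse_revision(revision_text) >= fix_revision
```

For example, `has_ciss_fix("r204000")` is false, so a tree at that revision would need updating and a kernel rebuild before testing again.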