From owner-freebsd-stable@FreeBSD.ORG Fri Jul 27 15:00:08 2012 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 6F1DC1065673; Fri, 27 Jul 2012 15:00:08 +0000 (UTC) (envelope-from asmrookie@gmail.com) Received: from mail-lpp01m010-f54.google.com (mail-lpp01m010-f54.google.com [209.85.215.54]) by mx1.freebsd.org (Postfix) with ESMTP id 353048FC0A; Fri, 27 Jul 2012 15:00:07 +0000 (UTC) Received: by laai10 with SMTP id i10so2641038laa.13 for ; Fri, 27 Jul 2012 08:00:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:reply-to:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=SF/qyQ7XOXmSMxI4/2/nNcRGZhDd6mY+q24QG8EhS4c=; b=Ni9cb9X84JMXxqm3ENyOotrDOFZTDaJ8n3r+4hmF16gDlaOMmshMBQJyDAwSt4ZCxa sPOQa5dX2jVynwoDXVUIickRr1IEnB+/ZqPoMGvLyMZbBp+57NJuA7+B7nvC56YyoIEc olW3udqTc3yLzM3Ho6rRFzk8FafLtHuRuS/jQqpJfmpDZ1OofRWiJ2x5GgHwZzqQgeVZ qHVWnGoHvLoQVhnF3zKBiSq0wsVQwrd3olax72ArVw27+XBwJvlSpP1GaBmAdUkHP2v2 tjckhaUtQRILummlZw7V2jolW0k+O4R257t/58rI5Hs3i2kyzmoq4w2omYqMiGYqZBFk GWag== MIME-Version: 1.0 Received: by 10.152.136.18 with SMTP id pw18mr2894970lab.17.1343401205938; Fri, 27 Jul 2012 08:00:05 -0700 (PDT) Sender: asmrookie@gmail.com Received: by 10.112.27.65 with HTTP; Fri, 27 Jul 2012 08:00:05 -0700 (PDT) In-Reply-To: <4ECC422A-F7A8-4F6C-9E9D-01080927C36D@averesystems.com> References: <1343350238.12294.10.camel@powernoodle.corp.yahoo.com> <23294764-F30B-4732-8C41-3F0ECA5F273C@averesystems.com> <4ECC422A-F7A8-4F6C-9E9D-01080927C36D@averesystems.com> Date: Fri, 27 Jul 2012 16:00:05 +0100 X-Google-Sender-Auth: IJ60C-5I0ziNroZ1-p70vlfJDaI Message-ID: From: Attilio Rao To: Andrew Boyer Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Cc: Ryan Stone , FreeBSD Stable Mailing List , John Baldwin , Andriy Gapon Subject: Re: IPMI hardware watchdogs Re: dell r420/r320 stable/9 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: attilio@FreeBSD.org List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 27 Jul 2012 15:00:08 -0000 On Fri, Jul 27, 2012 at 3:55 PM, Andrew Boyer wro= te: > > On Jul 27, 2012, at 10:42 AM, Attilio Rao wrote: > >> On Fri, Jul 27, 2012 at 3:33 PM, Andrew Boyer = wrote: >>> >>> On Jul 26, 2012, at 8:50 PM, Sean Bruno wrote: >>> >>>> For the time being I had to revert the following from my stable/9 tree= . >>>> Otherwise I would get a kernel panic on shutdown from ipmi(4). >>>> >>>> http://svnweb.freebsd.org/base?view=3Drevision&revision=3D237839 >>>> http://svnweb.freebsd.org/base?view=3Drevision&revision=3D221121 >>>> >>> >>> On a somewhat related note: We noticed recently that you can't pet or d= isable the IPMI hardware watchdog once SCHEDULER_STOPPED() is true. This m= eans it can fire unexpectedly while you're dumping core or rebooting, depen= ding on how long the timeout was on the pet before the panic. The ipmi dri= ver will need to process the command differently if the scheduler is stoppe= d. I haven't had time to look at a fix yet. >> >> I recall I fixed that internally for SV, but the key here is that we >> need to find an unified (or a default policy). >> More specifically, do we want the watchdog also covers the kernel dump >> part (because of possible deadlocks when dumping). If the answer is >> yes, we likely need pat the watchdog from within the dumping cycle >> itself. If the answer is no, then we can just disable it when entering >> the panic path. But anyway, we need to identify a default policy that >> makes sense first. >> >> Attilio >> > > For our use case, we need the system to reset if the dump hangs. This means we might likely go to control by hand the watchdog patting in the panic path and more specifically I guess this reduces to patting the watching from within the dumping cycle (there could be other expensive points we can consider but nothing that pop off my head right now). Maybe Ryan can share with us if SV can contribute the code back about that specific part. Attilio --=20 Peace can only be achieved by understanding - A. Einstein