Date: Wed, 15 Sep 2021 11:23:03 -0400 From: mike tancsa <mike@sentex.net> To: Alexander Motin <mav@FreeBSD.org>, FreeBSD-STABLE Mailing List <freebsd-stable@freebsd.org> Subject: Re: ipmi0: Watchdog set returned 0xc0 (releng_13) Message-ID: <b5d6dbbc-92b8-52db-0aab-d2aa04288f8d@sentex.net> In-Reply-To: <b6f97a7d-73ce-c3d3-ee56-5a7ae5bfed05@FreeBSD.org> References: <c065ff32-719f-ef2e-85b9-d5504a36810f@sentex.net> <51e58d7a-1b33-f7e3-13aa-3be5fee5c826@FreeBSD.org> <2c1ad9db-b92a-d0a8-b2ae-b3ffafc04604@sentex.net> <fc5f879f-91da-c45d-366c-29c721de948e@FreeBSD.org> <f93369e5-51b6-1a0c-cc14-e16a71be314b@sentex.net> <1f0d32e1-e907-080f-1669-65148c245dd8@sentex.net> <b6f97a7d-73ce-c3d3-ee56-5a7ae5bfed05@FreeBSD.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On 9/14/2021 9:29 PM, Alexander Motin wrote: > Hi Mike, > > Could you try my 6c2d4404161a commit? I don't know about your case, bu= t > it fixes 0xcc error I see on my systems for timeouts below 120 seconds.= Hi Alexander, This is on the Supermicro X11SCH-F.=C2=A0 BMC firmware was version 1.73 (latest version on the website) ipmi0: <IPMI System Interface> port 0xca2,0xca3 on acpi0 ipmi0: KCS mode found at io 0xca2 on acpi ipmi0: IPMI device rev. 1, firmware rev. 1.73, version 2.0, device support mask 0xbf ipmi0: Number of channels 2 ipmi0: Attached watchdog ipmi0: Establishing power cycle handler Its no longer printing the error!=C2=A0 If I start up watchdogd -t 30 and then do a killall -9 watchdogd, it does a graceful shutdown of the box !?!=C2=A0 Thats very cool. Even be= tter than before as a hard reset. But I guess will it do a hard reset if the box is actually live locked ?=C2=A0 I did a quick test to confirm, that i= t does indeed not wait around too=C2=A0 long.=C2=A0 I added an infinite loo= p in /usr/local/etc/rc.d/stop-shutdown.sh and it only fired for 6 seconds before the box hard reset its logged in the BMC log too. # ipmitool sel list =C2=A0=C2=A0 1 | 09/15/2021 | 14:42:04 | Watchdog2 #0xca | Timer interrup= t () | Asserted =C2=A0=C2=A0 2 | 09/15/2021 | 14:42:22 | Watchdog2 #0xca | Power cycle ()= | Asserted I also tried on a X11SSL-F ipmi0: IPMI device rev. 1, firmware rev. 1.60, version 2.0, device support mask 0xbf ipmi0: Number of channels 2 ipmi0: Attached watchdog ipmi0: Establishing power cycle handler =C2=A0# ipmitool sel list | tail -3 =C2=A0=C2=A0 6 | 08/20/2021 | 20:45:38 | Fan #0x45 | Lower Non-recoverabl= e going low=C2=A0 | Asserted =C2=A0=C2=A0 7 | 09/15/2021 | 11:15:28 | Watchdog2 #0xca | Timer interrup= t () | Asserted =C2=A0=C2=A0 8 | 09/15/2021 | 11:15:38 | Watchdog2 #0xca | Power cycle ()= | Asserted # I have a RELENG_12 box in production I will try as well later, but so far so good.=C2=A0 Thanks for fixing! =C2=A0=C2=A0=C2=A0 ---Mike =C2=A0=C2=A0=C2=A0 ---Mike
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?b5d6dbbc-92b8-52db-0aab-d2aa04288f8d>