Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 15 Sep 2021 11:23:03 -0400
From:      mike tancsa <mike@sentex.net>
To:        Alexander Motin <mav@FreeBSD.org>, FreeBSD-STABLE Mailing List <freebsd-stable@freebsd.org>
Subject:   Re: ipmi0: Watchdog set returned 0xc0 (releng_13)
Message-ID:  <b5d6dbbc-92b8-52db-0aab-d2aa04288f8d@sentex.net>
In-Reply-To: <b6f97a7d-73ce-c3d3-ee56-5a7ae5bfed05@FreeBSD.org>
References:  <c065ff32-719f-ef2e-85b9-d5504a36810f@sentex.net> <51e58d7a-1b33-f7e3-13aa-3be5fee5c826@FreeBSD.org> <2c1ad9db-b92a-d0a8-b2ae-b3ffafc04604@sentex.net> <fc5f879f-91da-c45d-366c-29c721de948e@FreeBSD.org> <f93369e5-51b6-1a0c-cc14-e16a71be314b@sentex.net> <1f0d32e1-e907-080f-1669-65148c245dd8@sentex.net> <b6f97a7d-73ce-c3d3-ee56-5a7ae5bfed05@FreeBSD.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On 9/14/2021 9:29 PM, Alexander Motin wrote:
> Hi Mike,
>
> Could you try my 6c2d4404161a commit?  I don't know about your case, bu=
t
> it fixes 0xcc error I see on my systems for timeouts below 120 seconds.=


Hi Alexander,

This is on the Supermicro X11SCH-F.=C2=A0 BMC firmware was version 1.73
(latest version on the website)

ipmi0: <IPMI System Interface> port 0xca2,0xca3 on acpi0
ipmi0: KCS mode found at io 0xca2 on acpi
ipmi0: IPMI device rev. 1, firmware rev. 1.73, version 2.0, device
support mask 0xbf
ipmi0: Number of channels 2
ipmi0: Attached watchdog
ipmi0: Establishing power cycle handler

Its no longer printing the error!=C2=A0

If I start up watchdogd -t 30

and then do a

killall -9 watchdogd,

it does a graceful shutdown of the box !?!=C2=A0 Thats very cool. Even be=
tter
than before as a hard reset. But I guess will it do a hard reset if the
box is actually live locked ?=C2=A0 I did a quick test to confirm, that i=
t
does indeed not wait around too=C2=A0 long.=C2=A0 I added an infinite loo=
p in
/usr/local/etc/rc.d/stop-shutdown.sh and it only fired for 6 seconds
before the box hard reset

its logged in the BMC log too.

# ipmitool sel list
=C2=A0=C2=A0 1 | 09/15/2021 | 14:42:04 | Watchdog2 #0xca | Timer interrup=
t () |
Asserted
=C2=A0=C2=A0 2 | 09/15/2021 | 14:42:22 | Watchdog2 #0xca | Power cycle ()=
 | Asserted



I also tried on a X11SSL-F

ipmi0: IPMI device rev. 1, firmware rev. 1.60, version 2.0, device
support mask 0xbf
ipmi0: Number of channels 2
ipmi0: Attached watchdog
ipmi0: Establishing power cycle handler

=C2=A0# ipmitool sel list | tail -3
=C2=A0=C2=A0 6 | 08/20/2021 | 20:45:38 | Fan #0x45 | Lower Non-recoverabl=
e going
low=C2=A0 | Asserted
=C2=A0=C2=A0 7 | 09/15/2021 | 11:15:28 | Watchdog2 #0xca | Timer interrup=
t () |
Asserted
=C2=A0=C2=A0 8 | 09/15/2021 | 11:15:38 | Watchdog2 #0xca | Power cycle ()=
 | Asserted
#

I have a RELENG_12 box in production I will try as well later, but so
far so good.=C2=A0 Thanks for fixing!

=C2=A0=C2=A0=C2=A0 ---Mike



=C2=A0=C2=A0=C2=A0 ---Mike





Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?b5d6dbbc-92b8-52db-0aab-d2aa04288f8d>