Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 20 Jun 2022 19:59:52 -0500
From:      Larry Rosenman <ler@lerctr.org>
To:        Ultima <ultima1252@gmail.com>
Cc:        Freebsd current <freebsd-current@freebsd.org>
Subject:   Re: MCE: Does this look possibly like a slot issue?
Message-ID:  <983660c80cc6717e3b49821f7957ee80@lerctr.org>
In-Reply-To: <CANJ8om7_=O95K3MOPZU8KgYWQAUkJTuTeyQRrgRr_S-=BHkN3A@mail.gmail.com>
References:  <c9d183a8a8083056a08946321694b70d@lerctr.org> <CANJ8om774CyUB4VBdAztEhipPFDW1PAMZQsXbk8%2Boro-3Tg8gA@mail.gmail.com> <c29f59fbb209874549f5f68efd14a3c2@lerctr.org> <CANJ8om7t339RDA0JtoeNT1SfBdgkGM4-bn6JZXG0ZqOTgt82SA@mail.gmail.com> <49c1dfe9bbe9912282b0f0339a0c077b@lerctr.org> <CANJ8om7_=O95K3MOPZU8KgYWQAUkJTuTeyQRrgRr_S-=BHkN3A@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
--=_2a5f4419d9ada4d3f970c5ed9bd02eac
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=US-ASCII;
 format=flowed



SuperMicro X8DTN+

2 Processors, 6-core/12-Thread. CPU: Intel(R) Xeon(R) CPU           
E5645  @ 2.40GHz (2400.20-MHz K8-class CPU)

I'll bring it down and swap DIMMS around

On 06/20/2022 7:57 pm, Ultima wrote:

> Hey Larry,
> 
> One red flag I am seeing is that the error is being produced on
> the same CPU/bank with each error you have provided so far.
> 
> Can you try and follow my original recommendation and swap
> currently installed DIMM with the problem DIMM slot and see
> if anything changes?
> 
> Can you also provide the motherboard model? Also, do you
> have multiple CPUs installed in this system?
> 
> Best regards,
> Richard Gallamore
> 
> On Mon, Jun 20, 2022 at 5:41 PM Larry Rosenman <ler@lerctr.org> wrote:
> 
> Yes and Yes.
> 
> On 06/20/2022 7:37 pm, Ultima wrote:
> 
> Are you sure that the module you replaced it with was good?
> Are you sure you replaced the correct module?
> 
> Best regards,
> Richard Gallamore
> 
> On Mon, Jun 20, 2022 at 5:23 PM Larry Rosenman <ler@lerctr.org> wrote:
> 
> I'm seeing them constantly:
> 
> root@freenas[~]# mcelog --dmi
> Hardware event. This is not a software error.
> MCE 0
> CPU 22 BANK 8 TSC 20aab486464a
> MISC ac29890200046444 ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 44
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> WARNING: SMBIOS data is often unreliable. Take with a grain of salt!
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> Hardware event. This is not a software error.
> MCE 1
> CPU 22 BANK 8 TSC 296dfcc82582
> MISC ac29890200041381 ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 81
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> Hardware event. This is not a software error.
> MCE 2
> CPU 22 BANK 8 TSC 2a5604a6a070
> MISC ac29890200044281
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory ECC error occurred during scrub
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 81
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 88000040000200cf MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> Hardware event. This is not a software error.
> MCE 3
> CPU 22 BANK 8 TSC 31e141418eb8
> MISC ac29890200046a4a ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 4a
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> Hardware event. This is not a software error.
> MCE 4
> CPU 22 BANK 8 TSC 3a014afee106
> MISC ac29890200046646 ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 46
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> Hardware event. This is not a software error.
> MCE 5
> CPU 22 BANK 8 TSC 41d1dbef1a6a
> MISC ac29890200046141 ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 41
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> Hardware event. This is not a software error.
> MCE 6
> CPU 22 BANK 8 TSC 4a1b1ecef446
> MISC ac29890200046a4a ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 4a
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> Hardware event. This is not a software error.
> MCE 7
> CPU 22 BANK 8 TSC 527bc27db776
> MISC ac29890200040386 ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 86
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> Hardware event. This is not a software error.
> MCE 8
> CPU 22 BANK 8 TSC 5aa4ecdd795a
> MISC ac29890200046646 ADDR ee2f6e800
> TIME 1655770989 Mon Jun 20 19:23:09 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 46
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> root@freenas[~]#
> 
> and I replaced the DIMM yesterday :(
> 
> On 06/20/2022 7:19 pm, Ultima wrote:
> 
> Hey Larry,
> 
> It is possible it's the motherboard itself, but it's rare. The way I
> would determine this is to swap the DIMM module with another
> populated slot on the motherboard and see if the error migrated
> to the new slot or not. Also, this error doesn't necessarily mean
> there is a problem that needs to be addressed. If you have been
> running the system for many months and you see ECC errors a
> handful of times, it can probably be safely ignored.
> 
> Best regards,
> Richard Gallamore
> 
> On Mon, Jun 20, 2022 at 3:14 PM Larry Rosenman <ler@lerctr.org> wrote: 
> I've gotten a BUNCH of these on my TrueNAS server.  I've replaced this
> DIMM a couple of times, and still the MCE's continue.
> Is it possible it's Motherboard slot issue?
> 
> Hardware event. This is not a software error.
> MCE 8
> CPU 22 BANK 8 TSC 5aa4ecdd795a
> MISC ac29890200046646 ADDR ee2f6e800
> TIME 1655762472 Mon Jun 20 17:01:12 2022
> MCG status:
> Memory read ECC error
> Memory corrected error count (CORE_ERR_CNT): 1
> Memory transaction Tracker ID (RTId): 46
> Memory DIMM ID of error: 0
> Memory channel ID of error: 1
> Memory ECC syndrome: ac298902
> STATUS 8c0000400001009f MCGSTATUS 0
> MCGCAP 1c09 APICID 34 SOCKETID 0
> CPUID Vendor Intel Family 6 Model 44 Step 2
> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
> Device Locator: P2-DIMM2C
> Bank Locator: BANK14
> Manufacturer: Hyundai
> Serial Number: 40F3C20F
> Asset Tag:
> Part Number: HMT151R7BFR4C-H9
> 
> --
> Larry Rosenman                     http://www.lerctr.org/~ler
> Phone: +1 214-642-9640                 E-Mail: ler@lerctr.org
> US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106

-- 
Larry Rosenman                     http://www.lerctr.org/~ler
Phone: +1 214-642-9640                 E-Mail: ler@lerctr.org
US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106

-- 
Larry Rosenman                     http://www.lerctr.org/~ler
Phone: +1 214-642-9640                 E-Mail: ler@lerctr.org
US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106

-- 
Larry Rosenman                     http://www.lerctr.org/~ler
Phone: +1 214-642-9640                 E-Mail: ler@lerctr.org
US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106
--=_2a5f4419d9ada4d3f970c5ed9bd02eac
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html; charset=UTF-8

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html; charset=
=3DUTF-8" /></head><body style=3D'font-size: 10pt; font-family: Arial,Helve=
tica,sans-serif'>
<p>SuperMicro X8DTN+</p>
<p>2 Processors, 6-core/12-Thread. CPU: Intel(R) Xeon(R) CPU &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; E5645 &nbsp;@ 2.40GHz (2400.20-MHz K8-class CPU)</p>
<p><br /></p>
<p>I'll bring it down and swap DIMMS around</p>
<p><br /></p>
<p id=3D"reply-intro">On 06/20/2022 7:57 pm, Ultima wrote:</p>
<blockquote type=3D"cite" style=3D"padding: 0 0.4em; border-left: #1010ff 2=
px solid; margin: 0">
<div id=3D"replybody1">
<div dir=3D"ltr">
<div>Hey Larry,</div>
<div>&nbsp;</div>
<div>One red flag I am seeing is that the error is being produced on</div>
<div>the same CPU/bank with each error you have provided so far.</div>
<div>&nbsp;</div>
<div>Can you try and follow my original recommendation and swap</div>
<div>currently installed DIMM with the problem DIMM slot and see</div>
<div>if anything changes?</div>
<div>&nbsp;</div>
<div>Can you also provide the motherboard model? Also, do you</div>
<div>have multiple CPUs installed in this system?</div>
<div>&nbsp;</div>
<div>Best regards,</div>
<div>Richard Gallamore</div>
<div>
<div>&nbsp;</div>
</div>
</div>
<br />
<div class=3D"v1gmail_quote">
<div class=3D"v1gmail_attr" dir=3D"ltr">On Mon, Jun 20, 2022 at 5:41 PM Lar=
ry Rosenman &lt;<a href=3D"mailto:ler@lerctr.org" rel=3D"noreferrer">ler@le=
rctr.org</a>&gt; wrote:</div>
<blockquote class=3D"v1gmail_quote" style=3D"margin: 0px 0px 0px 0.8ex; bor=
der-left: 1px solid #cccccc; padding-left: 1ex;">
<div style=3D"font-size: 10pt; font-family: Arial,Helvetica,sans-serif;">
<p>Yes and Yes.</p>
<p><br /></p>
<p id=3D"v1gmail-m_3316908189451833722reply-intro">On 06/20/2022 7:37 pm, U=
ltima wrote:</p>
<blockquote style=3D"padding: 0px 0.4em; border-left: 2px solid #1010ff; ma=
rgin: 0px;">
<div id=3D"v1gmail-m_3316908189451833722replybody1">
<div dir=3D"ltr">
<div>Are you sure that the module you replaced it with was good?</div>
<div>Are you sure you replaced the correct module?</div>
<div>&nbsp;</div>
<div>Best regards,</div>
<div>Richard Gallamore</div>
</div>
<br />
<div>
<div dir=3D"ltr">On Mon, Jun 20, 2022 at 5:23 PM Larry Rosenman &lt;<a href=
=3D"mailto:ler@lerctr.org" rel=3D"noreferrer">ler@lerctr.org</a>&gt; wrote:=
</div>
<blockquote style=3D"margin: 0px 0px 0px 0.8ex; border-left: 1px solid #ccc=
ccc; padding-left: 1ex;">
<div style=3D"font-size: 10pt; font-family: Arial,Helvetica,sans-serif;">
<p>I'm seeing them constantly:</p>
<p>root@freenas[~]# mcelog --dmi<br />Hardware event. This is not a softwar=
e error.<br />MCE 0<br />CPU 22 BANK 8 TSC 20aab486464a<br />MISC ac2989020=
0046444 ADDR ee2f6e800<br />TIME 1655770989 Mon Jun 20 19:23:09 2022<br />M=
CG status:<br />Memory read ECC error<br />Memory corrected error count (CO=
RE_ERR_CNT): 1<br />Memory transaction Tracker ID (RTId): 44<br />Memory DI=
MM ID of error: 0<br />Memory channel ID of error: 1<br />Memory ECC syndro=
me: ac298902<br />STATUS 8c0000400001009f MCGSTATUS 0<br />MCGCAP 1c09 APIC=
ID 34 SOCKETID 0<br />CPUID Vendor Intel Family 6 Model 44 Step 2<br />WARN=
ING: SMBIOS data is often unreliable. Take with a grain of salt!<br />DDR3 =
DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB<br />Device Locator: P2=
-DIMM2C<br />Bank Locator: BANK14<br />Manufacturer: Hyundai<br />Serial Nu=
mber: 40F3C20F<br />Asset Tag:<br />Part Number: HMT151R7BFR4C-H9<br />Hard=
ware event. This is not a software error.<br />MCE 1<br />CPU 22 BANK 8 TSC=
 296dfcc82582<br />MISC ac29890200041381 ADDR ee2f6e800<br />TIME 165577098=
9 Mon Jun 20 19:23:09 2022<br />MCG status:<br />Memory read ECC error<br /=
>Memory corrected error count (CORE_ERR_CNT): 1<br />Memory transaction Tra=
cker ID (RTId): 81<br />Memory DIMM ID of error: 0<br />Memory channel ID o=
f error: 1<br />Memory ECC syndrome: ac298902<br />STATUS 8c0000400001009f =
MCGSTATUS 0<br />MCGCAP 1c09 APICID 34 SOCKETID 0<br />CPUID Vendor Intel F=
amily 6 Model 44 Step 2<br />DDR3 DIMM 800 Mhz Other Width 72 Data Width 64=
 Size 4 GB<br />Device Locator: P2-DIMM2C<br />Bank Locator: BANK14<br />Ma=
nufacturer: Hyundai<br />Serial Number: 40F3C20F<br />Asset Tag:<br />Part =
Number: HMT151R7BFR4C-H9<br />Hardware event. This is not a software error.=
<br />MCE 2<br />CPU 22 BANK 8 TSC 2a5604a6a070<br />MISC ac29890200044281<=
br />TIME 1655770989 Mon Jun 20 19:23:09 2022<br />MCG status:<br />Memory =
ECC error occurred during scrub<br />Memory corrected error count (CORE_ERR=
_CNT): 1<br />Memory transaction Tracker ID (RTId): 81<br />Memory DIMM ID =
of error: 0<br />Memory channel ID of error: 1<br />Memory ECC syndrome: ac=
298902<br />STATUS 88000040000200cf MCGSTATUS 0<br />MCGCAP 1c09 APICID 34 =
SOCKETID 0<br />CPUID Vendor Intel Family 6 Model 44 Step 2<br />Hardware e=
vent. This is not a software error.<br />MCE 3<br />CPU 22 BANK 8 TSC 31e14=
1418eb8<br />MISC ac29890200046a4a ADDR ee2f6e800<br />TIME 1655770989 Mon =
Jun 20 19:23:09 2022<br />MCG status:<br />Memory read ECC error<br />Memor=
y corrected error count (CORE_ERR_CNT): 1<br />Memory transaction Tracker I=
D (RTId): 4a<br />Memory DIMM ID of error: 0<br />Memory channel ID of erro=
r: 1<br />Memory ECC syndrome: ac298902<br />STATUS 8c0000400001009f MCGSTA=
TUS 0<br />MCGCAP 1c09 APICID 34 SOCKETID 0<br />CPUID Vendor Intel Family =
6 Model 44 Step 2<br />DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size =
4 GB<br />Device Locator: P2-DIMM2C<br />Bank Locator: BANK14<br />Manufact=
urer: Hyundai<br />Serial Number: 40F3C20F<br />Asset Tag:<br />Part Number=
: HMT151R7BFR4C-H9<br />Hardware event. This is not a software error.<br />=
MCE 4<br />CPU 22 BANK 8 TSC 3a014afee106<br />MISC ac29890200046646 ADDR e=
e2f6e800<br />TIME 1655770989 Mon Jun 20 19:23:09 2022<br />MCG status:<br =
/>Memory read ECC error<br />Memory corrected error count (CORE_ERR_CNT): 1=
<br />Memory transaction Tracker ID (RTId): 46<br />Memory DIMM ID of error=
: 0<br />Memory channel ID of error: 1<br />Memory ECC syndrome: ac298902<b=
r />STATUS 8c0000400001009f MCGSTATUS 0<br />MCGCAP 1c09 APICID 34 SOCKETID=
 0<br />CPUID Vendor Intel Family 6 Model 44 Step 2<br />DDR3 DIMM 800 Mhz =
Other Width 72 Data Width 64 Size 4 GB<br />Device Locator: P2-DIMM2C<br />=
Bank Locator: BANK14<br />Manufacturer: Hyundai<br />Serial Number: 40F3C20=
F<br />Asset Tag:<br />Part Number: HMT151R7BFR4C-H9<br />Hardware event. T=
his is not a software error.<br />MCE 5<br />CPU 22 BANK 8 TSC 41d1dbef1a6a=
<br />MISC ac29890200046141 ADDR ee2f6e800<br />TIME 1655770989 Mon Jun 20 =
19:23:09 2022<br />MCG status:<br />Memory read ECC error<br />Memory corre=
cted error count (CORE_ERR_CNT): 1<br />Memory transaction Tracker ID (RTId=
): 41<br />Memory DIMM ID of error: 0<br />Memory channel ID of error: 1<br=
 />Memory ECC syndrome: ac298902<br />STATUS 8c0000400001009f MCGSTATUS 0<b=
r />MCGCAP 1c09 APICID 34 SOCKETID 0<br />CPUID Vendor Intel Family 6 Model=
 44 Step 2<br />DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB<br=
 />Device Locator: P2-DIMM2C<br />Bank Locator: BANK14<br />Manufacturer: H=
yundai<br />Serial Number: 40F3C20F<br />Asset Tag:<br />Part Number: HMT15=
1R7BFR4C-H9<br />Hardware event. This is not a software error.<br />MCE 6<b=
r />CPU 22 BANK 8 TSC 4a1b1ecef446<br />MISC ac29890200046a4a ADDR ee2f6e80=
0<br />TIME 1655770989 Mon Jun 20 19:23:09 2022<br />MCG status:<br />Memor=
y read ECC error<br />Memory corrected error count (CORE_ERR_CNT): 1<br />M=
emory transaction Tracker ID (RTId): 4a<br />Memory DIMM ID of error: 0<br =
/>Memory channel ID of error: 1<br />Memory ECC syndrome: ac298902<br />STA=
TUS 8c0000400001009f MCGSTATUS 0<br />MCGCAP 1c09 APICID 34 SOCKETID 0<br /=
>CPUID Vendor Intel Family 6 Model 44 Step 2<br />DDR3 DIMM 800 Mhz Other W=
idth 72 Data Width 64 Size 4 GB<br />Device Locator: P2-DIMM2C<br />Bank Lo=
cator: BANK14<br />Manufacturer: Hyundai<br />Serial Number: 40F3C20F<br />=
Asset Tag:<br />Part Number: HMT151R7BFR4C-H9<br />Hardware event. This is =
not a software error.<br />MCE 7<br />CPU 22 BANK 8 TSC 527bc27db776<br />M=
ISC ac29890200040386 ADDR ee2f6e800<br />TIME 1655770989 Mon Jun 20 19:23:0=
9 2022<br />MCG status:<br />Memory read ECC error<br />Memory corrected er=
ror count (CORE_ERR_CNT): 1<br />Memory transaction Tracker ID (RTId): 86<b=
r />Memory DIMM ID of error: 0<br />Memory channel ID of error: 1<br />Memo=
ry ECC syndrome: ac298902<br />STATUS 8c0000400001009f MCGSTATUS 0<br />MCG=
CAP 1c09 APICID 34 SOCKETID 0<br />CPUID Vendor Intel Family 6 Model 44 Ste=
p 2<br />DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB<br />Devi=
ce Locator: P2-DIMM2C<br />Bank Locator: BANK14<br />Manufacturer: Hyundai<=
br />Serial Number: 40F3C20F<br />Asset Tag:<br />Part Number: HMT151R7BFR4=
C-H9<br />Hardware event. This is not a software error.<br />MCE 8<br />CPU=
 22 BANK 8 TSC 5aa4ecdd795a<br />MISC ac29890200046646 ADDR ee2f6e800<br />=
TIME 1655770989 Mon Jun 20 19:23:09 2022<br />MCG status:<br />Memory read =
ECC error<br />Memory corrected error count (CORE_ERR_CNT): 1<br />Memory t=
ransaction Tracker ID (RTId): 46<br />Memory DIMM ID of error: 0<br />Memor=
y channel ID of error: 1<br />Memory ECC syndrome: ac298902<br />STATUS 8c0=
000400001009f MCGSTATUS 0<br />MCGCAP 1c09 APICID 34 SOCKETID 0<br />CPUID =
Vendor Intel Family 6 Model 44 Step 2<br />DDR3 DIMM 800 Mhz Other Width 72=
 Data Width 64 Size 4 GB<br />Device Locator: P2-DIMM2C<br />Bank Locator: =
BANK14<br />Manufacturer: Hyundai<br />Serial Number: 40F3C20F<br />Asset T=
ag:<br />Part Number: HMT151R7BFR4C-H9<br /><a href=3D"#v1m_331690818945183=
3722_NOP">root@freenas[~]#</a></p>
<p><br /></p>
<p>and I replaced the DIMM yesterday :(&nbsp;</p>
<p><br /></p>
<p><br /></p>
<p id=3D"v1gmail-m_3316908189451833722v1gmail-m_2643139469111892975reply-in=
tro">On 06/20/2022 7:19 pm, Ultima wrote:</p>
<blockquote style=3D"padding: 0px 0.4em; border-left: 2px solid #1010ff; ma=
rgin: 0px;">
<div id=3D"v1gmail-m_3316908189451833722v1gmail-m_2643139469111892975replyb=
ody1">
<div dir=3D"ltr">
<div>Hey Larry,</div>
<div>&nbsp;</div>
<div>&nbsp;It is possible it's the motherboard itself, but it's rare. The w=
ay I</div>
<div>would determine this is to swap the DIMM module with another</div>
<div>populated slot on the motherboard and see if the error migrated</div>
<div>to the new slot or not. Also, this error doesn't necessarily mean</div>
<div>there is a problem that needs to be addressed. If you have been</div>
<div>running the system for many months and you see ECC errors a</div>
<div>handful of times, it can probably be safely ignored.</div>
<div>&nbsp;</div>
<div>Best regards,</div>
<div>Richard Gallamore</div>
</div>
<br />
<div>
<div dir=3D"ltr">On Mon, Jun 20, 2022 at 3:14 PM Larry Rosenman &lt;<a href=
=3D"mailto:ler@lerctr.org" rel=3D"noreferrer">ler@lerctr.org</a>&gt; wrote:=
</div>
<blockquote style=3D"margin: 0px 0px 0px 0.8ex; border-left: 1px solid #ccc=
ccc; padding-left: 1ex;">I've gotten a BUNCH of these on my TrueNAS server.=
&nbsp; I've replaced this <br />DIMM a couple of times, and still the MCE's=
 continue.<br />Is it possible it's Motherboard slot issue?<br /><br />Hard=
ware event. This is not a software error.<br />MCE 8<br />CPU 22 BANK 8 TSC=
 5aa4ecdd795a<br />MISC ac29890200046646 ADDR ee2f6e800<br />TIME 165576247=
2 Mon Jun 20 17:01:12 2022<br />MCG status:<br />Memory read ECC error<br /=
>Memory corrected error count (CORE_ERR_CNT): 1<br />Memory transaction Tra=
cker ID (RTId): 46<br />Memory DIMM ID of error: 0<br />Memory channel ID o=
f error: 1<br />Memory ECC syndrome: ac298902<br />STATUS 8c0000400001009f =
MCGSTATUS 0<br />MCGCAP 1c09 APICID 34 SOCKETID 0<br />CPUID Vendor Intel F=
amily 6 Model 44 Step 2<br />DDR3 DIMM 800 Mhz Other Width 72 Data Width 64=
 Size 4 GB<br />Device Locator: P2-DIMM2C<br />Bank Locator: BANK14<br />Ma=
nufacturer: Hyundai<br />Serial Number: 40F3C20F<br />Asset Tag:<br />Part =
Number: HMT151R7BFR4C-H9<br /><br /><br /><br />-- <br />Larry Rosenman&nbs=
p; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<a =
href=3D"http://www.lerctr.org/~ler" target=3D"_blank" rel=3D"noopener noref=
errer">http://www.lerctr.org/~ler</a><br />Phone: +1 214-642-9640&nbsp; &nb=
sp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;E-Mail: <a href=3D"mail=
to:ler@lerctr.org" rel=3D"noreferrer">ler@lerctr.org</a><br />US Mail: 5708=
 Sabbia Dr, Round Rock, TX 78665-2106<br /><br /></blockquote>
</div>
</div>
</blockquote>
<p><br /></p>
<div id=3D"v1gmail-m_3316908189451833722v1gmail-m_2643139469111892975signat=
ure">
<div style=3D"margin: 0px; padding: 0px; font-family: monospace;"><span>--&=
nbsp;<br />Larry Rosenman &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a h=
ref=3D"http://www.lerctr.org/~ler" target=3D"_blank" rel=3D"noopener norefe=
rrer">http://www.lerctr.org/~ler</a><br />Phone: +1 214-642-9640 &nbsp;&nbs=
p;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;E-Mail: <a href=3D"mailto:ler@lerctr.org" rel=3D"noreferrer">ler=
@lerctr.org</a><br />US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106<br =
/></span></div>
</div>
</div>
</blockquote>
</div>
</div>
</blockquote>
<p><br /></p>
<div id=3D"v1gmail-m_3316908189451833722signature">
<div style=3D"margin: 0px; padding: 0px; font-family: monospace;"><span>--&=
nbsp;<br />Larry Rosenman &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a h=
ref=3D"http://www.lerctr.org/~ler" target=3D"_blank" rel=3D"noopener norefe=
rrer">http://www.lerctr.org/~ler</a><br />Phone: +1 214-642-9640 &nbsp;&nbs=
p;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&=
nbsp;&nbsp;E-Mail: <a href=3D"mailto:ler@lerctr.org" rel=3D"noreferrer">ler=
@lerctr.org</a><br />US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106<br =
/></span></div>
</div>
</div>
</blockquote>
</div>
</div>
</blockquote>
<p><br /></p>
<div id=3D"signature">
<div class=3D"pre" style=3D"margin: 0; padding: 0; font-family: monospace">=
<span class=3D"sig">--&nbsp;<br />Larry Rosenman &nbsp;&nbsp;&nbsp;&nbsp;&n=
bsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp=
;&nbsp;&nbsp;&nbsp;<a href=3D"http://www.lerctr.org/~ler" target=3D"_blank"=
 rel=3D"noopener noreferrer">http://www.lerctr.org/~ler</a><br />Phone: +1 =
214-642-9640 &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&n=
bsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;E-Mail: <a href=3D"mailto:ler@lerctr.org"=
>ler@lerctr.org</a><br />US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106=
<br /></span></div>
</div>
</body></html>

--=_2a5f4419d9ada4d3f970c5ed9bd02eac--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?983660c80cc6717e3b49821f7957ee80>