Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 18 Jan 2022 11:25:46 +0300
From:      Dima Panov <fluffy@FreeBSD.org>
To:        Willem Jan Withagen <wjw@digiware.nl>, Tomoaki AOKI <junchoon@dec.sakura.ne.jp>
Cc:        Eugene Grosbein <eugen@grosbein.net>, stable@freebsd.org
Subject:   Re: Trying to boot a supermicro H8DMT board
Message-ID:  <b821a194-a87f-456e-8339-84d45ef9f8da@Canary>
In-Reply-To: <8096cd7e-bc11-5fa7-cc96-6bcdf1278ffc@digiware.nl>
References:  <8ac447b6-eaaf-0a8f-da69-27db15dd6f55@digiware.nl> <2ec39eef-d2e2-c55e-b032-43de86e71a57@digiware.nl> <3d87a0b3-7bed-453b-df23-4a258ea46fbb@grosbein.net> <d8e6c746-3ec1-9c21-d5e7-44dc9600bb0b@digiware.nl> <802cf542-979d-b8e1-3f71-616b026eb852@grosbein.net> <48f57581-1f39-9f57-0e44-19c2c2bb3aeb@digiware.nl> <a0315a54-aefa-a3a3-2ac3-94d6e9410961@grosbein.net> <eac3dcef-9183-8fa5-b0de-b70650235960@digiware.nl> <78a47e83-a339-0c79-0ee0-9e55be80c78b@grosbein.net> <d0dbce19-f5e5-3ff0-99d6-55a9c94a4b48@digiware.nl> <2f49fd20-cb5a-5ccc-7f9b-0229bc8e14b1@grosbein.net> <86766549-be58-1125-867e-ae4c415e1bb4@digiware.nl> <7903a41f-94ba-2caf-9270-a1bd9582c600@grosbein.net> <229c3042-3297-7903-9778-9b55d5c3f998@digiware.nl> <71d1e25c-f1f6-2371-486e-2382d67a3fc5@grosbein.net> <c6588210-ac68-f081-f0a4-85669aa84eb3@digiware.nl> <9d73e9ba-af23-ea90-e5fa-cf3a04a8513b@grosbein.net> <7c5d9cc0-be85-c855-a294-71a93f2c5440@digiware.nl> <20220118021432.197aa1241d53b1cba6e8c562@dec.sakura.ne.jp> <8096cd7e-bc11-5fa7-cc96-6bcdf1278ffc@digiware.nl>

next in thread | previous in thread | raw e-mail | index | archive | help
--61e6798a_327b23c6_93a
Content-Type: multipart/alternative; boundary="61e6798a_6b8b4567_93a"

--61e6798a_6b8b4567_93a
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

Moin=21

As mobo manual says, u could use cpu1/1b, cpu1/1a, cpu2/1b, cpu2/1a due t=
o hard dependency on pair interleaving support.
Slots fills from far to close order from each cpu by pairs only

--
Dima. (desktop, kde, x11, office, ports-secteam)=40=46reeBSD team
(fluffy=40=46reeBSD.org, https://t.me/dima=5Fpanov)

> On Monday, Jan 17, 2022 at 9:47 PM, Willem Jan Withagen <wjw=40digiware=
.nl (mailto:wjw=40digiware.nl)> wrote:
> On 17-1-2022 18:14, Tomoaki AOKI wrote:
> > On Mon, 17 Jan 2022 15:04:16 +0100
> > Willem Jan Withagen <wjw=40digiware.nl> wrote:
> >
> > > On 17-1-2022 14:46, Eugene Grosbein wrote:
> > > > 17.01.2022 20:24, Willem Jan Withagen wrote:
> > > >
> > > > > > Well, perform independent hardware (memory) testing with some=
thing like memtest86+
> > > > > > and if it is all right, you show ask someone more knowledgeab=
le. Maybe CC: arch=40freebsd.org
> > > > > Perhaps should have done that when I started, but supplier assu=
red me that
> > > > > the they just retired the boards with out any issues.
> > > > > Memtest86 found the faulty DIMM in 30 secs...
> > > > >
> > > > > Not sure if we could/want educate vm=5Fmem=5Finit() to actually=
 detect this.
> > > > > It is still in the part where everthing is still running on the=
 first CPU.
> > > > > Making things a bit easier to understand what is going on.
> > > > >
> > > > > Lets see if the box will run on 3 DIMMs for the rime being.
> > > > > Then figure out with DMIdecode what we need expand again.
> > > > Is it ECC memory or non-ECC=3F
> > > > The kernel already have full memory testing performed at boot tim=
e
> > > > unless disabled with another loader knob:
> > > >
> > > > hw.memtest.tests=3D0
> > > >
> > > > Try booting it with memory testing disabled and without hw.physme=
m limitation.
> > > > Maybe it will boot.
> > > >
> > > > With ECC, it could be hardware interrupt while kernel runs that t=
est
> > > > and wrong in-kernel processing of the interrupt.
> > > Swapped the DIMM with 3 others, but still the same errors.
> > > Then I changed DIMM slot, and the errors went away.
> > > So definitely a hardware issue
> > >
> > > when booted =46reeBSD reported already only 12Gb in system ( there =
are 4
> > > 4GB dimms)
> > > Using 8Gb. DIMMs are ECC.
> > > But then still it would only boot when mem set to 8G.
> > >
> > > Waiting for memtest to finish at least one pass.
> > > Usually that will take quite some time.
> > >
> > > --WjW
> > >
> > >
> > Not sure this is the case, but some motherboards have severe limitati=
on
> > about DIMM slot usage, if not fully used.
> >
> > =46or example, assuming slot No. are B0-0, 1, 2, 3 and B1-0, 1, 2, 3,=

> >
> > *Must use =22interleaved. If 4 in 8 slots are to be used,
> > B0-0, B0-2, B1-0, B1-2 shall be used.
> > (Some forced B0-1, B0-3, B1-1, B1-3, IIRC)
> >
> > *Must NOT use =22interleaved.
> > B0-0, B0-1, B1-0, B1-1 shall be used.
> >
> > *Must NOT use B1 unless B0 is full of DIMs.
> > B0-0. B0-1, B0-2, B0-3 shall be used.
> >
> > and so on, depending on motherboard vendor (at worst, per model.)
>
> Yup, I know... I used the board in the configuration I got it.
> And its a DUAL processor board with 2 opterons.
> The config works correct for the first Opteron (Called CPU1)
> using slots: CPU1/DIMM1A and CPU1/DIMM1B
> But on the second CPU I have to use the third slot....
> so using slots: CPU2/DIMM1B and CPU2/DIMM2B
>
> And my memtest86 has complete 1 full pass over 16G without errors.
> So I'm guessing that the order is not majorly picky.
>
> But you are correct in noting this, so I will read up ont this in the
> manual.
>
> Thanx,
> --WjW
>
>

--61e6798a_6b8b4567_93a
Content-Type: text/html; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

<html xmlns=3D=22http://www.w3.org/1999/xhtml=22><head>; <title></title> <=
meta name=3D=22viewport=22 content=3D=22width=3Ddevice-width, initial-sca=
le=3D1.0, user-scalable=3Dno=22> </head> <body><img id=3D=22A8=4630934D1=46=
604=4637BC577051=46E521=46=46=22 alt=3D=22=22 width=3D=220px=22 src=3D=22=
https://read-receipts.canarymail.io:8100/track/2BD3A57834=462C18772CBE819=
A75674B4=5FA8=4630934D1=46604=4637BC577051=46E521=46=46.png=22 height=3D=22=
0px=22><div id=3D=22CanaryBody=22> <div> Moin=21</div><div><br></div><div=
>As mobo manual says, u could use cpu1/1b, cpu1/1a, cpu2/1b, cpu2/1a due =
to hard dependency on pair interleaving support.=C2=A0</div><div>Slots fi=
lls from far to close order from each cpu by pairs only</div> <div><br></=
div> </div> <div id=3D=22CanarySig=22> <div> <div style=3D=22font-family:=
Helvetica;=22>--<br><div>Dima. (desktop, kde, x11, office, ports-secteam)=
=40=46reeBSD team</div><div>(fluffy=40=46reeBSD.org, https://t.me/dima=5F=
panov)</div></div> <div><br></div> </div> </div> <div id=3D=22CanaryDropb=
ox=22> </div> <blockquote id=3D=22CanaryBlockquote=22> <div> <div>On Mond=
ay, Jan 17, 2022 at 9:47 PM, Willem Jan Withagen &lt;<a href=3D=22mailto:=
wjw=40digiware.nl=22>wjw=40digiware.nl</a>&gt; wrote:<br></div> <div>On 1=
7-1-2022 18:14, Tomoaki AOKI wrote: <br><blockquote type=3D=22cite=22>On =
Mon, 17 Jan 2022 15:04:16 +0100 <br>Willem Jan Withagen &lt;wjw=40digiwar=
e.nl&gt; wrote: <br> <br><blockquote type=3D=22cite=22>On 17-1-2022 14:46=
, Eugene Grosbein wrote: <br><blockquote type=3D=22cite=22>17.01.2022 20:=
24, Willem Jan Withagen wrote: <br> <br><blockquote type=3D=22cite=22><bl=
ockquote type=3D=22cite=22>Well, perform independent hardware (memory) te=
sting with something like memtest86+ <br>and if it is all right, you show=
 ask someone more knowledgeable. Maybe CC: arch=40freebsd.org <br></block=
quote>Perhaps should have done that when I started, but supplier assured =
me that <br>the they just retired the boards with out any issues. <br>Mem=
test86 found the faulty DIMM in 30 secs... <br> <br>Not sure if we could/=
want educate vm=5Fmem=5Finit() to actually detect this. <br>It is still i=
n the part where everthing is still running on the first CPU. <br>Making =
things a bit easier to understand what is going on. <br> <br>Lets see if =
the box will run on 3 DIMMs for the rime being. <br>Then figure out with =
DMIdecode what we need expand again. <br></blockquote>Is it ECC memory or=
 non-ECC=3F <br>The kernel already have full memory testing performed at =
boot time <br>unless disabled with another loader knob: <br> <br>hw.memte=
st.tests=3D0 <br> <br>Try booting it with memory testing disabled and wit=
hout hw.physmem limitation. <br>Maybe it will boot. <br> <br>With ECC, it=
 could be hardware interrupt while kernel runs that test <br>and wrong in=
-kernel processing of the interrupt. <br></blockquote>Swapped the DIMM wi=
th 3 others, but still the same errors. <br>Then I changed DIMM slot, and=
 the errors went away. <br>So definitely a hardware issue <br> <br>when b=
ooted =46reeBSD reported already only 12Gb in system ( there are 4 <br>4G=
B dimms) <br>Using 8Gb. DIMMs are ECC. <br>But then still it would only b=
oot when mem set to 8G. <br> <br>Waiting for memtest to finish at least o=
ne pass. <br>Usually that will take quite some time. <br> <br>--WjW <br> =
<br> <br></blockquote>Not sure this is the case, but some motherboards ha=
ve severe limitation <br>about DIMM slot usage, if not fully used. <br> <=
br>=46or example, assuming slot No. are B0-0, 1, 2, 3 and B1-0, 1, 2, 3, =
<br> <br>*Must use =22interleaved. If 4 in 8 slots are to be used, <br>B0=
-0, B0-2, B1-0, B1-2 shall be used. <br>(Some forced B0-1, B0-3, B1-1, B1=
-3, IIRC) <br> <br>*Must NOT use =22interleaved. <br>B0-0, B0-1, B1-0, B1=
-1 shall be used. <br> <br>*Must NOT use B1 unless B0 is full of DIMs. <b=
r>B0-0. B0-1, B0-2, B0-3 shall be used. <br> <br>and so on, depending on =
motherboard vendor (at worst, per model.) <br></blockquote> <br>Yup, I kn=
ow... I used the board in the configuration I got it. <br>And its a DUAL =
processor board with 2 opterons. <br>The config works correct for the fir=
st Opteron (Called CPU1) <br>using slots: CPU1/DIMM1A and CPU1/DIMM1B <br=
>But on the second CPU I have to use the third slot.... <br>so using slot=
s: CPU2/DIMM1B and CPU2/DIMM2B <br> <br>And my memtest86 has complete 1 f=
ull pass over 16G without errors. <br>So I'm guessing that the order is n=
ot majorly picky. <br> <br>But you are correct in noting this, so I will =
read up ont this in the <br>manual. <br> <br>Thanx, <br>--WjW <br> <br> <=
br></div> </div> </blockquote> </body></html>
--61e6798a_6b8b4567_93a--

--61e6798a_327b23c6_93a
Content-Type: application/pgp-signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: Canary PGP V3

iQJVBAABCgA/OBxEaW1hIFBhbm92IChGcmVlQlNELk9SRyBDb21taXR0ZXIpIDxm
bHVmZnlARnJlZUJTRC5PUkc+BQJh5nmKAAoJEPuLoJ3VOY8p4p0QANZ8ZB2cnK2J
YC5yOE2A/T6Mw3L+efPTzg83KYTDxeW4+MpDUzJv4ubmQDUF2R8qs4GPZXQmdtPW
dvQ9Gjr909LQV2DiJo6jn9LXx+9yLuwDFZwp3P10p/hATakwQiwZxp2qrqQvVgBb
7oXgWe7Y379dnNc1LGg/GS4rbNQ3WCKLM6BMyAyKYXqWFvD5PDG5w4vIR9g7bszh
L0q2oR7y3F2w4Icn7We7OmG0saZl7YQZ8GFdIiiKLUhc7R+kxJvRZErPUwECHei7
IDU5TXijPQg9Bc5JbJXcrFSxBL9XwGHKy/m9xw/OGFe0Gj4Qo9zsT0W46VT1smJS
rF04EI5gowgB9XYI9YAaeGFEq7I8ywRBHQMABXKfJIq3ltyTY7m5JevDnSHKdPUG
ccEdAQnHcM59z/J1fvwoebq5tKvMDre613fTIv4x3aFNi8O+sfpjIyME1EfcQ8p3
DEVdm7L94YBActUgQ6do3gs5MqV2rIhcnWrfXtsGhZWIx/53SIL4q7kRqsfJHdso
9qEu//nRT/sA6Cd1dj7xCJKtw53aB5UpZ86xhpgYUd2SkB/scV12A90UVY3DekBp
Kj1WSllgXkVs3FVu3S5CQ4Qqb1XQS9sSvWLEdqLrI0PhYV8Iby25Ze2wTDcp8NAt
yCjqHabd8AiYDyHs82KuAFdae91aIgvk
=73Jv
-----END PGP SIGNATURE-----

--61e6798a_327b23c6_93a--




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?b821a194-a87f-456e-8339-84d45ef9f8da>