Date: Wed, 21 Dec 2016 13:58:46 +0200 From: Konstantin Belousov <kostikbel@gmail.com> To: Steve Wills <swills@FreeBSD.org> Cc: John Baldwin <jhb@freebsd.org>, freebsd-current@freebsd.org, Adrian Chadd <adrian.chadd@gmail.com>, Slawa Olhovchenkov <slw@zxy.spb.ru> Subject: Re: Enabling NUMA in BIOS stop booting FreeBSD Message-ID: <20161221115846.GJ94325@kib.kiev.ua> In-Reply-To: <d3a7a24f-c2bb-914d-e62b-a99c247efd9f@FreeBSD.org> References: <20161214102711.GF94325@kib.kiev.ua> <20161215224500.GM98176@zxy.spb.ru> <CAJ-Vmon8NX9c0C6rK896UoSDdT=L0MVLuNkciuwzrgv9dEYjuA@mail.gmail.com> <1995191.rPAPaoHxgb@ralph.baldwin.cx> <d3a7a24f-c2bb-914d-e62b-a99c247efd9f@FreeBSD.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Dec 20, 2016 at 04:49:29PM -0500, Steve Wills wrote: > Hi, > > On 12/16/2016 16:20, John Baldwin wrote: > > On Thursday, December 15, 2016 03:57:58 PM Adrian Chadd wrote: > >> heh, an updated BIOS that solves the problem will solve the problem. :) > >> > >> I think you have enough information to provide to supermicro. Ie, > >> "SMAP says X, when physical memory pages at addresses X are accessed, > >> they don't behave like memory, maybe something is wrong". > >> > >> All I can think of is some hack to add a blacklist for that region so > >> you can boot the unit. But it makes me wonder what else is going on. > > > > We have the blacklist: it is the memory test. That is the way to workaround > > this type of BIOS breakage. This is just the first time in over a decade that > > test has been relevant. > > I've got a SuperMicro X10SRA board that I bought back in March, I think. > It was run CURRENT fine since then, until last month, when it started > hanging during boot. I was about to update it to a new version of > CURRENT when it started hanging at boot, but hadn't updated yet. The > hang is after (verbose boot): What is the exact version of the kernel you are running and which hangs ? Try to bisect. Do you have EARLY_AP_STARTUP option in the kernel config ? > > ACPI APIC Table: <SUPERM SMCI--MB> > Package ID shift: 4 > L3 cache ID shift: 4 > L2 cache ID shift: 1 > L1 cache ID shift: 1 > Core ID shift: 1 Send NMI with 'ipmi power diag' and show the machine state from ddb. > > Recently I've tried booting 9.3 and 10.3 on it without success. Other > operating systems boot fine. Thinking the hang was similar to the one in > this thread (or at least the board is), I tried many different BIOS > changes and also tried enabling the memory test, but none of that > changes anything. This is a single socket board so there are no NUMA or > memory interleaving options in the BIOS. The BIOS is up to date (2.0a). > It will boot if SMP is disabled. That's obviously sub-optimal, but is > useful for building updated kernels, which I've tried. If anyone has any > suggestions or ideas, I'd appreciate it. > > Thanks, > Steve >
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20161221115846.GJ94325>