Date: Thu, 6 Jun 2019 10:43:26 -0400 From: Mark Saad <nonesuch@longcount.org> To: FreeBSD Hackers <freebsd-hackers@freebsd.org> Subject: Kernel panic on 12-STABLE-r348203 amd64 E5-2690v4 with Cluster on die mode enabled Message-ID: <CAMXt9NYg5rK%2BjdAJKVwCaWGaE4GZ5W6Np3=0_RZQoz=%2B00uQxw@mail.gmail.com>
next in thread | raw e-mail | index | archive | help
All I posed this yesterday but; I am not sure what happened. Here is the short version. I received two new Dell r630's each with a E5-2690 v4 . The E5-2690 v4 has 14 Cores in two packages on the one chip. I don't remember the exact topology however as a result the BIOS supports the NUMA / Memory mode know as Cluster on Die were each package on the one chip shows up as its own NUMA domain. The issue is this when enabled the box boots 12-RELEASE a-ok. When I rebuilt 12.0-STABLE-r348203 it would panic early in the boot process. Here is a dump of the console =============================== Loading kernel... /boot/kernel/kernel text=0x168d811 data=0x1cf968+0x768c80 syms=[0x8+0x1778e8+0x8 / +0x194f1d] Loading configured modules... /boot/kernel/ipmi.ko size 0x11e10 at 0x2645000 loading required module 'smbus' /boot/kernel/smbus.ko size 0x2ef0 at 0x2657000 /boot/entropy size=0x1000 /boot/kernel/cc_httcp.ko size 0x2330 at 0x265b000 ---<<BOOT>>---c_hmodule 'smbus' Copyright (c) 1992-2019 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 12.0-STABLE r348693 GENERIC amd64 FreeBSD clang version 8.0.0 (tags/RELEASE_800/final 356365) (based on LLVM 8.0.0) panic: UMA zone "UMA Zones": Increase vm.boot_pages cpuid = 0 time = 1 KDB: stack backtrace: #0 0xffffffff80c16df7 at ??+0 #1 0xffffffff80bcaccd at ??+0 #2 0xffffffff80bcab23 at ??+0 #3 0xffffffff80f0b03c at ??+0 #4 0xffffffff80f08d8d at ??+0 #5 0xffffffff80f0bb3d at ??+0 #6 0xffffffff80f0b301 at ??+0 #7 0xffffffff80f0b3d1 at ??+0 #8 0xffffffff80f066c4 at ??+0 #9 0xffffffff80f0543f at ??+0 #10 0xffffffff80f23aef at ??+0 #11 0xffffffff80f1133b at ??+0 #12 0xffffffff80b619c8 at ??+0 #13 0xffffffff8036a02c at ??+0 Uptime: 1s =============================== The only solution was to mess with vm.boot_pages . I got it booted with 128 as the value. Also to be clear if I switched back to Home Snoop, Early Snoop the box is fine. Its only unhappy whit Cluster on Die and 12.0-STABLE . Anyone know whats going on ? -- mark saad | nonesuch@longcount.org
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAMXt9NYg5rK%2BjdAJKVwCaWGaE4GZ5W6Np3=0_RZQoz=%2B00uQxw>