From owner-freebsd-hackers@freebsd.org Thu Jun 6 14:50:23 2019 Return-Path: Delivered-To: freebsd-hackers@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 7CFA215B5B6F for ; Thu, 6 Jun 2019 14:50:23 +0000 (UTC) (envelope-from markjdb@gmail.com) Received: from mail-it1-x143.google.com (mail-it1-x143.google.com [IPv6:2607:f8b0:4864:20::143]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) server-signature RSA-PSS (4096 bits) client-signature RSA-PSS (2048 bits) client-digest SHA256) (Client CN "smtp.gmail.com", Issuer "GTS CA 1O1" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 556EF6FE8A for ; Thu, 6 Jun 2019 14:50:22 +0000 (UTC) (envelope-from markjdb@gmail.com) Received: by mail-it1-x143.google.com with SMTP id a186so396323itg.0 for ; Thu, 06 Jun 2019 07:50:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=fb97PORDr7CiCUeiOIF95ZABhJvWNcrMwP/8/7ODT98=; b=C6xDRhT43mgGi/Wa9cscdAcJZFIu5CfQyFj7cbLRziYtj7Mvw5ZzbbBgeSn+QZmtTt EIWBh8rsq29Mi6QsyWGuWf6pI7n+Y5U+NUKtaaDXIzr/Fgt78tKdm+5eYHe+naIy8LkC aOR6r2JQTlweLZFMhqZJ8+0TWP3VysOTTrR8cS5PAtrhlc5G20go/MMD5GZ2DVC3Ay2y U/V5UaN/ZqLivdKIsVjWv1sKScB9b8O/oOUlBaznZWRr6VlcpKz83FOqbiVBVhqE/i5v TezW6h+La3NqW2PzdFmLSfomEgAdxm82flLcnYZAyBB8vzUoRnipS4dxvBp2SBflsONa N1yw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:date:from:to:cc:subject:message-id :references:mime-version:content-disposition:in-reply-to:user-agent; bh=fb97PORDr7CiCUeiOIF95ZABhJvWNcrMwP/8/7ODT98=; b=Iy9JfdyUwvKhrD5EQUcjUX7HXA4cawlp6OBMTz2VLjoGbLzUMBQ9VZ6GgqOxkdRIP3 BhV3Edgg3DgaX2GaF6W9V0aVSJAcveomGa7x74bj2z5AP4fVSHa2g3iiQmMjXcfWG2T+ e1XxlfBXHzkHcLnA6fKvCqjnkLvfIeyg5NQdDTDT9xdrCpjyVnuDe2MkKmF9BrjxT6iu fSjMcThm7+Epnoyphn5EPOUGTMmqnWGK8zxbtnprfFRRhLeW4WCH1U+JHLRzh+UUXfLq uigm1KM7XuLroEz1xT15E6ZmjVYj3D7a2iy6TBZaWcQJqRO1eWRqkQtLT/TdOlPQHcE8 T1iQ== X-Gm-Message-State: APjAAAUa30irQ/3CnPwJndl+d8HOQ0xji84DOgPCmP5CFTIIHhREBmgq TwwtHhqvh3zuUCmcqGxc7ZZpMcZm X-Google-Smtp-Source: APXvYqw+809++sAort53WZd5r0hGGRLaN9UDX/qLILwmYteWcR2MtOv3QXyaivMi4Dr/Su1ILRKzrg== X-Received: by 2002:a24:ac60:: with SMTP id m32mr372295iti.40.1559832621647; Thu, 06 Jun 2019 07:50:21 -0700 (PDT) Received: from raichu (toroon0560w-lp130-12-70-50-22-99.dsl.bell.ca. [70.50.22.99]) by smtp.gmail.com with ESMTPSA id 138sm1030047itu.26.2019.06.06.07.50.20 (version=TLS1_3 cipher=AEAD-AES256-GCM-SHA384 bits=256/256); Thu, 06 Jun 2019 07:50:20 -0700 (PDT) Sender: Mark Johnston Date: Thu, 6 Jun 2019 10:50:16 -0400 From: Mark Johnston To: Mark Saad Cc: FreeBSD Hackers Subject: Re: Kernel panic on 12-STABLE-r348203 amd64 E5-2690v4 with Cluster on die mode enabled Message-ID: <20190606145016.GA4116@raichu> References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.12.0 (2019-05-25) X-Rspamd-Queue-Id: 556EF6FE8A X-Spamd-Bar: --- Authentication-Results: mx1.freebsd.org; dkim=pass header.d=gmail.com header.s=20161025 header.b=C6xDRhT4; spf=pass (mx1.freebsd.org: domain of markjdb@gmail.com designates 2607:f8b0:4864:20::143 as permitted sender) smtp.mailfrom=markjdb@gmail.com X-Spamd-Result: default: False [-3.36 / 15.00]; RCVD_VIA_SMTP_AUTH(0.00)[]; R_SPF_ALLOW(-0.20)[+ip6:2607:f8b0:4000::/36]; RCVD_COUNT_THREE(0.00)[3]; TO_DN_ALL(0.00)[]; DKIM_TRACE(0.00)[gmail.com:+]; RCPT_COUNT_TWO(0.00)[2]; MX_GOOD(-0.01)[cached: alt3.gmail-smtp-in.l.google.com]; FORGED_SENDER(0.30)[markj@freebsd.org,markjdb@gmail.com]; MIME_TRACE(0.00)[0:+]; RCVD_TLS_LAST(0.00)[]; FREEMAIL_ENVFROM(0.00)[gmail.com]; ASN(0.00)[asn:15169, ipnet:2607:f8b0::/32, country:US]; FROM_NEQ_ENVFROM(0.00)[markj@freebsd.org,markjdb@gmail.com]; ARC_NA(0.00)[]; NEURAL_HAM_MEDIUM(-1.00)[-1.000,0]; R_DKIM_ALLOW(-0.20)[gmail.com:s=20161025]; FROM_HAS_DN(0.00)[]; NEURAL_HAM_SHORT(-0.95)[-0.954,0]; NEURAL_HAM_LONG(-1.00)[-1.000,0]; MIME_GOOD(-0.10)[text/plain]; PREVIOUSLY_DELIVERED(0.00)[freebsd-hackers@freebsd.org]; DMARC_NA(0.00)[freebsd.org]; TO_MATCH_ENVRCPT_SOME(0.00)[]; RCVD_IN_DNSWL_NONE(0.00)[3.4.1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.2.0.0.4.6.8.4.0.b.8.f.7.0.6.2.list.dnswl.org : 127.0.5.0]; IP_SCORE(-0.69)[ip: (2.11), ipnet: 2607:f8b0::/32(-3.22), asn: 15169(-2.30), country: US(-0.06)]; MID_RHS_NOT_FQDN(0.50)[] X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 06 Jun 2019 14:50:23 -0000 On Thu, Jun 06, 2019 at 10:43:26AM -0400, Mark Saad wrote: > All > I posed this yesterday but; I am not sure what happened. Here is the > short version. > I received two new Dell r630's each with a E5-2690 v4 . The E5-2690 v4 > has 14 Cores in two packages on the one chip. I don't remember the > exact topology however as a result the BIOS supports the NUMA / Memory > mode know as Cluster on Die were each package on the one chip shows up > as its own NUMA domain. The issue is this when enabled the box boots > 12-RELEASE a-ok. When I rebuilt 12.0-STABLE-r348203 it would panic > early in the boot process. > Here is a dump of the console > =============================== > Loading kernel... > /boot/kernel/kernel text=0x168d811 data=0x1cf968+0x768c80 > syms=[0x8+0x1778e8+0x8 / > +0x194f1d] > Loading configured modules... > /boot/kernel/ipmi.ko size 0x11e10 at 0x2645000 > loading required module 'smbus' > /boot/kernel/smbus.ko size 0x2ef0 at 0x2657000 > /boot/entropy size=0x1000 > /boot/kernel/cc_httcp.ko size 0x2330 at 0x265b000 > ---<>---c_hmodule 'smbus' > Copyright (c) 1992-2019 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. > FreeBSD is a registered trademark of The FreeBSD Foundation. > FreeBSD 12.0-STABLE r348693 GENERIC amd64 > FreeBSD clang version 8.0.0 (tags/RELEASE_800/final 356365) (based on > LLVM 8.0.0) > panic: UMA zone "UMA Zones": Increase vm.boot_pages > cpuid = 0 > time = 1 > KDB: stack backtrace: > #0 0xffffffff80c16df7 at ??+0 > #1 0xffffffff80bcaccd at ??+0 > #2 0xffffffff80bcab23 at ??+0 > #3 0xffffffff80f0b03c at ??+0 > #4 0xffffffff80f08d8d at ??+0 > #5 0xffffffff80f0bb3d at ??+0 > #6 0xffffffff80f0b301 at ??+0 > #7 0xffffffff80f0b3d1 at ??+0 > #8 0xffffffff80f066c4 at ??+0 > #9 0xffffffff80f0543f at ??+0 > #10 0xffffffff80f23aef at ??+0 > #11 0xffffffff80f1133b at ??+0 > #12 0xffffffff80b619c8 at ??+0 > #13 0xffffffff8036a02c at ??+0 > Uptime: 1s > > =============================== > > The only solution was to mess with vm.boot_pages . I got it booted > with 128 as the value. > Also to be clear if I switched back to Home Snoop, Early Snoop the box > is fine. Its only > unhappy whit Cluster on Die and 12.0-STABLE . > > Anyone know whats going on ? Could you build a kernel with "options DIAGNOSTIC" configured and boot in verbose mode? The kernel should print its boot page allocations to the console. Then, compare the output with a boot with cluster on die disabled.