From owner-freebsd-stable@freebsd.org Thu Feb 4 21:34:31 2021 Return-Path: Delivered-To: freebsd-stable@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id B5E7452FBF8 for ; Thu, 4 Feb 2021 21:34:31 +0000 (UTC) (envelope-from mmacy@freebsd.org) Received: from mailman.nyi.freebsd.org (unknown [127.0.1.3]) by mx1.freebsd.org (Postfix) with ESMTP id 4DWsJR4V3Tz3Ccm for ; Thu, 4 Feb 2021 21:34:31 +0000 (UTC) (envelope-from mmacy@freebsd.org) Received: by mailman.nyi.freebsd.org (Postfix) id 5A6DD52FC63; Thu, 4 Feb 2021 21:34:30 +0000 (UTC) Delivered-To: stable@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id 53AA952FC62 for ; Thu, 4 Feb 2021 21:34:30 +0000 (UTC) (envelope-from mmacy@freebsd.org) Received: from smtp.freebsd.org (smtp.freebsd.org [IPv6:2610:1c1:1:606c::24b:4]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (4096 bits) client-digest SHA256) (Client CN "smtp.freebsd.org", Issuer "R3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4DWsJP4PWgz3ClG; Thu, 4 Feb 2021 21:34:29 +0000 (UTC) (envelope-from mmacy@freebsd.org) Received: from mail-lj1-f177.google.com (mail-lj1-f177.google.com [209.85.208.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "smtp.gmail.com", Issuer "GTS CA 1O1" (verified OK)) (Authenticated sender: mmacy) by smtp.freebsd.org (Postfix) with ESMTPSA id BE271BD9B; Thu, 4 Feb 2021 21:34:26 +0000 (UTC) (envelope-from mmacy@freebsd.org) Received: by mail-lj1-f177.google.com with SMTP id e18so5185000lja.12; Thu, 04 Feb 2021 13:34:26 -0800 (PST) X-Gm-Message-State: AOAM530ZQ7dPVKn+GAWvYHXFy5EDy9b5w4gv0r5CijPeBZLhdLDl4tqu uGSRfTYsDWhFD0gr1y31zzG6ZlQ7yp5SIUxgRY4= X-Google-Smtp-Source: ABdhPJy5iIfsdmRD1ZjZE/HiA0TPsQPZvfda7uaEPchih9r2rWgYBCCq9D6F1xfsVrdheX46+qVpm3lZbYdl1Ae18KE= X-Received: by 2002:a2e:9f4e:: with SMTP id v14mr743246ljk.497.1612474464684; Thu, 04 Feb 2021 13:34:24 -0800 (PST) MIME-Version: 1.0 References: In-Reply-To: From: Matthew Macy Date: Thu, 4 Feb 2021 13:34:13 -0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Page fault in _mca_init during startup To: Alan Somers Cc: FreeBSD Stable ML Content-Type: text/plain; charset="UTF-8" X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.34 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 04 Feb 2021 21:34:31 -0000 On Thu, Feb 4, 2021 at 1:31 PM Alan Somers wrote: > > After upgrading a machine to FreeBSD, 12.2, it hit the following panic on > its first reboot. I suspect that a few other servers have hit this too, > but since it happens before swap is mounted there are no core dumps, and > they usually reboot immediately. The code in question hasn't changed since > 2018. The panic happened in cmci_monitor at line 930. Does anybody have > any suggestions for how I could debug further? I can't readily reproduce > it, and I can't dump core, but I'd like to investigate it any way I can. > The server in question has dual Xeon Gold 6142 CPUs. > I can't actually help :( but I can add a +1 with similar hardware or equivalent specs. It's not frequent, but it's often enough to be annoying. -M > if (!(ctl & MC_CTL2_CMCI_EN)) > /* This bank does not support CMCI. */ > return; > > cc = &cmc_state[PCPU_GET(cpuid)][i]; // <- panic here > > /* Determine maximum threshold. */ > > > Fatal trap 12: page fault while in kernel mode > cpuid = 26; apic id = 34 > fault virtual address = 0xd0 > fault code = supervisor read data, page not present > instruction pointer = 0x20:0xffffffff8125a009 > stack pointer = 0x28:0xfffffe0000b65f20 > frame pointer = 0x28:0xfffffe0000b65f50 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags = resume, IOPL = 0 > current process = 11 (idle: cpu26) > trap number = 12 > panic: page fault > cpuid = 26 > time = 1 > KDB: stack backtrace: > db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame > 0xfffffe0000b65be0 > vpanic() at vpanic+0x17b/frame 0xfffffe0000b65c30 > panic() at panic+0x43/frame 0xfffffe0000b65c90 > trap_fatal() at trap_fatal+0x391/frame 0xfffffe0000b65cf0 > trap_pfault() at trap_pfault+0x4f/frame 0xfffffe0000b65d40 > trap() at trap+0x286/frame 0xfffffe0000b65e50 > calltrap() at calltrap+0x8/frame 0xfffffe0000b65e50 > --- trap 0xc, rip = 0xffffffff8125a009, rsp = 0xfffffe0000b65f20, rbp = > 0xfffffe0000b65f50 --- > _mca_init() at _mca_init+0x5d9/frame 0xfffffe0000b65f50 > init_secondary_tail() at init_secondary_tail+0xfd/frame 0xfffffe0000b65f80 > init_secondary() at init_secondary+0x2d1/frame 0xfffffe0000b65ff0 > KDB: enter: panic > [ thread pid 11 tid 100029 ] > Stopped at kdb_enter+0x37: movq $0,0x12bc1f6(%rip) > _______________________________________________ > freebsd-stable@freebsd.org mailing list > https://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"