From: Mark Johnston
Date: Wed, 28 Nov 2018 19:19:04 -0500
To: Garrett Wollman
Cc: Konstantin Belousov, freebsd-stable@freebsd.org
Subject: Re: Trap 12 in vm_page_alloc_after()
Message-ID: <20181129001904.GA63393@raichu>
In-Reply-To: <23547.30738.149260.454185@hergotha.csail.mit.edu>

On Sun, Nov 25, 2018 at 11:35:30PM -0500, Garrett Wollman wrote:
> < said:
>
> > On Sun, Nov 18, 2018 at 08:24:38PM -0500, Garrett Wollman wrote:
> >> Has anyone seen this before?  It's on a busy NFS server, but hasn't
> >> been observed on any of our other NFS servers.
> >>
> >> ------------------------------------------------------------------------
> >> Fatal trap 12: page fault while in kernel mode
>
> >> --- trap 0xc, rip = 0xffffffff809a903d, rsp = 0xfffffe17eb8d0710, rbp = 0xfffffe17eb8d0750 ---
> >> vm_page_alloc_after() at vm_page_alloc_after+0x15d/frame 0xfffffe17eb8d0750
>
> > What is the line number for vm_page_alloc_after+0x15d ?
> > Do you have NUMA enabled on 11 ?
>
> If gdb is to be believed, the trap is at line 1687:
>
>         /*
>          * At this point we had better have found a good page.
>          */
>         KASSERT(m != NULL, ("missing page"));
>         free_count = vm_phys_freecnt_adj(m, -1);
> >>>>>>  if ((m->flags & PG_ZERO) != 0)
>                 vm_page_zero_count--;
>         mtx_unlock(&vm_page_queue_free_mtx);
>         vm_page_alloc_check(m);
>
> The faulting instruction is:
>
> 0xffffffff809a903d : testb $0x8,0x5a(%r14)
>
> There are no options matching /numa/i in the configuration.  (This is
> a non-debugging configuration so the KASSERT is inoperative, I
> assume.)
> I have about a dozen other servers with the same kernel and
> they're not crashing, but obviously they all have different loads and
> sets of active clients.

If you're using a Skylake, I suspect that you can set the
hw.skz63_enable tunable to 0 as a workaround, assuming you're not using
any code that relies on Intel TSX.  (I don't think there's anything in
the base system that does.)  There are some details in
https://reviews.freebsd.org/D18374
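
For reference, a minimal sketch of applying that workaround. It assumes
hw.skz63_enable is read at boot as a loader tunable, which is what
"tunable" normally means on FreeBSD but is not spelled out above. The
file path and quoting follow the usual loader.conf conventions; a reboot
would be needed for the setting to take effect.

```
# /boot/loader.conf
# Assumption: hw.skz63_enable is fetched from the kernel environment at boot.
# Only set this to 0 if nothing on the machine relies on Intel TSX.
hw.skz63_enable="0"
```

After rebooting, "sysctl hw.skz63_enable" should report the value in
effect, provided the running kernel exposes the knob.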