From owner-freebsd-stable@freebsd.org Sat Feb 3 09:37:48 2018 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 305A0ED2045 for ; Sat, 3 Feb 2018 09:37:48 +0000 (UTC) (envelope-from hps@selasky.org) Received: from mail.turbocat.net (turbocat.net [88.99.82.50]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id B675C7E2D8; Sat, 3 Feb 2018 09:37:47 +0000 (UTC) (envelope-from hps@selasky.org) Received: from hps2016.home.selasky.org (unknown [62.141.128.70]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.turbocat.net (Postfix) with ESMTPSA id 3AC1E26009E; Sat, 3 Feb 2018 10:37:40 +0100 (CET) Subject: Re: panic when loading mlxen To: Daniel Braniss , "K. Macy" Cc: freebsd-stable References: <83F78F7E-4DC2-41F5-837A-CB728D67B1E3@cs.huji.ac.il> <0D15BA02-5E39-4E04-84E1-3BF6D62065C1@cs.huji.ac.il> From: Hans Petter Selasky Message-ID: <673e43f8-4cc8-5c66-86f3-6b9dfd6b4a76@selasky.org> Date: Sat, 3 Feb 2018 10:34:46 +0100 User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:52.0) Gecko/20100101 Thunderbird/52.5.2 MIME-Version: 1.0 In-Reply-To: <0D15BA02-5E39-4E04-84E1-3BF6D62065C1@cs.huji.ac.il> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.25 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 03 Feb 2018 09:37:48 -0000 On 02/03/18 08:34, Daniel Braniss wrote: > > >> On 2 Feb 2018, at 20:47, K. Macy wrote: >> >> That's odd since it doesn't use any of taskqgroup stuff. I take it you >> can't get a core? > > no core but some more info: > db> bt > Tracing pid 0 tid 100000 td 0xffffffff81e0e500 > taskqgroup_attach_cpu() at taskqgroup_attach_cpu+0x4f/frame 0xffffffff822e4c30 > tasklet_subsystem_init() at tasklet_subsystem_init+0xde/frame 0xffffffff822e4c90 > mi_startup() at mi_startup+0x9c/frame 0xffffffff822e4cb0 > btext() at btext+0x2c > >> >> Also, why are you loading it in loader.conf (slower) as opposed to rc.conf? > sometimes it’s booted diskless, and the driver is needed early. > and btw, this box doesn’t even have a mellanox card. > > >> -M >> >> >> >> On Fri, Feb 2, 2018 at 4:46 AM, Daniel Braniss wrote: >>> with latest stable (r328769) when I have >>> mlxen_load=“YES” >>> in my loader.conf it panics: >>> >>> KDB: debugger backends: ddbsize 0x4638 at 0x22d6000 f >>> KDB: current backend: ddb >>> Copyright (c) 1992-2018 The FreeBSD Project. >>> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 >>> The Regents of the University of California. All rights reserved. >>> FreeBSD is a registered trademark of The FreeBSD Foundation. >>> FreeBSD 11.1-STABLE #18: Fri Feb 2 10:46:12 IST 2018 >>> danny@pe-44:/home/obj/pe-44/net/rnd/r+d/stable/11/sys/HUJI amd64 >>> FreeBSD clang version 5.0.1 (tags/RELEASE_501/final 320880) (based on LLVM 5.0.1) >>> VT(vga): resolution 640x480 >>> CPU: Intel(R) Xeon(R) CPU E5507 @ 2.27GHz (2261.04-MHz K8-class CPU) >>> Origin="GenuineIntel" Id=0x106a5 Family=0x6 Model=0x1a Stepping=5 >>> Features=0xbfebfbff >>> Features2=0x9ce3bd >>> AMD Features=0x28100800 >>> AMD Features2=0x1 >>> VT-x: PAT,HLT,MTF,PAUSE,EPT,VPID >>> TSC: P-state invariant, performance statistics >>> real memory = 25769803776 (24576 MB) >>> avail memory = 24931561472 (23776 MB) >>> Event timer "LAPIC" quality 100 >>> ACPI APIC Table: >>> FreeBSD/SMP: Multiprocessor System Detected: 8 CPUs >>> FreeBSD/SMP: 2 package(s) x 4 core(s) >>> ioapic1: Changing APIC ID to 1 >>> ioapic0 irqs 0-23 on motherboard >>> ioapic1 irqs 32-55 on motherboard >>> >>> >>> Fatal trap 12: page fault while in kernel mode >>> cpuid = 0; apic id = 10 >>> fault virtual address = 0x1818 >>> fault code = supervisor write data, page not present >>> instruction pointer = 0x20:0xffffffff80ad427f >>> stack pointer = 0x28:0xffffffff822e3be0 >>> frame pointer = 0x28:0xffffffff822e3c30 >>> code segment = base 0x0, limit 0xfffff, type 0x1b >>> = DPL 0, pres 1, long 1, def32 0, gran 1 >>> processor eflags = interrupt enabled, resume, IOPL = 0 >>> current process = 0 (swapper) >>> [ thread pid 0 tid 100000 ] >>> Stopped at taskqgroup_attach_cpu+0x4f: lock cmpxchgq %r12,(%rdi) Hi, It should work if you "kldload mlxen" after boot or add it to kld_list in /etc/rc.conf. Looks like I have one more combination to test after the LinuxKPI upgrade in 11-stable. Thanks for notifying me. --HPS