Date: Sun, 25 Mar 2018 14:50:05 +1100 From: Andrew Reilly <areilly@bigpond.net.au> To: Warner Losh <imp@bsdimp.com> Cc: FreeBSD Current <current@freebsd.org> Subject: Re: 12-Current-r331347 panics on boot (r331346 and earlier didn't.) Message-ID: <20180325035005.GA2551@Zen.ac-r.nu> In-Reply-To: <CANCZdfozmyxC5MuNS8Tu_LD1bbAYNTnTcPe52-6sz9KPCQou_Q@mail.gmail.com> References: <20180324035653.GA3411@Zen.ac-r.nu> <CANCZdfozmyxC5MuNS8Tu_LD1bbAYNTnTcPe52-6sz9KPCQou_Q@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
OK, I've completed the search: r331346 works, r331347 panics somewhere in the initialization of random. In the 331347 change (Add the "TCP Blackbox Recorder") I can't see anything obvious to tweak, unfortunately. It's a fair chunk of new code but it's all network-stack related, and my kernel is panicking long before any network activity happens. Any suggestions? Cheers, Andrew On Sat, Mar 24, 2018 at 08:14:40AM -0600, Warner Losh wrote: > Also, what rev failed? I booted r331464 last night w/o issue. > > Warner > > On Fri, Mar 23, 2018 at 9:56 PM, Andrew Reilly <areilly@bigpond.net.au> > wrote: > > > Hi all, > > > > For reasons that still escape me, I haven't been able to get a kernel dump > > to debug, sorry. > > > > Just thought that I'd generate a fairly low-quality report, to see if > > anyone has some ideas. > > > > The last kernel that I have that booted OK (and I'm now running) is: > > FreeBSD Zen.ac-r.nu 12.0-CURRENT FreeBSD 12.0-CURRENT #1 r331064M: Sat > > Mar 17 07:54:51 AEDT 2018 root@Zen:/usr/obj/usr/src/amd64.amd64/sys/GENERIC > > amd64 > > > > The machine is a: > > CPU: AMD Ryzen 7 1700 Eight-Core Processor (2994.46-MHz K8-class > > CPU) > > Origin="AuthenticAMD" Id=0x800f11 Family=0x17 Model=0x1 Stepping=1 > > Features=0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8, > > APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT> > > > > Kernels built from head as of a couple of hours ago get through launching > > the other CPUs and then stops somewhere in random, apparently: > > > > SMP: AP CPU #2 Launched! > > Timecounter "TSC-low" frequency 1497223020 Hz quality 1000 > > random: entpanic: mtx_lock() of spin mutex (null) @ > > /usr/src/sys/kern/subr_bus.c:617 > > cpuid = 0 > > time = 1 > > KDB: stack backtrace: > > db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame > > 0xfffffe00004507a0 > > vpanic() at vpanic+0x18d/frame 0xfffffe0000450800 > > doadump () at doadump/frame 0xfffffe0000450880 > > __mtx_lock_flags() at __mtx_lock_flags+0x163/frame 0xfffffe00004508d0 > > devctl_queue_data_f() at devctl_queue_data_f+0x6a/frame 0xfffffe0000450900 > > g_dev_taste() at g_dev_taste+0x370/frame 0xfffffe0000450a10 > > g_new_provider_event() at g_new_provider_event+0xfa/frame > > 0xfffffe0000450a30 > > g_run_events() at g_run_events+0x151/frame 0xfffffe0000450a70 > > fork_exit() at fork_exit+0x84/frame 0xfffffe0000450ab0 > > fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe0000450ab0 > > --- trap 0, rip = 0, rsp = 0, rbp = 0 --- > > KDB: enter: panic > > [ thread pid 14 tid 100052 ] > > Stopped at kdb_enter+0x3b: movq $0,kdb_why > > db> dump > > Cannot dump: no dump device specified. > > db> > > > > Now dumping worked fine the last time the kernel panicked: I have > > dumpdev=AUTO in rc.conf and I have swap on nvd0p3 (first) and > > /dev/zvol/root/swap > > (second, larger than the first.) > > > > Root on the nvd0p2 is ZFS, and ther's a four-drive raidZ with user > > directories and what-not on them, and another ZFS on an external USB drive > > that I use > > for backups, unmounted. > > > > In the new kernels, we clearly aren't even getting as far as finding the > > hubs and controllers, let alone the drives. > > > > I've attached dmesg.boot from the last boot from last week's good kernel. > > (While briefly in yoyo mode I turned the SMT back on, so now there are 16 > > cores > > instead of the eight mentioned in the crash dump. Didn't help, but I > > haven't turned it back off yet.) > > > > Cheers, > > > > Andrew > > > > > > _______________________________________________ > > freebsd-current@freebsd.org mailing list > > https://lists.freebsd.org/mailman/listinfo/freebsd-current > > To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org" > > > > > _______________________________________________ > freebsd-current@freebsd.org mailing list > https://lists.freebsd.org/mailman/listinfo/freebsd-current > To unsubscribe, send any mail to "freebsd-current-unsubscribe@freebsd.org"
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20180325035005.GA2551>