Date: Thu, 16 Dec 2021 21:16:21 -0600 From: Larry Rosenman <ler@lerctr.org> To: Alexander Motin <mav@freebsd.org> Cc: Freebsd current <freebsd-current@freebsd.org> Subject: Re: Panic: Page Fault in Kernel: Yesterday's CURRENT Message-ID: <35ce3d9ab427375f09b44c2ced1e1704@lerctr.org> In-Reply-To: <ecf4604c9f57aa58c5837a8d6c8bf0b4@lerctr.org> References: <3d1b5249a2c51670de496fad9e8b054c@lerctr.org> <9852ae04-6dd0-1cd4-13fe-e97c68e71b37@FreeBSD.org> <b73464cb07d071b35590e7ee22bae88a@lerctr.org> <ecf4604c9f57aa58c5837a8d6c8bf0b4@lerctr.org>
next in thread | previous in thread | raw e-mail | index | archive | help
On 12/16/2021 9:03 pm, Larry Rosenman wrote: > On 12/10/2021 10:43 am, Larry Rosenman wrote: >> 14-2021_12_07-1217 - - 1.87G 2021-12-07 >> 12:17 >> 14-2021_12_09-1957 NR / 121G 2021-12-09 >> 19:57 >> >> If that's any help >> >> On 12/10/2021 10:36 am, Alexander Motin wrote: >>> Hi Larry, >>> >>> This looks like some use-after-free or otherwise corrupted callout >>> structure. Unfortunately the backtrace does not tell what was the >>> callout. When was the previous update to look what could change? >>> >>> On 10.12.2021 11:24, Larry Rosenman wrote: >>>> FreeBSD borg.lerctr.org 14.0-CURRENT FreeBSD 14.0-CURRENT #15 >>>> main-n251537-ab639f2398b: Thu Dec 9 19:45:37 CST 2021 >>>> root@borg.lerctr.org:/usr/obj/usr/src/amd64.amd64/sys/LER-MINIMAL >>>> amd64 >>>> >>>> VMCORE *IS* available. >>>> >>>> >>>> >>>> >>>> Unread portion of the kernel message buffer: >>>> kernel trap 12 with interrupts disabled >>>> >>>> >>>> Fatal trap 12: page fault while in kernel mode >>>> cpuid = 0; apic id = 20 >>>> fault virtual address = 0x0 >>>> fault code = supervisor write data, page not present >>>> instruction pointer = 0x20:0xffffffff804e0db4 >>>> stack pointer = 0x0:0xfffffe0434de4e10 >>>> frame pointer = 0x0:0xfffffe0434de4e70 >>>> code segment = base 0x0, limit 0xfffff, type 0x1b >>>> = DPL 0, pres 1, long 1, def32 0, gran 1 >>>> processor eflags = resume, IOPL = 0 >>>> current process = 82990 (c++) >>>> trap number = 12 >>>> panic: page fault >>>> cpuid = 0 >>>> time = 1639111198 >>>> KDB: stack backtrace: >>>> #0 0xffffffff8050fc95 at kdb_backtrace+0x65 >>>> #1 0xffffffff804c468f at vpanic+0x17f >>>> #2 0xffffffff804c4503 at panic+0x43 >>>> #3 0xffffffff807a2195 at trap_fatal+0x385 >>>> #4 0xffffffff807a21ef at trap_pfault+0x4f >>>> #5 0xffffffff80779c78 at calltrap+0x8 >>>> #6 0xffffffff8045ddb8 at handleevents+0x188 >>>> #7 0xffffffff8045ea3e at timercb+0x24e >>>> #8 0xffffffff807ca9eb at lapic_handle_timer+0x9b >>>> #9 0xffffffff8077b9b1 at Xtimerint+0xb1 >>>> Uptime: 2h28m57s >>>> Dumping 12829 out of 131023 >>>> MB:..1%..11%..21%..31%..41%..51%..61%..71%..81%..91% >>>> >>>> __curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:55 >>>> 55 __asm("movq %%gs:%P1,%0" : "=r" (td) : "n" >>>> (offsetof(struct pcpu, >>>> (kgdb) #0 __curthread () at >>>> /usr/src/sys/amd64/include/pcpu_aux.h:55 >>>> #1 doadump (textdump=<optimized out>) >>>> at /usr/src/sys/kern/kern_shutdown.c:399 >>>> #2 0xffffffff804c428c in kern_reboot (howto=260) >>>> at /usr/src/sys/kern/kern_shutdown.c:487 >>>> #3 0xffffffff804c46fe in vpanic (fmt=0xffffffff807e1276 "%s", >>>> ap=<optimized out>) at /usr/src/sys/kern/kern_shutdown.c:920 >>>> #4 0xffffffff804c4503 in panic (fmt=<unavailable>) >>>> at /usr/src/sys/kern/kern_shutdown.c:844 >>>> #5 0xffffffff807a2195 in trap_fatal (frame=0xfffffe0434de4d50, >>>> eva=0) >>>> at /usr/src/sys/amd64/amd64/trap.c:946 >>>> #6 0xffffffff807a21ef in trap_pfault (frame=0xfffffe0434de4d50, >>>> usermode=false, signo=<optimized out>, ucode=<optimized out>) >>>> at /usr/src/sys/amd64/amd64/trap.c:765 >>>> #7 <signal handler called> >>>> #8 0xffffffff804e0db4 in callout_process >>>> (now=now@entry=38385536922300) >>>> at /usr/src/sys/kern/kern_timeout.c:488 >>>> #9 0xffffffff8045ddb8 in handleevents >>>> (now=now@entry=38385536922300, >>>> fake=fake@entry=0) at /usr/src/sys/kern/kern_clocksource.c:213 >>>> #10 0xffffffff8045ea3e in timercb (et=0xffffffff80d475e0 <lapic_et>, >>>> arg=<optimized out>) at /usr/src/sys/kern/kern_clocksource.c:357 >>>> #11 0xffffffff807ca9eb in lapic_handle_timer >>>> (frame=0xfffffe0434de4f40) >>>> at /usr/src/sys/x86/x86/local_apic.c:1364 >>>> #12 <signal handler called> >>>> #13 0x000000080df42bb6 in ?? () >>>> Backtrace stopped: Cannot access memory at address 0x7ffffdef2c90 >>>> (kgdb) >>>> >>>> ------------------------------------------------------------------------ >>>> > ' > > I got a new crash on a today's current: > ❯ more core.txt.1 > borg.lerctr.org dumped core - see /var/crash/vmcore.1 > > Thu Dec 16 17:01:37 CST 2021 > > FreeBSD borg.lerctr.org 14.0-CURRENT FreeBSD 14.0-CURRENT #22 > main-n251748-c610426c4de: Thu Dec 16 09:22:52 CST 2021 > root@borg.lerctr.org:/usr/obj/usr/src/amd64.amd64/sys/LER-MINIMAL > amd64 > > panic: page fault > > GNU gdb (GDB) 11.1 [GDB v11.1 for FreeBSD] > Copyright (C) 2021 Free Software Foundation, Inc. > License GPLv3+: GNU GPL version 3 or later > <http://gnu.org/licenses/gpl.html> > This is free software: you are free to change and redistribute it. > There is NO WARRANTY, to the extent permitted by law. > Type "show copying" and "show warranty" for details. > This GDB was configured as "x86_64-portbld-freebsd14.0". > Type "show configuration" for configuration details. > For bug reporting instructions, please see: > <https://www.gnu.org/software/gdb/bugs/>. > Find the GDB manual and other documentation resources online at: > <http://www.gnu.org/software/gdb/documentation/>. > > For help, type "help". > Type "apropos word" to search for commands related to "word"... > Reading symbols from /boot/kernel/kernel... > Reading symbols from /usr/lib/debug//boot/kernel/kernel.debug... > > Unread portion of the kernel message buffer: > kernel trap 12 with interrupts disabled > > > Fatal trap 12: page fault while in kernel mode > cpuid = 0; apic id = 20 > fault virtual address = 0x0 > fault code = supervisor write data, page not present > instruction pointer = 0x20:0xffffffff804e0a34 > stack pointer = 0x0:0xfffffe03441a0e10 > frame pointer = 0x0:0xfffffe03441a0e70 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags = resume, IOPL = 0 > current process = 86479 (c++) > trap number = 12 > panic: page fault > cpuid = 0 > time = 1639694532 > KDB: stack backtrace: > #0 0xffffffff8050f9e5 at kdb_backtrace+0x65 > #1 0xffffffff804c430f at vpanic+0x17f > #2 0xffffffff804c4183 at panic+0x43 > #3 0xffffffff807a2195 at trap_fatal+0x385 > #4 0xffffffff807a21ef at trap_pfault+0x4f > #5 0xffffffff80779728 at calltrap+0x8 > #6 0xffffffff8045da08 at handleevents+0x188 > #7 0xffffffff8045e68e at timercb+0x24e > #8 0xffffffff807ca9eb at lapic_handle_timer+0x9b > #9 0xffffffff8077b461 at Xtimerint+0xb1 > Uptime: 7h7m44s > Dumping 13614 out of 131023 > MB:..1%..11%..21%..31%..41%..51%..61%..71%..81%..91% > > __curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:55 > 55 __asm("movq %%gs:%P1,%0" : "=r" (td) : "n" > (offsetof(struct pcpu, > (kgdb) #0 __curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:55 > #1 doadump (textdump=<optimized out>) > at /usr/src/sys/kern/kern_shutdown.c:399 > #2 0xffffffff804c3f0c in kern_reboot (howto=260) > at /usr/src/sys/kern/kern_shutdown.c:487 > #3 0xffffffff804c437e in vpanic (fmt=0xffffffff807e1291 "%s", > ap=<optimized out>) at /usr/src/sys/kern/kern_shutdown.c:920 > #4 0xffffffff804c4183 in panic (fmt=<unavailable>) > at /usr/src/sys/kern/kern_shutdown.c:844 > #5 0xffffffff807a2195 in trap_fatal (frame=0xfffffe03441a0d50, eva=0) > at /usr/src/sys/amd64/amd64/trap.c:946 > #6 0xffffffff807a21ef in trap_pfault (frame=0xfffffe03441a0d50, > usermode=false, signo=<optimized out>, ucode=<optimized out>) > at /usr/src/sys/amd64/amd64/trap.c:765 > #7 <signal handler called> > #8 0xffffffff804e0a34 in callout_process > (now=now@entry=110228055503582) > at /usr/src/sys/kern/kern_timeout.c:488 > #9 0xffffffff8045da08 in handleevents (now=now@entry=110228055503582, > fake=fake@entry=0) at /usr/src/sys/kern/kern_clocksource.c:213 > #10 0xffffffff8045e68e in timercb (et=0xffffffff80d47660 <lapic_et>, > arg=<optimized out>) at /usr/src/sys/kern/kern_clocksource.c:357 > #11 0xffffffff807ca9eb in lapic_handle_timer (frame=0xfffffe03441a0f40) > at /usr/src/sys/x86/x86/local_apic.c:1364 > #12 <signal handler called> > #13 0x0000000003883679 in ?? () > Backtrace stopped: Cannot access memory at address 0x7fffffff6f20 > (kgdb) > > > Core is ALSO available > > Boot Env: > ❯ bectl list > BE Active Mountpoint Space Created > 14-2021-10-26_1554 - - 1.82G 2021-10-26 15:54 > 14-2021-11-03-1800 - - 162M 2021-11-03 18:00 > 14-2021_10_19-1900 - - 1.80G 2021-10-19 18:57 > 14-2021_10_20-0800 - - 1.94G 2021-10-20 08:01 > 14-2021_11_18-1241 - - 1.86G 2021-11-18 11:41 > 14-2021_11_20-1417 - - 1.85G 2021-11-20 13:17 > 14-2021_11_23-1111 - - 1.87G 2021-11-23 11:11 > 14-2021_11_25-1312 - - 1.87G 2021-11-25 12:12 > 14-2021_12_04-2220 - - 13.7M 2021-12-04 22:20 > 14-2021_12_07-1217 - - 1.87G 2021-12-07 12:17 > 14-2021_12_09-1957 - - 1.89G 2021-12-09 19:57 > 14-2021_12_14-0224 - - 1.82G 2021-12-14 02:24 > 14-2021_12_15-0923 - - 18.6M 2021-12-15 09:23 > 14-2021_12_15-2133 - - 1.83G 2021-12-15 21:33 > 14-2021_12_15-2257 - - 1.82G 2021-12-15 22:57 > 14-2021_12_16-0924 NR / 129G 2021-12-16 09:24 > 14-main-first - - 2.69G 2021-10-02 20:11 > 14.0-CURRENT-2021-10-04_1051 - - 16.6M 2021-10-04 10:51 > 14.0-CURRENT_2021-10-06_184540 - - 12.9M 2021-10-06 18:46 > 14.0-CURRENT_2021-11-04_091349 - - 17.7M 2021-11-04 09:13 > 14.0-CURRENT_2021-12-05_204803 - - 10.6M 2021-12-05 20:48 > r363086 - - 4.19G 2020-07-10 15:37 > > what else can I supply? FTR Both crashes were during LONG poudriere runs rebuilding all 800+ ports I use after a FreeBSD_Version bump. -- Larry Rosenman http://www.lerctr.org/~ler Phone: +1 214-642-9640 E-Mail: ler@lerctr.org US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?35ce3d9ab427375f09b44c2ced1e1704>