FreeBSD Mail Archives

Date:      Mon, 3 May 2021 15:45:53 -0400
From:      Mark Johnston <markj@freebsd.org>
To:        Michael Schmiedgen <schmiedgen@gmx.net>
Cc:        freebsd-net@freebsd.org
Subject:   Re: page fault while in kernel mode - after upgrade from 12.2 to 13.0
Message-ID:  <YJBS8YMZFkMtWPEu@nuc>
In-Reply-To: <d7c3bfbd-2e54-c0f4-ec23-5dab08287ea3@gmx.net>
References:  <d7c3bfbd-2e54-c0f4-ec23-5dab08287ea3@gmx.net>

On Mon, May 03, 2021 at 08:04:30PM +0200, Michael Schmiedgen wrote:
> Hi List,
> 
> if I start a Samba jail, after a few seconds the system crashes. Very reproducible.
> 
> System has ~10 jails and 3 bhyve VMs. Dell server, Xeon E3-1240, 64GB RAM, 3 way mirror ZFS.
> 
> It also occurs a few seconds after I start a phone call using the SIP VM of that machine,
> very strange.
> 
> I got some log messages suggesting raising somaxconn, so I did
> 
> kern.ipc.somaxconn=4096
> 
> in sysctl.conf
> 
> 
> Below some debug information, please let me know if I should provide further information.
> 
> Should I open a bug or something?
> 
> Thank you very much!
>    Michael
> 
> 
> 
> Fatal trap 12: page fault while in kernel mode
> cpuid = 0; apic id = 00
> fault virtual address   = 0x0
> fault code              = supervisor read data, page not present
> instruction pointer     = 0x20:0xffffffff80ca52c0
> stack pointer           = 0x28:0xfffffe019d039650
> frame pointer           = 0x28:0xfffffe019d039690
> code segment            = base 0x0, limit 0xfffff, type 0x1b
>                          = DPL 0, pres 1, long 1, def32 0, gran 1
> processor eflags        = interrupt enabled, resume, IOPL = 0
> current process         = 649 (devd)
> trap number             = 12
> panic: page fault
> cpuid = 0
> time = 1620061253
> KDB: stack backtrace:
> #0 0xffffffff80c57345 at kdb_backtrace+0x65
> #1 0xffffffff80c09d21 at vpanic+0x181
> #2 0xffffffff80c09b93 at panic+0x43
> #3 0xffffffff8108b187 at trap_fatal+0x387
> #4 0xffffffff8108b1df at trap_pfault+0x4f
> #5 0xffffffff8108a83d at trap+0x27d
> #6 0xffffffff810617a8 at calltrap+0x8
> #7 0xffffffff80ca51c3 at sbappendaddr_locked+0x93
> #8 0xffffffff80cb437a at uipc_send+0x73a
> #9 0xffffffff80ca9053 at sosend_generic+0x633
> #10 0xffffffff80ca94e0 at sosend+0x50
> #11 0xffffffff80caff2e at kern_sendit+0x20e
> #12 0xffffffff80cb032b at sendit+0x1db
> #13 0xffffffff80cb013d at sys_sendto+0x4d
> #14 0xffffffff8108ba8c at amd64_syscall+0x10c
> #15 0xffffffff810620ce at fast_syscall_common+0xf8
> Uptime: 2m2s
> Dumping 2373 out of 65454 MB:..1%..11%..21%..31%..41%..51%..61%..71%..81%..91%
> 
> __curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:55
> 55              __asm("movq %%gs:%P1,%0" : "=r" (td) : "n" (offsetof(struct pcpu,
> (kgdb) list *0xffffffff80ca52c0
> 0xffffffff80ca52c0 is in sbappendaddr_locked_internal (/usr/src/sys/kern/uipc_sockbuf.c:1169).
> 1164            if (ctrl_last)
> 1165                    ctrl_last->m_next = m0; /* concatenate data to control */
> 1166            else
> 1167                    control = m0;
> 1168            m->m_next = control;
> 1169            for (n = m; n->m_next != NULL; n = n->m_next)
> 1170                    sballoc(sb, n);
> 1171            sballoc(sb, n);
> 1172            nlast = n;
> 1173            SBLINKRECORD(sb, m);

So we are crashing because "n" is somehow NULL?  That seems difficult to
explain.  Can you show the local variables in this frame?

Does the panic always have the same stack trace?

Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?YJBS8YMZFkMtWPEu>

Header And Logo

Peripheral Links

Site Navigation

Header And Logo

Peripheral Links

Search

Site Navigation