From owner-freebsd-bugs@freebsd.org Sat Sep 1 07:00:18 2018
From: bugzilla-noreply@freebsd.org
To: bugs@FreeBSD.org
Subject: [Bug 231067] Random kernel panic after r338257
Date: Sat, 01 Sep 2018 07:00:15 +0000

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=231067

            Bug ID: 231067
           Summary: Random kernel panic after r338257
           Product: Base System
           Version: CURRENT
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Some People
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: fuyuhiko.maruyama@gmail.com

After r338257, kernel panics happen at random times. This may be triggered by
named, in my case dns/bind911.

With any kernel after r338257, make buildworld in /usr/src cannot be completed
because of kernel panics. Typically the kernel panics within a few minutes of
starting make buildworld.

I found the origin of these panics by a binary search of svn head and
determined that r338256 (with r338260's small fixes for building) is the last
stable kernel that can complete buildworld/buildkernel, and r338257 (with
r338260's small fixes) is the first revision that shows the current problem.

One instance of the panic message is below (with an r338257 kernel), typed by
hand:

Fatal trap 12: page fault while in kernel mode
cpuid = 1: apic id = 01
fault virtual address   = 0x98
fault code              = supervisor read data, page not present
instruction pointer     = 0x20:0xffffffff80988c6f
stack pointer           = 0x28:0xfffffe00a274d370
frame pointer           = 0x28:0xfffffe00a274d5d0
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflag         = interrupt enabled, resume, IOPL = 0
current process         = 766 (isc-worker0000)
[ thread pic 766 tid 100985 ]
Stopped at      ip6_output+0xeaf:       movl    ll+0x77(%rdi),%eax
db> bt
Tracing pid 766 tid 100993 td 0xfffff80024446000
ip6_output() at ip6_output+0xeaf/frame 0xfffffe00a274d5d0
udp6_send() at udp6_send+0x86a/frame 0xfffffe00a274d790
sosend_dgram() at sosend_dgram+0x33b/frame 0xfffffe00a274d800
sosend() at sosend+0x50/frame 0xfffffe00a274d830
kern_sendit() at kern_sendit+0x237/frame 0xfffffe00a274d8d0
sendit() at sendit+0x19e/frame 0xfffffe00a274d920
sys_sendmsg() at sys_sendmsg+0x61/frame 0xfffffe00a274d980
amd64_syscall() at amd64_syscall+0x369/frame 0xfffffe00a274dab0
fast_syscall_common() at fast_syscall_common+0x101/frame 0xfffffe00a274dab0
--- syscall (28, FreeBSD ELF64, sys_sendmsg), rip = 0x800fe0a9a, rsp = 0x7fffdfffb528, rbp = 0x7fffdfffb560 ---

Another example (with an r338406 kernel):

Fatal trap 12: page fault while in kernel mode
cpuid = 2: apic id = 02
fault virtual address   = 0x110
fault code              = supervisor read data, page not present
instruction pointer     = 0x20:0xffffffff80763428
stack pointer           = 0x28:0xfffffe00a21242c0
frame pointer           = 0x28:0xfffffe00a2124360
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflag         = interrupt enabled, resume, IOPL = 0
current process         = 766 (isc-worker0000)
[ thread pic 766 tid 100985 ]
Stopped at      __rw_wlock_hard+0x1d8:  movq    (%rbx),%r14
db>
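
For readers who want to repeat the search, the binary search over svn head
described in the report (last good r338256, first bad r338257) can be sketched
roughly as below. This is an illustration only, not the reporter's actual
procedure. The is_bad callback is hypothetical: it stands in for "build and
boot a kernel at that revision (plus the r338260 build fixes where needed),
run make buildworld, and report whether the kernel panicked". The lower bound
338000 is an invented starting point, not a value from the report. Because the
panic is random, a run that survives is weaker evidence than a run that
panics, so each "good" verdict probably deserves more than one attempt.

```python
from typing import Callable


def first_bad_revision(good: int, bad: int, is_bad: Callable[[int], bool]) -> int:
    """Return the first bad revision in the range (good, bad].

    Assumes `good` is known good, `bad` is known bad, and that every
    revision after the first bad one is also bad.
    """
    if good >= bad:
        raise ValueError("good must be lower than bad")
    # Invariant: `good` is good and `bad` is bad; narrow until adjacent.
    while bad - good > 1:
        mid = (good + bad) // 2
        if is_bad(mid):
            bad = mid
        else:
            good = mid
    return bad


if __name__ == "__main__":
    # Stand-in for the real test; a real is_bad would build, boot and
    # stress the kernel at `rev`. 338000 is a hypothetical known-good start.
    print(first_bad_revision(338000, 338406, lambda rev: rev >= 338257))
```

Each step halves the remaining range, so the roughly 400 revisions in this
hypothetical range need about nine kernel builds.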