From owner-freebsd-net@freebsd.org Wed Sep 21 14:31:31 2016 Return-Path: Delivered-To: freebsd-net@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 6BC40BE3C41 for ; Wed, 21 Sep 2016 14:31:31 +0000 (UTC) (envelope-from gerrit.kuehn@aei.mpg.de) Received: from umail.aei.mpg.de (umail.aei.mpg.de [194.94.224.6]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id F0089693 for ; Wed, 21 Sep 2016 14:31:30 +0000 (UTC) (envelope-from gerrit.kuehn@aei.mpg.de) Received: from mailgate.aei.mpg.de (mailgate.aei.mpg.de [194.94.224.5]) by umail.aei.mpg.de (Postfix) with ESMTP id 142D32000E7 for ; Wed, 21 Sep 2016 16:22:15 +0200 (CEST) Received: from mailgate.aei.mpg.de (localhost [127.0.0.1]) by localhost (Postfix) with SMTP id 049D0406ADE for ; Wed, 21 Sep 2016 16:22:15 +0200 (CEST) Received: from intranet.aei.uni-hannover.de (ahin1.aei.uni-hannover.de [130.75.117.40]) by mailgate.aei.mpg.de (Postfix) with ESMTP id D861C406ADB for ; Wed, 21 Sep 2016 16:22:14 +0200 (CEST) Received: from arc.aei.uni-hannover.de ([130.75.117.1]) by intranet.aei.uni-hannover.de (IBM Domino Release 9.0.1FP5) with ESMTP id 2016092116221448-24781 ; Wed, 21 Sep 2016 16:22:14 +0200 Date: Wed, 21 Sep 2016 16:22:14 +0200 From: Gerrit =?ISO-8859-1?Q?K=FChn?= To: freebsd-net@freebsd.org Subject: igb/pf-related crashes Message-Id: <20160921162214.dcc6dca77d4a42c73df6de5e@aei.mpg.de> Organization: Max Planck Gesellschaft X-Mailer: Sylpheed 3.4.2 (GTK+ 2.24.22; amd64-portbld-freebsd10.0) Mime-Version: 1.0 X-MIMETrack: Itemize by SMTP Server on intranet/aei-hannover(Release 9.0.1FP5|November 22, 2015) at 21/09/2016 16:22:14, Serialize by Router on intranet/aei-hannover(Release 9.0.1FP5|November 22, 2015) at 21/09/2016 16:22:14, Serialize complete at 21/09/2016 16:22:14 X-TNEFEvaluated: 1 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII X-PMX-Version: 6.0.2.2308539, Antispam-Engine: 2.7.2.2107409, Antispam-Data: 2016.9.21.141217 X-PerlMx-Spam: Gauge=IIIIIIII, Probability=8%, Report=' HTML_00_01 0.05, HTML_00_10 0.05, MIME_LOWER_CASE 0.05, SUPERLONG_LINE 0.05, BODYTEXTP_SIZE_3000_LESS 0, BODY_SIZE_2000_2999 0, BODY_SIZE_5000_LESS 0, BODY_SIZE_7000_LESS 0, NO_CTA_URI_FOUND 0, NO_URI_FOUND 0, NO_URI_HTTPS 0, __CT 0, __CTE 0, __CT_TEXT_PLAIN 0, __HAS_FROM 0, __HAS_MSGID 0, __HAS_X_MAILER 0, __MIME_TEXT_ONLY 0, __MIME_TEXT_P 0, __MIME_TEXT_P1 0, __MIME_VERSION 0, __SANE_MSGID 0, __SUBJ_ALPHA_END 0, __TO_MALFORMED_2 0, __TO_NO_NAME 0' X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 21 Sep 2016 14:31:31 -0000 Hi all, Over the last weekend, I saw a couple of crashes on a 10.3-system acting as filtering bridge with pf that all looked like this: --- Unread portion of the kernel message buffer: Fatal trap 12: page fault while in kernel mode cpuid = 0; apic id = 00 fault virtual address = 0xe fault code = supervisor write data, page not present instruction pointer = 0x20:0xffffffff81c45fb9 stack pointer = 0x28:0xfffffe01eeaa54a0 frame pointer = 0x28:0xfffffe01eeaa55d0 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, long 1, def32 0, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 12 (irq275: igb2:que 0) trap number = 12 panic: page fault cpuid = 0 KDB: stack backtrace: #0 0xffffffff8098e390 at kdb_backtrace+0x60 #1 0xffffffff80951066 at vpanic+0x126 #2 0xffffffff80950f33 at panic+0x43 #3 0xffffffff80d55f7b at trap_fatal+0x36b #4 0xffffffff80d5627d at trap_pfault+0x2ed #5 0xffffffff80d558fa at trap+0x47a #6 0xffffffff80d3b8d2 at calltrap+0x8 #7 0xffffffff81c2c4c8 at pf_test+0x1c8 #8 0xffffffff81c3e07d at pf_check_in+0x1d #9 0xffffffff80a24ef4 at pfil_run_hooks+0x84 #10 0xffffffff81c148e3 at bridge_pfil+0x363 #11 0xffffffff81c153a5 at bridge_forward+0x1c5 #12 0xffffffff81c13c55 at bridge_input+0x555 #13 0xffffffff80a1c05a at ether_nh_input+0x2ba #14 0xffffffff80a24092 at netisr_dispatch_src+0x62 #15 0xffffffff804f859c at igb_rxeof+0x60c #16 0xffffffff804f8c41 at igb_msix_que+0x121 #17 0xffffffff8091c99b at intr_event_execute_handlers+0xab --- Before that happening, the system was fine for about 6 months. Over the weekend I saw about 10 identical crashes and subsequent reboots like this (with sometimes only a few minutes of operation between). Running out of other ideas, I updated the system to latest release (-p7), and up to now it didn't happen again. Any ideas what might be causing this? cu Gerrit