From owner-freebsd-stable@FreeBSD.ORG Fri Jul 1 20:38:07 2005 Return-Path: X-Original-To: freebsd-stable@FreeBSD.org Delivered-To: freebsd-stable@FreeBSD.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 357F116A41C for ; Fri, 1 Jul 2005 20:38:07 +0000 (GMT) (envelope-from glebius@FreeBSD.org) Received: from relay.bestcom.ru (relay.bestcom.ru [217.72.144.5]) by mx1.FreeBSD.org (Postfix) with ESMTP id 9EA1843D1D for ; Fri, 1 Jul 2005 20:38:06 +0000 (GMT) (envelope-from glebius@FreeBSD.org) Received: from cell.sick.ru (root@cell.sick.ru [217.72.144.68]) by relay.bestcom.ru (8.13.1/8.12.9) with ESMTP id j61Kc44u010410 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL); Sat, 2 Jul 2005 00:38:05 +0400 (MSD) (envelope-from glebius@FreeBSD.org) Received: from cell.sick.ru (glebius@localhost [127.0.0.1]) by cell.sick.ru (8.13.1/8.12.8) with ESMTP id j61Kc4mM060108 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Sat, 2 Jul 2005 00:38:04 +0400 (MSD) (envelope-from glebius@FreeBSD.org) Received: (from glebius@localhost) by cell.sick.ru (8.13.1/8.13.1/Submit) id j61Kc3D4060107; Sat, 2 Jul 2005 00:38:03 +0400 (MSD) (envelope-from glebius@FreeBSD.org) X-Authentication-Warning: cell.sick.ru: glebius set sender to glebius@FreeBSD.org using -f Date: Sat, 2 Jul 2005 00:38:03 +0400 From: Gleb Smirnoff To: Gary Mu1der Message-ID: <20050701203803.GG59610@cell.sick.ru> References: <42B961B9.7A5856B3@freebsd.org> <20050623104230.GB61389@cell.sick.ru> <20050623141514.GD738@obiwan.tataz.chchile.org> <42BC5EE2.2020003@infotechfl.com> <20050627082958.GB97832@cell.sick.ru> <42C16BBF.4060107@infotechfl.com> <20050701085808.GD52023@cell.sick.ru> <42C58373.60008@infotechfl.com> <20050701201308.GD59610@cell.sick.ru> <42C5A866.1010409@infotechfl.com> Mime-Version: 1.0 Content-Type: text/plain; charset=koi8-r Content-Disposition: inline In-Reply-To: <42C5A866.1010409@infotechfl.com> User-Agent: Mutt/1.5.6i X-Virus-Scanned: ClamAV version devel-20050125, clamav-milter version 0.80ff on relay.bestcom.ru X-Virus-Status: Clean Cc: freebsd-stable@FreeBSD.org Subject: Re: panic in RELENG_5 UMA - two new stack traces X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 01 Jul 2005 20:38:07 -0000 On Fri, Jul 01, 2005 at 04:32:38PM -0400, Gary Mu1der wrote: G> >G> I can reproduce the crash within 60 seconds of firing off 30+ ping/arp G> >G> -d scripts, all running in parallel. G> >G> G> >G> debug.mpsafenet=0 seems to have solved the problem. I'm running 100+ G> >G> instances of the above script and the system has been stable for over G> >an G> hour. G> > G> >Thanks! We definitely see that the bug is a race, not a broken logic. I am G> >almost sure, that you are experiencing the same bug as I described in G> >the beginning of the thread. G> > G> >Although there is no yet fix available for race between 'arp -d' and G> >outgoing packet, there is one for race between incoming ARP reply and G> >outgoing packet. We will probably commit it soon, after more review. G> G> Is this bug specific to only using "arp -d", or does it look like the G> "arp -d" tests identify a bug that might cause TCP/IP related crashes G> with other types of real-world network traffic. G> G> To rephrase: Does it look like fixing this bug may fix a lot of the G> network-related crashes a number of people have reported? See above in the thread. We have two races: one that can fire anytime in runtime, and we are going to fix it. The other with 'arp -d', not fixed yet. I am not sure how many reports on network related panics where related to this race. Let's fix it and see. You can patch your boxes with the patch and see whether they are more stable in runtime. -- Totus tuus, Glebius. GLEBIUS-RIPN GLEB-RIPE