From owner-freebsd-stable@FreeBSD.ORG Wed Feb 29 17:59:55 2012 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id BE531106564A; Wed, 29 Feb 2012 17:59:55 +0000 (UTC) (envelope-from lacombar@gmail.com) Received: from mail-we0-f182.google.com (mail-we0-f182.google.com [74.125.82.182]) by mx1.freebsd.org (Postfix) with ESMTP id 247128FC16; Wed, 29 Feb 2012 17:59:54 +0000 (UTC) Received: by werl4 with SMTP id l4so2809950wer.13 for ; Wed, 29 Feb 2012 09:59:54 -0800 (PST) Received-SPF: pass (google.com: domain of lacombar@gmail.com designates 10.180.86.230 as permitted sender) client-ip=10.180.86.230; Authentication-Results: mr.google.com; spf=pass (google.com: domain of lacombar@gmail.com designates 10.180.86.230 as permitted sender) smtp.mail=lacombar@gmail.com; dkim=pass header.i=lacombar@gmail.com Received: from mr.google.com ([10.180.86.230]) by 10.180.86.230 with SMTP id s6mr19691730wiz.16.1330538394227 (num_hops = 1); Wed, 29 Feb 2012 09:59:54 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; bh=cONjesBBvGwYwM3mww00kmgm71pZ78KXkL08Yh4wP+A=; b=q0WgTXn03W9qUwhws+vSxlDl/iHF/85zY5WAyZuvCDOYWtNIRJNWcC701dsKK7vZw8 n91ySkCoMBibo+M+isN1FFEr8XUs5n7nfVQP1Vk1Rrcf4oRqyYys2rjJVTM//Q7gy2ON KgGUj9epik1q6Qm3jQlPZxKe1rmh1DfqeIMYs= MIME-Version: 1.0 Received: by 10.180.86.230 with SMTP id s6mr15699078wiz.16.1330538394134; Wed, 29 Feb 2012 09:59:54 -0800 (PST) Received: by 10.216.166.11 with HTTP; Wed, 29 Feb 2012 09:59:54 -0800 (PST) In-Reply-To: References: Date: Wed, 29 Feb 2012 12:59:54 -0500 Message-ID: From: Arnaud Lacombe To: Attilio Rao Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Cc: freebsd-stable Subject: Re: Complete hang on 9.0-RELEASE X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 29 Feb 2012 17:59:55 -0000 Hi, On Mon, Feb 27, 2012 at 12:48 PM, Arnaud Lacombe wrote= : > Hi, > > On Mon, Feb 27, 2012 at 10:36 AM, Attilio Rao wrote= : >> 2012/2/27, Arnaud Lacombe : >>> Hi, >>> >>> On Tue, Feb 14, 2012 at 11:41 AM, Arnaud Lacombe w= rote: >>>> Hi folks, >>>> >>>> For the records, I was running some tests yesterday on top of a >>>> 9.0-RELEASE, amd64, kernel when the box hanged. At the time of the >>>> hang, the box was running a process with about 2800 threads with heavy >>>> IPC between 1400 writers and 1400 readers. The box was in single user >>>> mode (/bin/sh coming from FreeBSD 7.4-STABLE). Here is the beginning >>>> of the dmesg: >>>> >>> This happened a second time, now with FreeBSD 8.2-RELEASE. Complete >>> machine hang. The machine was running about 4000 threads in a single >>> process, all the other condition are the same. >> >> Arnaud, >> can you please break in your kernel via KDB, collect the following >> informations from the DDB prompt: >> - ps >> - alltrace >> - show allpcpu >> - possibly get a coredump with 'call doadump' >> > Will do, but I'll need to rebuild a kernel to include DDB. > >> and in the end provide all those along with kernel binary and possibly >> sources somewhere? >> > I'll be testing a bare `release/8.2.0' with the following patch: > > diff --git a/sys/amd64/conf/GENERIC b/sys/amd64/conf/GENERIC > index c3e0095..7bd997f 100644 > --- a/sys/amd64/conf/GENERIC > +++ b/sys/amd64/conf/GENERIC > @@ -79,6 +79,10 @@ options =A0 =A0 =A0INCLUDE_CONFIG_FILE =A0 =A0 # Inclu= de this > file in kernel > > =A0options =A0 =A0 =A0 =A0KDB =A0 =A0 =A0 =A0 =A0 # Kernel debugger relat= ed code > =A0options =A0 =A0 =A0 =A0KDB_TRACE =A0 =A0 # Print a stack trace for a p= anic > +options =A0 =A0 =A0 =A0DDB > +options =A0 =A0 =A0 =A0BREAK_TO_DEBUGGER > +options =A0 =A0 =A0 =A0ALT_BREAK_TO_DEBUGGER > > =A0# Make an SMP-capable kernel by default > =A0options =A0 =A0 =A0 =A0SMP =A0 =A0 =A0 =A0 =A0 # Symmetric MultiProces= sor Kernel > ok, it happened again after 2 days, the process was running about 3200 threads. I'm trying to break into DDB and let you know, I'm not that successful for now... - Arnaud > =A0- Arnaud > >> Thanks, >> Attilio >> >> >> -- >> Peace can only be achieved by understanding - A. Einstein