From owner-freebsd-current@FreeBSD.ORG Sat Jun 4 09:11:10 2011 Return-Path: Delivered-To: freebsd-current@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 773D0106566C; Sat, 4 Jun 2011 09:11:10 +0000 (UTC) (envelope-from rwatson@FreeBSD.org) Received: from cyrus.watson.org (cyrus.watson.org [65.122.17.42]) by mx1.freebsd.org (Postfix) with ESMTP id 532D68FC08; Sat, 4 Jun 2011 09:11:10 +0000 (UTC) Received: from [192.168.2.112] (host86-173-95-198.range86-173.btcentralplus.com [86.173.95.198]) by cyrus.watson.org (Postfix) with ESMTPSA id 7022346B2E; Sat, 4 Jun 2011 05:11:09 -0400 (EDT) Mime-Version: 1.0 (Apple Message framework v1084) Content-Type: text/plain; charset=us-ascii From: "Robert N. M. Watson" In-Reply-To: <4DE9EB61.3000006@FreeBSD.org> Date: Sat, 4 Jun 2011 10:11:07 +0100 Content-Transfer-Encoding: quoted-printable Message-Id: <8AA26086-DA05-4DDA-9973-AE57328E2C81@FreeBSD.org> References: <4DE8FA2E.4030202@FreeBSD.org> <5E4D0F56-4338-4157-8BC6-17EE2831725F@FreeBSD.org> <4DE9EB61.3000006@FreeBSD.org> To: Andriy Gapon X-Mailer: Apple Mail (2.1084) Cc: freebsd-current@FreeBSD.org, freebsd-stable@FreeBSD.org Subject: Re: [poll / rfc] kdb_stop_cpus X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 04 Jun 2011 09:11:10 -0000 On 4 Jun 2011, at 09:22, Andriy Gapon wrote: > on 03/06/2011 20:57 Robert N. M. Watson said the following: >>=20 >> On 3 Jun 2011, at 16:13, Andriy Gapon wrote: >>=20 >>> I wonder if anybody uses kdb_stop_cpus with non-default value. If, = yes, I >>> am very interested to learn about your usecase for it. >>=20 >> The issue that prompted the sysctl was non-NMI IPIs being used to = enter the >> debugger or reboot following a core hanging with interrupts disabled. = With >> the switch to NMI IPIs in some of those circumstances, life is better = -- at >> least, on hardware that supports non-maskable IPIs. I seem to recall = sparc64 >> doesn't, however? >=20 > Seems to be so as Nathan has also pointed out for PPC. > For this I also plan the following change: >=20 > commit 458ebd9aca7e91fc6e0825c727c7220ab9f61016 >=20 > generic_stop_cpus: move timeout detection code from under = DIAGNOSTIC >=20 > ... and also increase it a bit. > IMO it's better to detect and report the (rather serious) condition = and > allow a system to proceed somehow rather than be stuck in an = endless > loop. Agreed on detecting and reporting. It would be good to confirm that it = works in practice, however, and also that there are no false positives. = I'm not sure what the best test scenarios are for that. Robert