From owner-p4-projects@FreeBSD.ORG Mon Apr 2 20:47:35 2007 Return-Path: X-Original-To: p4-projects@freebsd.org Delivered-To: p4-projects@freebsd.org Received: by hub.freebsd.org (Postfix, from userid 32767) id 77CDB16A406; Mon, 2 Apr 2007 20:47:35 +0000 (UTC) X-Original-To: perforce@freebsd.org Delivered-To: perforce@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 0925A16A47D; Mon, 2 Apr 2007 20:47:35 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from server.baldwin.cx (66-23-211-162.clients.speedfactory.net [66.23.211.162]) by mx1.freebsd.org (Postfix) with ESMTP id 8543F13C448; Mon, 2 Apr 2007 20:47:34 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from localhost.corp.yahoo.com (john@localhost [127.0.0.1]) (authenticated bits=0) by server.baldwin.cx (8.13.8/8.13.8) with ESMTP id l32KlV9Y088373; Mon, 2 Apr 2007 16:47:32 -0400 (EDT) (envelope-from jhb@freebsd.org) From: John Baldwin To: Marcel Moolenaar Date: Mon, 2 Apr 2007 16:42:06 -0400 User-Agent: KMail/1.9.6 References: <200704012152.l31LqHuB022635@repoman.freebsd.org> <200704021155.21453.jhb@freebsd.org> <645BFA2D-3FC3-4AAB-ADCC-8D18431688E9@mac.com> In-Reply-To: <645BFA2D-3FC3-4AAB-ADCC-8D18431688E9@mac.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200704021642.06901.jhb@freebsd.org> X-Greylist: Sender succeeded SMTP AUTH authentication, not delayed by milter-greylist-2.0.2 (server.baldwin.cx [127.0.0.1]); Mon, 02 Apr 2007 16:47:32 -0400 (EDT) X-Virus-Scanned: ClamAV 0.88.3/2997/Mon Apr 2 06:19:52 2007 on server.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-4.4 required=4.2 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.1.3 X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on server.baldwin.cx Cc: Marcel Moolenaar , Perforce Change Reviews Subject: Re: PERFORCE change 117140 for review X-BeenThere: p4-projects@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: p4 projects tree changes List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 02 Apr 2007 20:47:35 -0000 On Monday 02 April 2007 01:10:55 pm Marcel Moolenaar wrote: > > On Apr 2, 2007, at 8:55 AM, John Baldwin wrote: > > >> For example, to analyze machine checks on on pluto1, I disabled CPU0 > >> and CPU1 in succession to see if one of the CPUs was the cause of the > >> MC. As such, CPU1 had to be the BSP when CPU0 was disabled. Luckily > >> pluto1 is only a dual-CPU machine, so that disabling a CPU also stops > >> SMP operation :-) > > > > FreeBSD CPU ID's != firmware CPU IDs. > > It is important, and not only for identification, that the logical > CPU ID > used by FreeBSD is the same as used by the firmware (if at all > possible). > A different ID only causes confusion, especially when the firmware draws > its IDs from the same domain. What is called CPU4 within FreeBSD may not > be called CPU4 in the firmware, even though CPU4 may exist. Given cores vs htts, etc. it would seem that what FreeBSD considers a CPU should be a purely logical object, and not required to be tied to whatever firmware, etc. BTW, thanks for ignoring the discussion about this on arch over the past several years. :-/ > > CPU ID's != local APIC ID's on x86 for example, nor > > are they identical to the CPU indices in the hwprb on Alpha. > > They are also not the same on ia64. In fact, on ia64 the APIC ID > consists of 2 elements. The ACPI ID is exactly the kind of ID we > want... Then ia64 likely is not shutting down properly: /* * Shutdown the system cleanly to prepare for reboot, halt, or power off. */ static void boot(int howto) { static int first_buf_printf = 1; #if defined(SMP) /* * Bind us to CPU 0 so that all shutdown code runs there. Some * systems don't shutdown properly (i.e., ACPI power off) if we * run on another processor. */ mtx_lock_spin(&sched_lock); sched_bind(curthread, 0); mtx_unlock_spin(&sched_lock); KASSERT(PCPU_GET(cpuid) == 0, ("boot: not running on cpu 0")); #endif > > FreeBSD CPU > > ID's tend to not be sparse for example, because they are completely > > separate > > from firmware IDs. > > This statement is flawed. Sparseness is unavoidable when CPUs can be > hot-plugged. While we should have a CPU ID that maps trivially to a > bit field for masking purposes, there's no reason to allow sparse IDs > to certain extend. I said "tend to", not "required to". We had a discussion about this when mp_maxid was settled upon for UMA rather than using MAXCPU. > Both ACPI and Open Firmware have CPU IDs that map trivially to a mask > and they tend to be dense. I see no reason to not use the firmware IDs > as FreeBSD's notion of CPU ID. In fact, I see reasons not to create > our own IDs. In such reason is the added overhead of mapping from one > to the other during runtime. ACPI ID's may be sparse, but they don't have to be. They could start at 2 billion if they wanted to (they are UInt32 and are only small right now due to the current whims of BIOS writes), and then they wouldn't fit into the current cpumask. I see no problem with requiring the MD code to determine which CPUs exist from firmware/BIOS/etc. and then map that into a logical ID space that fits into that arch's cpumask_t while keeping cpumask_t simple if possible. > I think that testing for CPU0 when we really want to know if we're > running on the BSP is also flawed. On ia64 there's typically 1 BSP > that is used to boot the machine, but each cell in a NUMA system > has a monarch CPU that serves the purpose of the BSP for that cell. > This means that some tests that check for the BSP may need to be > changed to check for the monarch instead. Since there can obviously > be only 1 CPU0, there will (ipso facto) be BSP-like processors with > an ID != 0. It's therefore better not to assume "special powers" for > CPU0 and instead check the PCPU for flags that corresponds 1-on-1 > with such powers. I'm not a huge fan of assuming CPU 0 is BSP, but that is the current model under which FreeBSD operates. I'm not sure what should happen for shutdown in NUMA systems for example as in the patch above. If you have an alternative design, feel free to present it on arch@. Until then, it would probably be best to at least conform to what is there now. -- John Baldwin