From owner-freebsd-hackers@FreeBSD.ORG Tue Jul 26 14:55:09 2011 Return-Path: Delivered-To: freebsd-hackers@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D24A410656FE; Tue, 26 Jul 2011 14:55:09 +0000 (UTC) (envelope-from avg@FreeBSD.org) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id F0C838FC19; Tue, 26 Jul 2011 14:55:04 +0000 (UTC) Received: from odyssey.starpoint.kiev.ua (alpha-e.starpoint.kiev.ua [212.40.38.101]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id RAA12665; Tue, 26 Jul 2011 17:55:03 +0300 (EEST) (envelope-from avg@FreeBSD.org) Message-ID: <4E2ED546.2080401@FreeBSD.org> Date: Tue, 26 Jul 2011 17:55:02 +0300 From: Andriy Gapon User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:5.0) Gecko/20110705 Thunderbird/5.0 MIME-Version: 1.0 To: FreeBSD Hackers X-Enigmail-Version: 1.2pre Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Cc: Subject: HTT vs SMT in x86 SMP topology reporting X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 26 Jul 2011 14:55:09 -0000 Can anybody explain to me why our _x86_ SMP topology discovery and reporting code sometimes reports "HTT" and sometimes "SMT"? As in FreeBSD/SMP: %d package(s) x %d core(s) x %d HTT threads vs FreeBSD/SMP: %d package(s) x %d core(s) x %d SMT threads As I understand, and quoting Wikipedia (I know, I know), SMT stands for simultaneous multithreading and is a generic term for a particular kind of hardware multithreading: http://en.wikipedia.org/wiki/Simultaneous_multithreading The only known (to me) implementation of SMT for x86 is Intel's Hyper-Threading Technology aka HTT aka HT Technology aka hyperthreading: http://en.wikipedia.org/wiki/Hyper-threading http://software.intel.com/en-us/articles/intel-hyper-threading-technology-your-questions-answered/?wapkw=%28Intel+Hyper-Threading+Technology%29 I understand that HTT implementation has evolved over time and there could be some reasons to distinguish HTT as implemented in e.g. Pentium 4 from HTT as implemented in the recent architectures like Nehalem and Sandy Bridge. But I still do not understand why we have to call the latter SMT when Intel calls it HTT. Also, right now things like machdep.hyperthreading_allowed act only on old hyperthreads (what we report as "HTT") but completely ignore the new ones (what we report as "SMT"). I am not sure that this is correct either... Perhaps there are no good reasons to disable HTT on modern CPUs, but the current behavior can definitely be confusing. In the end I would like to propose that we always report Intel HTT as HTT. Or even drop HTT/SMT from "x %d threads" altogether. Whether we should internally distinguish old HTT from new HTT is not clear to me. E.g. I see that for old HTT we bind interrupts only to one processing unit (aka logical processor aka hardware thread) from each hyperthreading pair. Not sure if new HTT is so much better that we shouldn't be doing the same for it. P.S. Also currently distinction between old/new HTT technologies is made based on availability of CPUID leaf 0xB. Not sure if this is entirely accurate for e.g. Atom processors, which we seem to detect as old HTT. A lengthy quote from volume 3A as an illustration: 8.7.7 Performance Monitoring Counters Performance counters and their companion control MSRs are shared between the logical processors within a processor core for processors based on Intel NetBurst microarchitecture. As a result, software must manage the use of these resources. The performance counter interrupts, events, and precise event monitoring support can be set up and allocated on a per thread (per logical processor) basis. See Section 19.19, “Performance Monitoring and Intel Hyper-Threading Technology in Processors Based on Intel NetBurst Microarchitecture,” for a discussion of performance monitoring in the Intel Xeon processor MP. In Intel Atom processor family that support Intel Hyper-Threading Technology, the performance counters (general-purpose and fixed-function counters) and their companion control MSRs are duplicated for each logical processor. -- Andriy Gapon