From owner-svn-doc-all@freebsd.org Sun Jul 19 19:59:02 2015
Return-Path:
- Process-Context Identifiers (PCIDs) is a feature of the
+ A Process-Context Identifier (PCID) is a performance enhancing
+ feature of the
Translation Lookaside Buffer (TLB) on Intel processors,
introduced with the Sandy Bridge micro-architecture. It
allows the TLB to
simultaneously cache translation information for several
address spaces, and gives an opportunity for the operating
- system context switch code to avoid flushing the TLB on the
+ system context switch code to avoid flushing the TLB upon
process switch. Each cached translation is tagged with some
context identifier, and at context switch time, the operating
system instructs the processor which context is becoming
active. The feature slightly reduces context switch time by
- avoiding flush, and more importantly, it reduces the warm-up
- period for the thread after a context switch.
&os; already used PCID, but the existing implementation
had several shortcomings. The amd64 pmap (the
@@ -1113,9 +1114,9 @@
on the context switch. The bitmap was used to direct
Inter-Processor Interrupts to the marked CPU when the operating
system needed to perform TLB invalidation. The most
- important deficiency of the implementation is the increase of
- TLB invalidation IPIs since the bitmap could only grow until
- full TLB shootdown is performed. It increases the TLB rate,
+ important deficiency of the implementation was the increase of
+ TLB invalidation IPIs, since the bitmap could only grow until
+ full TLB shootdown was performed. It increased the TLB rate,
which negated the positive effects of avoiding TLB flushes on
large machines. Secondarily, the bitmap maintenance in both
the pmap and the context code was quite complicated, leading
@@ -1125,13 +1126,14 @@
The new PCID implementation uses an algorithm described in
the U. Vahalia book "UNIX Internals: The New Frontiers". The
algorithm is already used, for example, by the MIPS pmap for
- assigning the ASIDs to software-managed TLB entries. The pmap
+ assigning Address Space Identifiers (ASIDs) to software-managed
+ TLB entries. The pmap
maintains a per-CPU generation count, which is assigned to the
next unused PCID when the context is activated on CPU. TLB
invalidation includes resetting the generation count, which
- causes reallocation of PCID when a context switch is
+ causes reallocation of the PCID when a context switch is
performed. As result, the new implementation issues exactly
- the same amount of shootdown IPIs as pmap which does not
+ the same amount of shootdown IPIs as a pmap which does not
utilize PCID.
Another change included with the PCID rewrite is a move of
@@ -1139,9 +1141,8 @@
making the algorithm easier to understand and validate.
Measurements done with hwpmc(4) on a Haswell machine
- indicated that the new implementation reduced the amount of
- data TLB misses up to 10 times, without an impact on the IPI
- counters.
+ indicated that the new implementation reduced the TLB miss rate by
+ up to 10 times, without an increase in TLB shootdown IPIs.
The rewrite was committed to HEAD at r282684.
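
The generation-count scheme described in the report text above can be illustrated with a small, self-contained sketch. This is not the code committed in r282684. All names here (sketch_cpu, sketch_pmap, sketch_activate, sketch_invalidate), the CPU count, and the choice to reserve PCID 0 are assumptions made for illustration. The only hardware fact relied on is that a PCID is 12 bits wide.

```c
#include <stdbool.h>
#include <stdint.h>

#define SKETCH_NCPU       4     /* hypothetical number of CPUs */
#define SKETCH_PCID_FIRST 1     /* assumption: PCID 0 is kept reserved */
#define SKETCH_PCID_LIMIT 4096  /* PCIDs are 12 bits wide */

/* Per-CPU allocator state (hypothetical layout). */
struct sketch_cpu {
	uint32_t gen;        /* current generation, never 0 */
	uint32_t next_pcid;  /* next unused PCID on this CPU */
};

/* What one address space remembers about one CPU. */
struct sketch_pcid {
	uint32_t gen;        /* generation at assignment, 0 = invalid */
	uint32_t pcid;       /* PCID assigned on that CPU */
};

struct sketch_pmap {
	struct sketch_pcid pcids[SKETCH_NCPU];
};

static struct sketch_cpu sketch_cpus[SKETCH_NCPU];

void
sketch_cpu_init(int cpu)
{
	sketch_cpus[cpu].gen = 1;
	sketch_cpus[cpu].next_pcid = SKETCH_PCID_FIRST;
}

/*
 * Called at context switch.  Stores the PCID to load in *pcid and
 * returns true when the generations match, meaning translations
 * cached under that PCID may be reused without a flush.  Returns
 * false when a PCID had to be (re)allocated, in which case the
 * caller must flush stale entries tagged with it.
 */
bool
sketch_activate(struct sketch_pmap *pm, int cpu, uint32_t *pcid)
{
	struct sketch_cpu *cs = &sketch_cpus[cpu];
	struct sketch_pcid *pp = &pm->pcids[cpu];

	if (pp->gen == cs->gen) {
		*pcid = pp->pcid;
		return (true);
	}
	if (cs->next_pcid == SKETCH_PCID_LIMIT) {
		/* Out of PCIDs: a new generation invalidates them all. */
		cs->next_pcid = SKETCH_PCID_FIRST;
		if (++cs->gen == 0)
			cs->gen = 1;
	}
	pp->pcid = cs->next_pcid++;
	pp->gen = cs->gen;
	*pcid = pp->pcid;
	return (false);
}

/*
 * TLB invalidation for a CPU where the address space is not active:
 * resetting the stored generation forces reallocation on the next
 * activation, so no extra IPI is needed for that CPU.
 */
void
sketch_invalidate(struct sketch_pmap *pm, int cpu)
{
	pm->pcids[cpu].gen = 0;
}
```

In this sketch, activating the same address space twice on one CPU reuses its PCID, while an intervening sketch_invalidate() makes the next activation allocate a fresh one. Nothing has to be tracked in a shared bitmap.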