Date: Tue, 11 Oct 2011 11:42:08 -0600 (MDT) From: Ian Lepore <freebsd@damnhippie.dyndns.org> To: FreeBSD-gnats-submit@FreeBSD.org Subject: arm/161498: [patch] ARM RAS code can fail to restart an atomic sequence. Message-ID: <201110111742.p9BHg8kJ070893@revolution.hippie.lan> Resent-Message-ID: <201110111750.p9BHo2Ra074403@freefall.freebsd.org>
next in thread | raw e-mail | index | archive | help
>Number: 161498 >Category: arm >Synopsis: [patch] ARM RAS code can fail to restart an atomic sequence. >Confidential: no >Severity: critical >Priority: high >Responsible: freebsd-arm >State: open >Quarter: >Keywords: >Date-Required: >Class: sw-bug >Submitter-Id: current-users >Arrival-Date: Tue Oct 11 17:50:02 UTC 2011 >Closed-Date: >Last-Modified: >Originator: Ian Lepore <freebsd@damnhippie.dyndns.org> >Release: FreeBSD 8.2-STABLE arm >Organization: Symmetricom, Inc. >Environment: FreeBSD tflex 8.2-STABLE FreeBSD 8.2-STABLE #29: Tue Oct 11 13:32:35 UTC 2011 root@revolution.hippie.lan:/usr/obj/arm/usr/src/sys/TFLEX arm >Description: The RAS implementation for ARM can occasionally fail to restart a sequence after being interrupted, leading to a failure of atomic operations which manifests in ways such as granting the same pthread mutex to multiple threads concurrently. The "normal" RAS sequence goes like this: On entry, RAS_START = 0x00000000, RAS_END = 0xffffffff 1: Set RAS_START to address of Step 1 2: Set RAS_END to address of Step 4 3: Do the atomic operation 4: Set RAS_START to 0x00000000 5: Set RAS_END to 0xffffffff While the normal entry condition is for RAS_START/END to be 0x0/0xffffffff the code in PUSHFRAMEINSVC does not reliably enforce this (it sometimes leaves a stale value in RAS_END), which is the source of one of the two bugs in the current implementation. The other bug is that the PUSHFRAMEINSVC code compares the PC to RAS_END using signed-value logic, but proper operation of the RAS sequence requires that 0xffffffff be interpreted as the highest possible memory address, not -1. The first bug happens with interaction between a pair of threads doing RAS sequences at the same time. Thread 1 runs through step 4 of the sequence and gets interrupted before completing step 5. The PUSHFRAMEINSVC code sees a zero in RAS_START and does nothing; notably it does not change the value in RAS_END because START was zero. Thread 2 begins to run and gets interrupted between steps 1 and 2. Now PUSHFRAMEINSVC sees a non-zero RAS_START, and the RAS_END value is whatever thread 1 left there when it got interrupted. The behavior now depends on whether the RAS sequence in thread 2 is at a higher or lower memory address than the one in thread 1. In one case it accidentally works right, in the other case an interrupt between steps 1 and 2 wouldn't lead to a proper restart because the PC is not between RAS_START and RAS_END. But because RAS_START was non-zero when thread 2 got interrupted, RAS_START/END get reset, so when thread 2 gets resumed at step 2 it never reloads RAS_START, and another interrrupt in the sequence leads to misbehavior. The second bug (signed value logic) is conceptually simpler, here is a sequence that triggers that bug: On entry, RAS_START/END are 0x0/0xffffffff. A thread enters a RAS sequence and completes step 1, then gets interrupted before completing step 2. The PUSHFRAMEINSVC code sees a non-zero RAS_START so it compares the PC to RAS_END. Because of the signed-value logic, the PC will never be between RAS_START and -1, so the PUSHFRAMEINSVC code doesn't set up for a restart at the RAS_START location. When the thread eventually resumes at step 2, RAS_START will be zero, and if it gets interrupted again before the end of the atomic operation it would not get restarted. The attached patch addresses these two conditions as follows: Now the PUSHFRAMEINSVC macro always resets RAS_START to zero if it has a non- zero value, and always sets RAS_END to 0xffffffff if it has a different value. This differs from the original in that the value of RAS_START isn't used to decide whether to reset RAS_END. The two decisions are made independently, which allows them to reliably enforce the entry-condition for the next thread regardless of where in the RAS sequence the current thread was interrupted. The macro now uses conditional execution based on an unsigned compare of the PC against RAS_END to decide whether to set up for a restart (strhi rather than strgt instruction). In the case where RAS_START has been set and RAS_END is still at 0xffffffff, this will result in setting up for a restart at the beginning of the sequence so that an interrupt on the second time through will still result in atomic behavior (I.E. yet another restart for a 3rd attempt at the atomic op). Note that there is some typical ARM conditional-execution trickiness in the new logic: the strhi instruction that changes the saved PC to the RAS_START address will execute only if the carry flag is set and the zero flag is not. If the RAS_START value is zero then the compare with the RAS_END address is skipped and the zero flag is still set from the RAS_START compare, causing the strhi to get skipped based soley on the RAS_START value, regardless of the carry flag. If RAS_START is non-zero then the RAS_END compare happens and the strhi executes based on the zero/carry status of the RAS_END compare. Note also that the patched code assumes that if RAS_START is non-zero, the userland PC in PUSHFRAMEINSVC must be greater than RAS_START. This is implicit in the definition of RAS, which prohibits function calls during a RAS sequence. This allows the code to avoid a specific check for the PC being greater than RAS_START if RAS_START is non-zero, and explains why the series of instructions that used to be conditional on 'gt' are now 'ne'. Thanks go to Symmetricom, Inc, for sponsoring this work, and to my collegue there, Lars Boehnke, for crafting test code to demonstrate the original problem and prove the fix works. >How-To-Repeat: The following small program will demonstrate the problem using the atomic_cmpset_int() function. The failure will be provoked most quickly in the presence of lots of interrupts, so running with a kernel that has HZ set unreasonably high will help demonstrate the problem. --- atomictest.c begins here --- /* * atomictest.c - Demonstrate non-atomicity of ARM RAS code. * * Build using: make LDFLAGS=-pthread atomictest * * Run in an environment with lots of interrupts going on (HZ=4096 does * the trick nicely for me). On a 180mhz rm9200 with HZ=100 it takes * hours-to-months to see a failure; with HZ=4096 it takes minutes. */ #include <pthread.h> #include <signal.h> #include <stdio.h> #include <time.h> #include <unistd.h> #include <machine/atomic.h> static int sStopThreads = 0; static const int sNumThreads = 3; static unsigned int sAtomicLock = 0; // The implementation is defined in a macro because the RAS sections need to // be at different addresses to demonstrate the problem. // CMPSET test - // If a thread is able to "lock" sAtomicLock, it should be able to "release" // it. Failure to release indicates multiple threads were able to change the // value of sAtomicLock at the same time. #define THREAD_IMP(thrIdx) \ unsigned int locked, released; \ \ while (! sStopThreads) \ { \ locked = atomic_cmpset_int(&sAtomicLock, 0, (thrIdx+1)); \ released = atomic_cmpset_int(&sAtomicLock, (thrIdx+1), 0); \ if (locked != released) \ { \ printf("*** ERROR: locked=%u, released=%u\n", locked, released); \ sStopThreads = 1; \ break; \ } \ } \ \ return NULL; \ void* ThreadFunc1(void* varg) { THREAD_IMP(0); } void* ThreadFunc2(void* varg) { THREAD_IMP(1); } void* ThreadFunc3(void* varg) { THREAD_IMP(2); } void SigHandler(int sig) { if (sig == SIGINT) sStopThreads = 1; return; } int main(int argc, const char* argv[]) { pthread_t threads[sNumThreads]; size_t idx; time_t start = time(NULL); signal(SIGINT, SigHandler); pthread_create(&threads[0], NULL, ThreadFunc1, NULL); pthread_create(&threads[1], NULL, ThreadFunc2, NULL); pthread_create(&threads[2], NULL, ThreadFunc3, NULL); while (! sStopThreads) sleep(1); // make sure all threads have stopped for (idx = 0; idx < sNumThreads; idx ++) pthread_join(threads[idx], NULL); printf("Elapsed %lld seconds\n", (int64_t)(time(NULL)-start)); return 0; } --- atomictest.c ends here --- >Fix: The following patch fixes the problems described above. The changes to sysarch.h are just updates to the comments to help future maintainers. These diffs were developed for freebsd 8.x but apply cleanly to -current. These changes have been extensively tested at Symmetricom; we are currently deploying products that include these fixes. --- ras-current.diff begins here --- Index: sys/arm/include/asmacros.h =================================================================== RCS file: /local/base/FreeBSD-CVS/src/sys/arm/include/asmacros.h,v retrieving revision 1.8 diff -u -p -r1.8 asmacros.h --- sys/arm/include/asmacros.h 5 Feb 2008 10:22:33 -0000 1.8 +++ sys/arm/include/asmacros.h 11 Oct 2011 17:02:52 -0000 @@ -71,9 +71,8 @@ ldr r0, =ARM_RAS_START; \ mov r1, #0; \ str r1, [r0]; \ - ldr r0, =ARM_RAS_END; \ mov r1, #0xffffffff; \ - str r1, [r0]; + str r1, [r0, #4]; /* * PULLFRAME - macro to pull a trap frame from the stack in the current mode @@ -118,23 +117,22 @@ stmia sp, {r0-r12}; /* Push the user mode registers */ \ add r0, sp, #(4*13); /* Adjust the stack pointer */ \ stmia r0, {r13-r14}^; /* Push the user mode registers */ \ - mov r0, r0; /* NOP for previous instruction */ \ - ldr r5, =ARM_RAS_START; /* Check if there's any RAS */ \ - ldr r3, [r5]; \ - cmp r3, #0; /* Is the update needed ? */ \ - ldrgt lr, [r0, #16]; \ - ldrgt r1, =ARM_RAS_END; \ - ldrgt r4, [r1]; /* Get the end of the RAS */ \ - movgt r2, #0; /* Reset the magic addresses */ \ - strgt r2, [r5]; \ - movgt r2, #0xffffffff; \ - strgt r2, [r1]; \ - cmpgt lr, r3; /* Were we in the RAS ? */ \ - cmpgt r4, lr; \ - strgt r3, [r0, #16]; /* Yes, update the pc */ \ - mrs r0, spsr_all; /* Put the SPSR on the stack */ \ - str r0, [sp, #-4]! - + mov r0, r0; /* NOP for previous instruction */ \ + ldr r5, =ARM_RAS_START; /* Retrieve global RAS_END and */ \ + ldr r4, [r5, #4]; /* reset it to point at the */ \ + cmp r4, #0xffffffff; /* end of memory if necessary; */ \ + movne r1, #0xffffffff; /* leave value in r4 for later */ \ + strne r1, [r5, #4]; /* comparision against PC. */ \ + ldr r3, [r5]; /* Retrieve global RAS_START */ \ + cmp r3, #0; /* and reset it if non-zero. */ \ + movne r1, #0; /* If non-zero RAS_START and */ \ + strne r1, [r5]; /* PC was lower than RAS_END, */ \ + ldrne r1, [r0, #16]; /* adjust the saved PC so that */ \ + cmpne r4, r1; /* execution later resumes at */ \ + strhi r3, [r0, #16]; /* the RAS_START location. */ \ + mrs r0, spsr_all; \ + str r0, [sp, #-4]! + /* * PULLFRAMEFROMSVCANDEXIT - macro to pull a trap frame from the stack * in SVC32 mode and restore the saved processor mode and PC. Index: sys/arm/include/sysarch.h =================================================================== RCS file: /local/base/FreeBSD-CVS/src/sys/arm/include/sysarch.h,v retrieving revision 1.6 diff -u -p -r1.6 sysarch.h --- sys/arm/include/sysarch.h 12 Feb 2009 23:23:30 -0000 1.6 +++ sys/arm/include/sysarch.h 11 Oct 2011 17:02:56 -0000 @@ -42,9 +42,13 @@ * The ARM_TP_ADDRESS points to a special purpose page, which is used as local * store for the ARM per-thread data and Restartable Atomic Sequences support. * Put it just above the "high" vectors' page. - * the cpu_switch() code assumes ARM_RAS_START is ARM_TP_ADDRESS + 4, and + * The cpu_switch() code assumes ARM_RAS_START is ARM_TP_ADDRESS + 4, and * ARM_RAS_END is ARM_TP_ADDRESS + 8, so if that ever changes, be sure to * update the cpu_switch() (and cpu_throw()) code as well. + * In addition, code in arm/include/atomic.h and arm/include/asmacros.h + * assumes that ARM_RAS_END is at ARM_RAS_START+4, so be sure to update those + * if ARM_RAS_END moves in relation to ARM_RAS_START (look for occurrances + * of ldr/str rm,[rn, #4]). */ #define ARM_TP_ADDRESS (ARM_VECTORS_HIGH + 0x1000) #define ARM_RAS_START (ARM_TP_ADDRESS + 4) --- ras-current.diff ends here --- >Release-Note: >Audit-Trail: >Unformatted:
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?201110111742.p9BHg8kJ070893>