Date: Tue, 03 Jul 2001 03:17:12 -0700 From: Terry Lambert <tlambert2@mindspring.com> To: "E.B. Dreger" <eddy+public+spam@noc.everquick.net> Cc: Chris Costello <chris@calldei.com>, freebsd-smp@FreeBSD.ORG, freebsd-hackers@FreeBSD.ORG Subject: Re: libc_r locking... why? Message-ID: <3B419BA8.3D93EB5A@mindspring.com> References: <Pine.LNX.4.20.0106291334350.7587-100000@www.everquick.net>
next in thread | previous in thread | raw e-mail | index | archive | help
"E.B. Dreger" wrote: > [ libc_r locks don't assert "lock", not MP-safe ] > > So the "lock" prefix is the only way to enforce cache coherency? > Do you have handy a good reference on IPIs, other than the kernel > APIC code (and, of course, Google and NorthernLight searches)? See other posting. > Good to know, but, I'm not using libc_r... I was looking at > existing code to help me double-check mine as I go. I'm > synchronizing processes with a "giant lock" token that each > process cooperatively passes to the next... to simplify: > > who_has_lock++ ; > who_has_lock %= process_count ; Your unsimplified assembly is not happy, and neither is this. You want to use a LOCK CMPXCHG to implement your mutexes; the LOCK prefix makes it a barrier instruction, which is needed to ensure that two processors don't operate on their L1 cache contents, and then both attempt to write back, where one wins the race, but both think they own the lock. > Each processes' critical path first checks to see if it holds > the token; if so, it performs the tasks that require it, such as > locking a finer-grained lock or mutex. It then passes the token, > and continues through its critical path. You aren't going to be able to safely hand this off if they are running on two different processors in user space. You _must_ implement an MP safe mutex. > If a thread has nothing to do, I nanosleep(2) to prevent the critical > path from degenerating to an extended spin. I'm considering using > socketpair(2)s, with a process blocking indefinitely on read(2) until > another process write(2)s to awaken it... This would work, but it will destroy your SMP scaling that you want to achieve, since you will effectively serialize your processes running. > > If you "need" kernel threads, look at the Linux kernel > > threads in the ports collection (it's a kernel module > > that builds and installs as a package). You probably > > don't, since performance of kernel threads is really only > > Correct me if I'm wrong, but the only place in my model that really > might benefit from kthreads would be the scheduling? i.e., rather > than screwing around with nanosleep(2) or socket calls, I could cut > the cruft and interact more directly with the scheduler via kthread > mechanisms? Not really. That's the problem with Linux threads: you don't get thread-group affinity, so if you are running anything on your system besides your threaded application, you tend to take full heavy-weight context switches. Some work was done on the Linux scheduler to try and get this affinity, but you really can't do that sanely in the scheduler: it's the wrong place to attack the problem. The planned FreeBSD approach can fix this, if it's implemented correctly, since as long as you have user space threads that are ready to run, you will run out your entire quantum, and do light weight switches from one thread to another within the same process. > > about a 20% increment, if you implement them the SVR4 or > > Solaris (pre-2.7) or Linux way. It's probably better to > > implement with FreeBSD threads as they currently exist, > > and get massive SMP scalability when KSE's are brought > > into the source tree. > > KSEs... where can I read up? http://people.freebsd.org/~jasone/kse/ -- Terry To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-smp" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?3B419BA8.3D93EB5A>