Date: Wed, 30 Jan 2008 11:22:51 -0800 From: Landon Fuller <landonf@bikemonkey.org> To: Daniel Eischen <eischen@vigrid.com> Cc: freebsd-threads@freebsd.org Subject: Re: threads/119920: fork broken in libpthread Message-ID: <D3DFD2DE-E6A3-4373-AC37-42C3CAB9B963@bikemonkey.org> In-Reply-To: <Pine.GSO.4.64.0801292320260.15096@sea.ntplx.net> References: <200801240850.m0O8o2JQ023500@freefall.freebsd.org> <4798564B.7070500@elischer.org> <Pine.GSO.4.64.0801240957550.16059@sea.ntplx.net> <488DBC6A-CF33-4E50-B1BB-C396C8957F92@bikemonkey.org> <Pine.GSO.4.64.0801291611130.12689@sea.ntplx.net> <892A73B3-0114-4718-ABC0-CADD45D9D0FA@bikemonkey.org> <Pine.GSO.4.64.0801292320260.15096@sea.ntplx.net>
next in thread | previous in thread | raw e-mail | index | archive | help
This is an OpenPGP/MIME signed message (RFC 2440 and 3156)
--Apple-Mail-14-75471841
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
On Jan 29, 2008, at 20:23, Daniel Eischen wrote:
> On Tue, 29 Jan 2008, Landon Fuller wrote:
>
>>
>> On Jan 29, 2008, at 13:13, Daniel Eischen wrote:
>>> There is a bug somewhere else or something is stomping
>>> on the thread's lockuser. It is allocated once when the
>>> thread is created and should never be null thereafter.
>>> Hence, it should never be malloc'd and the reinit should
>>> be sufficient.
>>
>> I'm totally unfamiliar with KSE, so perhaps this a stupid question
>> -- it seems to solve the issue locally, so I'll ask it --
>> Why not place the fork() code inside of _kse_critical_enter /
>> _kse_critical_leave, to ensure upcalls are blocked while re-
>> initializing in the child process post-fork?
>
> That just prevents an upcall from happening (which my patch solves),
> but doesn't prevent the corruption of the lockuser or lock.
My assumption was that lockuser was being corrupted in a post-fork
upcall; after wrapping fork in kse_critical_enter(), and can not
reproduce the lockuser-reallocation I was seeing before.
Is my assumption that kse_critical_enter() prevents any other code
from being run during the critical section incorrect?
--- thread/thr_fork.c.orig 2008-01-30 11:08:45.000000000 -0800
+++ thread/thr_fork.c 2008-01-30 11:09:36.000000000 -0800
@@ -93,12 +93,16 @@
if (_kse_isthreaded() != 0) {
_spinlock(__malloc_lock);
}
+
+ kse_critical_t crit = _kse_critical_enter();
+
if ((ret = __sys_fork()) == 0) {
/* Child process */
errsave = errno;
/* Kernel signal mask is restored in
_kse_single_thread */
_kse_single_thread(curthread);
+ _kse_critical_leave(crit);
/* Run down atfork child handlers. */
TAILQ_FOREACH(af, &_thr_atfork_list, qe) {
@@ -107,6 +111,7 @@
}
_thr_mutex_reinit(&_thr_atfork_mutex);
} else {
+ _kse_critical_leave(crit);
if ((_kse_isthreaded() != 0) && (__malloc_lock !=
NULL)) {
_spinunlock(__malloc_lock);
}
> My patch does solve this in -current, but -stable probably lacks
> a few other patches. It (-stable) really needs all of -current's
> code, not just this patch.
Is this change viable for an errata fix for 6.3?
-landonf
--Apple-Mail-14-75471841
content-type: application/pgp-signature; x-mac-type=70674453;
name=PGP.sig
content-description: This is a digitally signed message part
content-disposition: inline; filename=PGP.sig
content-transfer-encoding: 7bit
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (Darwin)
iD8DBQFHoM6NlplZCE/15mMRAiL0AJ44TC7y1TPRcETD6w5aXo+/bSlPtACffYQ0
hwOy02U/Y5KBmHcX4UuYMB4=
=6kkq
-----END PGP SIGNATURE-----
--Apple-Mail-14-75471841--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?D3DFD2DE-E6A3-4373-AC37-42C3CAB9B963>
