Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 31 Jul 2008 00:12:35 -0400
From:      "Michael B Allen" <ioplex@gmail.com>
To:        "Jilles Tjoelker" <jilles@stack.nl>
Cc:        freebsd-hackers <freebsd-hackers@freebsd.org>
Subject:   Re: Pls sanity check my semtimedop(2) implementation
Message-ID:  <78c6bd860807302112v22d07403hfbadfd6a9a10b0f0@mail.gmail.com>
In-Reply-To: <20080718155856.GA96280@stack.nl>
References:  <78c6bd860807121611w4f6ab44brbebfffea9929682a@mail.gmail.com> <20080718155856.GA96280@stack.nl>

next in thread | previous in thread | raw e-mail | index | archive | help
On Fri, Jul 18, 2008 at 11:58 AM, Jilles Tjoelker <jilles@stack.nl> wrote:
> On Sat, Jul 12, 2008 at 07:11:26PM -0400, Michael B Allen wrote:
>> Below is a semtimedop(2) implementation that I'm using for FreeBSD. I
>> was hoping someone could look it over and tell me if they think the
>> implementation is sound.
>
>> [snip semtimedop implementation that uses SIGALRM and relies on EINTR]
>
>> The code seems to work ok but when stressing the FreeBSD build of my app
>> I have managed to provoke errors related to concurrency (usually when a
>> SIGALRM goes off). The Linux build works flawlessesly so I'm wondering
>> about this one critical function that is different.
>
> In your implementation, the SIGALRM signal may happen before you even
> call semop(2). If so, most likely the semop(2) will hang arbitrarily
> long.

Indeed. And I reconnoiter that condition is likely to happen if called
with a sufficiently small timeout value.

> Another dirty fix is to try non-blocking semop(2) several times with
> sleeps in between.

Actually that seems to work pretty well.

If I force the nanosleep codepath to be used always, the application
actually works pretty well under load. I wonder if the overhead of
managing the signal and timer is worth the trouble.

For posterity, below is the current implementation of semtimedop(2)
for FreeBSD. Further ideas are welcome.

void
_timeval_diff(struct timeval *tv1, struct timeval *tv2, struct timeval *tvd)
{
    tvd->tv_sec = tv1->tv_sec - tv2->tv_sec;
    tvd->tv_usec = tv1->tv_usec - tv2->tv_usec;
    if (tvd->tv_usec < 0) {
        tvd->tv_usec = 1000000 - tvd->tv_usec;
        tvd->tv_sec--;
    }
}
void
signal_ignore(int s, siginfo_t *si, void *ctx)
{
}
int
_semtimedop(int semid, struct sembuf *array, size_t nops, struct
timespec *_timeout)
{
    struct timeval timeout, before, after;
    struct itimerval value, ovalue;
    struct sigaction sa, osa;
    int ret;

    if (_timeout) {
        timeout.tv_sec = _timeout->tv_sec;
        timeout.tv_usec = _timeout->tv_nsec / 1000;

        if (gettimeofday(&before, NULL) == -1) {
            errno = EFAULT;
            return -1;
        }

        if (timeout.tv_sec == 0 && timeout.tv_usec < 5000) {
            struct timeval tsleep, tend;
            struct sembuf wait;

            wait = *array;
            wait.sem_flg |= IPC_NOWAIT;

            tsleep.tv_sec = 0;
            tsleep.tv_usec = 1;

            timeradd(&before, &timeout, &tend);

            for ( ;; ) {
                struct timeval tnow, tleft;
                struct timespec ts;

                ret = semop(semid, &wait, nops);
                if (ret == 0) {
                    return 0;
                } else if (errno != EAGAIN) {
                    break;
                }

                if (gettimeofday(&tnow, NULL) == -1) {
                    errno = EFAULT;
                    break;
                }

                if (timercmp(&tnow, &tend, >=)) {
                    errno = EAGAIN;
                    break;
                }

                timersub(&tend, &tnow, &tleft);

                if (tsleep.tv_usec > tleft.tv_usec)
                    tsleep.tv_usec = tleft.tv_usec;

                ts.tv_sec = 0;
                ts.tv_nsec = tsleep.tv_usec * 1000;
                nanosleep(&ts, NULL);

                tsleep.tv_usec *= 10;
            }

            return -1;
        }

        memset(&value, 0, sizeof value);
        value.it_value = timeout;

        memset(&sa, 0, sizeof sa);
        sa.sa_sigaction = signal_ignore;
        sa.sa_flags = SA_SIGINFO;
        sigemptyset(&sa.sa_mask);
        sigaction(SIGALRM, &sa, &osa);

        if (setitimer(ITIMER_REAL, &value, &ovalue) == -1) {
            sigaction(SIGALRM, &osa, NULL);
            return -1;
        }
    }

    ret = semop(semid, array, nops);

    if (_timeout) {
        ret = setitimer(ITIMER_REAL, &ovalue, NULL);
        if (ret < 0)
            errno = EFAULT;

        sigaction(SIGALRM, &osa, NULL);
    }

    if (ret == -1) {
        if (_timeout) {
            struct timeval elapsed;

            if (gettimeofday(&after, NULL) == -1) {
            } else {

                _timeval_diff(&after, &before, &elapsed);

                if (timercmp(&elapsed, &timeout, >=))
                    errno = EAGAIN;
            }
        }

        return -1;
    }

    return 0;
}



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?78c6bd860807302112v22d07403hfbadfd6a9a10b0f0>