Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 23 Jun 2016 15:42:32 -0700
From:      Sean Chittenden <sean@chittenden.org>
To:        Maxim Sobolev <sobomax@freebsd.org>
Cc:        Konstantin Belousov <kostikbel@gmail.com>, Adrian Chadd <adrian@freebsd.org>, performance@freebsd.org, John Baldwin <jhb@freebsd.org>, Alan Somers <asomers@freebsd.org>, Alan Cox <alc@rice.edu>, Alan Cox <alc@freebsd.org>, freebsd-current <freebsd-current@freebsd.org>, "current@freebsd.org" <current@freebsd.org>
Subject:   Re: PostgreSQL performance on FreeBSD
Message-ID:  <C06B11C3-5D40-43AF-8975-880F272933C5@chittenden.org>
In-Reply-To: <CAH7qZfvy46wWcrjz-ihA%2B%2BEYktm7PqGoJhj1a7hdYWssiEXFuA@mail.gmail.com>
References:  <20140627125613.GT93733@kib.kiev.ua> <CAJ-Vmom-M=R=FaBfHE5c2%2BYxW0SLmJTdFJD8tW4_aOD7MDNwzA@mail.gmail.com> <CAJ-Vmomt=WYjct%2BzsTbHuryxqYp7ELyS52LOb4NEsfENQ1yj1w@mail.gmail.com> <1603235.2ShtoCfSqO@ralph.baldwin.cx> <CAH7qZfuAtHtUG92wEjPhOZ=BGgyFS728uigjJoD0pG%2B-mtUSww@mail.gmail.com> <20160622100241.GM38613@kib.kiev.ua> <CAH7qZfvy46wWcrjz-ihA%2B%2BEYktm7PqGoJhj1a7hdYWssiEXFuA@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help

Small nit:

PostgreSQL used SYSV because it allowed for the detection of dead processes.  If you `kill -9`’ed a process, PostgreSQL can detect that and then shut down and perform an automatic recovery.  In this regard, sysv is pretty clever.  The move to POSIX shared mem was done for a host of reasons, but it means that you don’t have to adjust your SYSV limits.  My understanding from a few years ago is that there is still a ~64KB SYSV memory segment that is still used to act as the latch to signal if a process was killed, but all of the shared buffers are stored in posix mmap’ed regions.

At this point in time this could be replaced with kqueue(2) EVFILT_PROC, but no one has done that yet.

-sc



--
Sean Chittenden
sean@chittenden.org

> On Jun 22, 2016, at 07:26 , Maxim Sobolev <sobomax@freebsd.org> wrote:
> 
> Konstantin,
> 
> Not if you do sem_unlink() immediately, AFAIK. And that's what PG does. So
> the window of opportunity for the leakage is quite small, much smaller than
> for SYSV primitives. Sorry for missing your status update message, I've
> missed it somehow.
> 
> ----
>                mySem = sem_open(semname, O_CREAT | O_EXCL,
>                                                 (mode_t) IPCProtection,
> (unsigned) 1);
> 
> #ifdef SEM_FAILED
>                if (mySem != (sem_t *) SEM_FAILED)
>                        break;
> #else
>                if (mySem != (sem_t *) (-1))
>                        break;
> #endif
> 
>                /* Loop if error indicates a collision */
>                if (errno == EEXIST || errno == EACCES || errno == EINTR)
>                        continue;
> 
>                /*
>                 * Else complain and abort
>                 */
>                elog(FATAL, "sem_open(\"%s\") failed: %m", semname);
>        }
> 
>        /*
>         * Unlink the semaphore immediately, so it can't be accessed
> externally.
>         * This also ensures that it will go away if we crash.
>         */
>        sem_unlink(semname);
> 
>        return mySem;
> ----
> 
> -Max
> 
> On Wed, Jun 22, 2016 at 3:02 AM, Konstantin Belousov <kostikbel@gmail.com>
> wrote:
> 
>> On Tue, Jun 21, 2016 at 12:48:00PM -0700, Maxim Sobolev wrote:
>>> Thanks, Konstantin for the great work, we are definitely looking forward
>> to
>>> get all those improvements to be part of the default FreeBSD kernel/port.
>>> Would be nice if you can post an update some day later as to what's
>>> integrated and what's not.
>> I did posted the update several days earlier.  Since you replying to this
>> thread, it would be not unreasonable to read recent messages that were
>> sent.
>> 
>>> 
>>> Just in case, I've opened #14206 with PG to switch us to using POSIX
>>> semaphores by default. Apart from the mentioned performance benefits,
>> SYSV
>>> semaphores are PITA to deal with as they come in very limited quantities
>> by
>>> default. Also they might stay around if PG dies/gets nuked and prevent it
>>> from starting again due to overflow. We've got some quite ugly code to
>>> clean up those using ipcrm(1) in our build scripts to deal with just
>> that.
>>> I am happy that code could be retired now.
>> 
>> Named semaphores also stuck around if processes are killed without cleanup.
>> 
>> 
> _______________________________________________
> freebsd-performance@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-performance
> To unsubscribe, send any mail to "freebsd-performance-unsubscribe@freebsd.org"




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?C06B11C3-5D40-43AF-8975-880F272933C5>