Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 02 Aug 2012 22:17:52 +0800
From:      David Xu <listlog2011@gmail.com>
To:        freebsd-arch@freebsd.org
Subject:   Re: short read/write and error code
Message-ID:  <501A8C10.6010106@gmail.com>
In-Reply-To: <20120802135103.GX2676@deviant.kiev.zoral.com.ua>
References:  <5018992C.8000207@freebsd.org> <20120801071934.GJ2676@deviant.kiev.zoral.com.ua> <20120801183240.K1291@besplex.bde.org> <20120801162836.GO2676@deviant.kiev.zoral.com.ua> <20120802040542.G2978@besplex.bde.org> <20120802100240.GV2676@deviant.kiev.zoral.com.ua> <501A69EB.9000701@gmail.com> <20120802135103.GX2676@deviant.kiev.zoral.com.ua>

next in thread | previous in thread | raw e-mail | index | archive | help
On 2012/8/2 21:51, Konstantin Belousov wrote:
> On Thu, Aug 02, 2012 at 07:52:11PM +0800, David Xu wrote:
>> On 2012/8/2 18:02, Konstantin Belousov wrote:
>>> diff --git a/sys/kern/sys_pipe.c b/sys/kern/sys_pipe.c
>>> index 338256c..1a61b93 100644
>>> --- a/sys/kern/sys_pipe.c
>>> +++ b/sys/kern/sys_pipe.c
>>> @@ -1286,13 +1286,13 @@ pipe_write(fp, uio, active_cred, flags, td)
>>>   	}
>>>   
>>>   	/*
>>> -	 * Don't return EPIPE if I/O was successful
>>> +	 * Don't return EPIPE if any byte was written.
>>> +	 * EINTR and other interrupts are handled by generic I/O layer.
>>> +	 * Do not pretend that I/O succeeded for obvious user error
>>> +	 * like EFAULT.
>>>   	 */
>>> -	if ((wpipe->pipe_buffer.cnt == 0) &&
>>> -	    (uio->uio_resid == 0) &&
>>> -	    (error == EPIPE)) {
>>> +	if (uio->uio_resid != orig_resid && error == EPIPE)
>>>   		error = 0;
>>> -	}
>>>   
>>>   	if (error == 0)
>>>   		vfs_timestamp(&wpipe->pipe_mtime);
>> I dislike the patch, if I thought it is right, I would have already
>> submit such
>> a patch. Now let us see why your patch is wore than my version (attached):
>> -current:
>>      short write is done, some bytes were sent
>>      dofilewrite returns -1, errno is EPIPE
>>      dofilewrite sends SIGPIPE
>>      application is killed by SIGPIPE
>> -my attached version:
>>     short write is done, some bytes were sent
>>     dofilewrite return number of bytes sent, errno is zero
>>     dofilewrite sends SIGPIPE.
>>    application is killed by SIGPIPE
> I cannot believe that
>> -you version:
>>    short write is done, some bytes were sent.
>>    dofilewrite returns number of bytes sent,  errno is zero.
>>    dofilewrite does not send SIGPIPE signal
>>    application is not killed
>>    application does not check return value from write()
>>    application thinks it is successful, application does other things,
>>    application might begin a bank account transaction.
>>    ...
>>    application never comes back...
> And I do think that this behaviour is right.
This is wrong. An application may don't know a file handle is pipe, 
because in theory
write to a file always successfully, the syscall is blocked until all 
data is written, an
abnormal write (short write) should kill the application, this is how 
SIGPIPE works,
this is the default action.

>
> This only different from the CURRENT by the point where the race between
> writer and exiting reader becomes observable. CURRENT reports that reader
> side was closed earlier, while my patch pretends that it exited slightly
> later.
This is why it is wrong.

>> my patch is more compatible with -current. if application does not
>> setup a signal handler for SIGPIPE, when short write happens, it is killed,
>> it is same as -current. if the application set up a signal handler for
>> the signal,
>> it always should check the return value from write(), this is how
>> traditional
>> code works.
> Now assume that application set the SIGPIPE handler, and did a short
> write. How can it interpret the delivered signal, if the following
> syscall return indicates that the write was successful ?
check if return value is great than zero and less than the requested,
if this is true, it is a short write, don't you check short write when 
writing code to
work with pipe or socket ? do you always think writing to pipe or socket 
will be
successfully ?

> Let me repeat myself: there are two objections against your patch.
> First is the signal generation when no EPIPE is returned to application
> (observable as I described in the previous paragraph).
> Second is that having the change in generic i/o layer requires impossible
> behaviour from the filesystems (ability to roll back failed uio transfer).
>> in your patch, short write does not kill application,  you can not assume
>> that the application will request a next write() on the pipe, and you hope
>> the second write to kill the application, but there is always exception,
>> the next write does not happen,  application works on other things.
>> This is too incompatible with -current.
>>
> This is right behaviour. If the application wrote all data it wanted to write,
> and went doing something else, then there is no reason to kill it.
I don't agree.




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?501A8C10.6010106>