Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 28 Feb 2008 19:35:55 +1100
From:      Peter Jeremy <peterjeremy@optushome.com.au>
To:        John Baldwin <jhb@freebsd.org>
Cc:        arch@freebsd.org
Subject:   Re: Cleaning up FILE in stdio..
Message-ID:  <20080228083555.GX83599@server.vk2pj.dyndns.org>
In-Reply-To: <200802271138.33979.jhb@freebsd.org>
References:  <200802262251.m1QMp7bV021709@hergotha.csail.mit.edu> <200802262355.16519.jhb@freebsd.org> <20080227065925.GK83599@server.vk2pj.dyndns.org> <200802271138.33979.jhb@freebsd.org>

next in thread | previous in thread | raw e-mail | index | archive | help

[-- Attachment #1 --]
On Wed, Feb 27, 2008 at 11:38:33AM -0500, John Baldwin wrote:
>> You could change _file from 'short' to 'unsigned short' without breaking
>> the ABI - this would allow either 65535 or 65536 file descriptors (I'm
>> not sure whether _file == -1 is special or not).  This would postpone
>> the problem for some time.
>
>-1 is used a lot in the stdio code for file's not backed by an fd.  My problem 
>though is that this doesn't help with existing binaries that are already 
>compiled (which is what I have to deal with).  Had fileno() not been inlined 
>I would have been ok, but that's pretty much done for me as far as my current 
>problem on 6.x.  Had I just been able to change FILE * and not had inlines, 
>then a new fopen would have worked fine in my case.

My suggestion was based on short and ushort having the same size and
(short)-1 and (ushort)65535 having the same bit pattern.  Any code
accessing fileno() should not be checking or validating the result
but just passing it to low-level I/O routines.  This would provide the
following:
               Existing code     New code
		   short          ushort
    FD         sign-extended   zero-extended
   -1              -1           65535
    0..32767        0..32767        0..32767
32768..65534   -32768..-2 [*]   32768..65534
 >65534          EMFILE            EMFILE

[*] This could potentially be fixed using libc or kernel shims.

>> e) Don asbestos underwear and re-arrange struct __sFILE to grow _file etc.
>
>We can't do e) because thanks to symbol versioning, 8.x and 9.x will have 
>libc.so.7, so a 7.0 binary will still use the brand new libc, so it has to 
>preserve the ABI of the currently exported fields pretty much forever.

Erk.  I forgot about that.

>think we can get away with renaming '_file' to '_ofile' and adding a new 'int 
>_file' at the bottom of the struct and making sure '_ofile' is always in sync 
>(when possible, truncated when _file is too bug).

Truncation opens up the possibility that old executables could fopen()
lots of files (without getting any indication of a problem) and then
use fileno() to reference a truncated _ofile - causing it to access
some totally unrelated file.  Admittedly, that is no worse than would
happen today.

>Also, I think we can do the new _file in HEAD for 8.0 w/o any worries.  I 
>don't think waiting until 9.0 buys anything there.

I was thinking in terms of changing _file to int without backward
compatibility.  I'm not sure that that could be done for 8.0 (though,
as you have pointed out, it can't be done at all).

>  Given that, I think I'd 
>rather just patch the current stable branches to handle the edge case better 
>and work on making _file an int in HEAD (with the ABI compat _ofile).

The EMFILE patch is definitely a good interim step and I support your
efforts in removing the limit on the number of open files.

-- 
Peter Jeremy
Please excuse any delays as the result of my ISP's inability to implement
an MTA that is either RFC2821-compliant or matches their claimed behaviour.

[-- Attachment #2 --]
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.4 (FreeBSD)

iD8DBQFHxnJr/opHv/APuIcRApt/AJ0Qt+qUxxS2/4DlxAtKZhmoDec/iACfbDW5
5lxxTVS2ZkAeLNjdZ2Pm0QI=
=QU7W
-----END PGP SIGNATURE-----

Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080228083555.GX83599>