From owner-freebsd-fs@FreeBSD.ORG Sun Nov 23 00:27:09 2014 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 82AE4E6A for ; Sun, 23 Nov 2014 00:27:09 +0000 (UTC) Received: from esa-annu.net.uoguelph.ca (esa-annu.mail.uoguelph.ca [131.104.91.36]) by mx1.freebsd.org (Postfix) with ESMTP id 30C01909 for ; Sun, 23 Nov 2014 00:27:08 +0000 (UTC) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: Ar8EAO4ocVSDaFve/2dsb2JhbABchDwEgwLQBQKBFgEBAQEBfYQCAQEBAwEjBFIbDgMDAQIBAgINGQIjLggGE4gsAwkJtxuPTQ2GQgEBAQEBAQEDAQEBAQEBARuBLY0hggk0B4J5gVUFjCWQR486gneGfoQbKjCBSIEDAQEB X-IronPort-AV: E=Sophos;i="5.07,439,1413259200"; d="scan'208";a="171780913" Received: from muskoka.cs.uoguelph.ca (HELO zcs3.mail.uoguelph.ca) ([131.104.91.222]) by esa-annu.net.uoguelph.ca with ESMTP; 22 Nov 2014 19:27:01 -0500 Received: from zcs3.mail.uoguelph.ca (localhost.localdomain [127.0.0.1]) by zcs3.mail.uoguelph.ca (Postfix) with ESMTP id 0AEE8B4034; Sat, 22 Nov 2014 19:27:02 -0500 (EST) Date: Sat, 22 Nov 2014 19:27:02 -0500 (EST) From: Rick Macklem To: Konstantin Belousov Message-ID: <933608854.5622292.1416702422030.JavaMail.root@uoguelph.ca> In-Reply-To: <20141122175531.GZ17068@kib.kiev.ua> Subject: Re: RFC: patch to make d_fileno 64bits MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Originating-IP: [172.17.91.209] X-Mailer: Zimbra 7.2.6_GA_2926 (ZimbraWebClient - FF3.0 (Win)/7.2.6_GA_2926) Cc: FreeBSD Filesystems , Kirk McKusick X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 23 Nov 2014 00:27:09 -0000 Kostik wrote: > On Sat, Nov 22, 2014 at 09:23:41AM -0800, Kirk McKusick wrote: > > > Date: Sat, 22 Nov 2014 17:34:27 +0200 > > > From: Konstantin Belousov > > > To: Rick Macklem > > > Subject: Re: RFC: patch to make d_fileno 64bits > > > Cc: FreeBSD Filesystems > > > > > > On Fri, Nov 21, 2014 at 06:45:52PM -0500, Rick Macklem wrote: > > >> Kostik wrote: > > >> What about old binaries that do getdirentries(2) and expect the > > >> old > > >> structure with 32bit d_fileno or the linux compatibility stuff? > > >> I suspect that there are some old staticly linked binaries out > > >> there > > >> that does/expects the old getdirentries. > > > > > > No, let me restate my position. There are two places for > > > backward > > > compatibility, one is in-kernel binary interface, and another is > > > applications ,i.e. KBI and ABI. > > > > > > My opinion is that we must provide strict backward ABI > > > compatibility > > > to have even right to be called useful OS. In particular, the > > > syscalls > > > like current getdirentries (156 and 196) providing 32-bit > > > inonums, must > > > be kept with their current binary contract. The userspace issues > > > do > > > not end there, but this is not the currently discussed item. > > > > > > On the other hand, providing KBI compat for filesystems which > > > work > > > right now with 32bit inode numbers, should not be done. I.e., no > > > VOP_READDIR_32INO(), all filesystems must be converted once. > > > Yes, I did understand that others felt having the 2 VOP_READDIR_XX() wasn't appropriate and that is fine with me. (I just did it to try and allow file systems to be converted when someone got around to it.) What I will probably do for my testing is implement a MNTK_DIRENT64 flag that file systems set when they are converted to generating the new dirent64. (I'm not sure if this could go into head, since I'm not sure if it would work for stacked file systems? Maybe the stacked file system could just set the flag if the flag is set for the underlying fs, but I'm not sure. Anyhow, the object will be to convert all file systems so this doesn't need to be in head.) > > > For syscalls 156 and 196 (and some more), the converter must be > > > written > > > in the vfs_syscalls.c which translates the new dirents into old > > > dirents, > > > at the level of best efforts. > > > > I believe that we are all in agreement with you on the kernel > > approach > > at this point. > Well, I think this was the Rick patch and proposal to have compat > ino32 > in kernel. > Yes, I agree to one VOP_READDIR() and a converter (already written) from new->old in vfs_syscalls.c. > > > > Do we have a way of versioning libc so that we can have the old > > version > > that provides the 32-bit version of the syscalls (156 and 196) > > along > > with 32-bit higher-level functions like fts and friends and then a > > new > > libc version that has the 64-bit version of the syscalls and other > > higher-level functions? > > We do not need several versions of libc. We support symbol > versioning, > i.e. we can have old getdirents symbol which resolves to syscall stub > for 196, and new getdirents for new syscall. > > It is somewhat convoluted feature, you could look at example in > sys/kern/sysv_*.c, for instance, freebsd7_shmctl and shmctl. Also > look at libc/include/compat.h. For pure usermode compat shims, > lib/libc/gen/fts-compat.c was already handled one time. > > I promise to write neccessary magic for libc versioning when needed. > As I explained before, unfortunately the libc is not the final point > for the userspace compat. > I'll keep plugging away at the dirent structure and am glad you guys are going to attack the tough userland parts. Good luck with it, rick